We’re looking for a Site Reliability Engineer (SRE) to keep Pinterest running strong and scaling fast. As a tech lead for the Site Reliability Engineering team, you’ll drive reliability, performance, efficiency and security for Pinterest’s Commerce and Ads platform. You’ll work on building and operating the next generation of commerce and monetization platforms and services.

What You’ll Do

  • Design and deliver software systems, tools and infrastructure to improve the scalability, availability, performance, latency, and efficiency of Pinterest’s commerce, ads serving, logging and data pipelines
  • Influence and create new designs, architectures, standards and methods for large-scale distributed systems, with operability as a core tenet
  • Collaborate with developers in the deployment and scaling of new product features to facilitate rapid iteration and massive growth
  • Perform deep dives into reliability issues and partner with software and systems engineers across the organization to produce and roll out fixes
  • Engage in service capacity planning, demand forecasting, software performance analysis and system tuning

What We’re Looking For

  • Over 7 years of experience operating and scaling services in a distributed, internet-scale environment
  • Proficiency in Linux/Unix/BSD and a programming language such as Python, Java or Go
  • Systematic problem-solving approach, coupled with a strong sense of ownership and drive
  • Enthusiasm for working quickly and collaboratively with talented people

Desired Skills and Experience

See application page for details