Site Reliability Engineer - NYT Beta
The New York Times is hiring a Site Reliability Engineer to join a small team within NYT Beta. About the Team The NYT Beta team sits at the intersection of our technology organization and our newsroom. As a member of the team, you will not only interact with exceptional engineers, designers and product managers, but also members of the best editorial operation in the world. Over the last two years, Beta has launched numerous products including NYT Now and NYT Cooking, named Best apps of 2014 by Apple, and the popular website cooking.nytimes.com. We are a small, nimble team that cares as much about the product as the underlying technology. We have weekly product show-and-tell sessions, regular hack weeks, and frequently bring in industry leaders to discuss their work in product and technology development. Role Description This is an opportunity to help establish the SRE presence at an early stage in a growing NYT Beta organization, and contribute to the success of various new products with unique problem sets. The Site Reliability Engineering team’s mission is to ensure NYT Beta products are performant, highly available, scalable, and secure. Our goal is to accelerate product engineering productivity without sacrificing reliability. We achieve this by defining design principles, process and tooling for product teams to take operational responsibility for their software. Responsibilities:
- Work closely with product engineering teams to help design systems for performance, fault tolerance, and scalability
- Develop the tools and training needed for product engineers to assume operational responsibility for their own software
- Monitor and audit production application stacks for opportunities to improve performance and capacity utilization
- Troubleshoot, isolate and fix production issues along with product engineers and help prevent them from happening again
-
Participate in on-call rotation Required experience:
- Proficiency in at least one programming language, and willing to learn Go (our primary language), experience with Ruby is a plus
- Operating and debugging production systems
- Designing and implementing infrastructure, deployment, monitoring, and logging tools
- Designing pragmatic systems with an eye for performance, reliability and security
- Linux environment and fundamentals
- Designing web and mobile stacks at scale Tech we use: Go, Ruby, Bash, AWS, GCP, Terraform, Packer, Consul, Docker, Postgres, MySQL, Redis, Memcached, Varnish, Elasticsearch
Desired Skills and Experience
See application page for details