Site Reliability Engineer - Software Delivery

With Apple in Cupertino CA US

More jobs from Apple

Posted on April 28, 2021

About this job

Job type: Full-time
Role: System Administrator
Industry: Consumer Electronics
Company size: 10k+ people
Company type: Public

Job description

Our team is on a mission to deliver reliable services that help engineers at Apple do their work faster than ever before. Allowing rapid prototyping of ideas and code can help improve our products for other engineers and our customers. We are looking for a site reliability engineer to join our efforts. In this role you will be developing tooling, monitoring, and improving distributed services that help improve software build and test cycles. You would join a small, tight-knit group of engineers who build exciting new technology (and have fun while doing so). We are applying unique and innovative ideas to old problems and welcome new contributors who are excited by the opportunity to significantly impact the shape and direction of our product. We believe that every engineer brings unique skills and perspective. Far more than any specific experience or skill, we are looking for engineers who want to build great software, learn and develop their skills, and ultimately enable others to develop amazing applications.

You will be working hand in hand with the engineering team to combine established distributed system techniques and site reliability techniques to achieve high performance, availability, and reliability. In this role you will be expected to analyze problem requirements and drive results within the team by using existing tooling or generating new tooling while helping define long short term goals. You will be most successful here as a flexible and proactive site reliability engineer who thrives in a supportive, respectful, and balanced work environment with interesting and challenging problems to tackle. You have excellent judgement and integrity with the ability to make timely and sound decisions. You know how to drive constructive discussion of design points, learn from your team, and use your experience to advocate and teach others. We welcome all applicants, but familiarity with the following areas will help you come up to speed fastest on our project: - Existing modern distributed systems. - Familiar with application and service monitoring tools and techniques. - Involvement with incident management and response - Familiarity with Cassandra / Swift / C++ - Familiar with virtualization technology In your role as a site reliability engineer on our team: - You’ll spend a majority of your working on tooling and contributing to SRE standards used across the organization. - You’ll collaborate with your teammates to help refine problem requirements and work with teams to implement long lasting solutions. - You’ll engage with other SRE teams to help identify issues and establish long term relationships that include knowledge sharing. - You’ll provide valuable feedback loops to engineering teams backed by application metrics and log data. - You’ll drive areas of SRE focus (resiliency, redundancy, performance) by collaborating in planning meetings. - You’ll use modern continuous delivery development practices. - You’ll develop expertise and be given ownership over components critical to Apple's success. - You’ll contribute to engineering a high-performance system through algorithm design, optimization, and performance testing. - You’ll design new ways to scale Apple's build systems to ever increasing problem sizes. - You’ll prioritize and schedule issues, estimate scope of work, and investigate new issue reports in your areas of responsibility. - You’ll analyze and improve the operability of our server software. - You’ll provide and be given technical guidance, software architectural review, and mentorship. - You’ll have access to the engineers who create Apple's full technology, tools, and operating systems stack. Here you’ll find that: - We’re a worldwide team responsible for various properties in Apple's Software Engineering organization. - We strongly value the security and privacy of our users. - We collaborate extensively with other SRE teams inside Apple. - We prioritize developing automation and infrastructure to ease our own testing and development. - We practice a blameless approach to operations and outages, and work together to define safety mechanisms and processes. - We have a wide range of experience, from engineers who’ve just joined to those who’ve thrived at Apple their whole careers. - We provide a peaceful work environment where you can choose when to socialize and when to close your office door and focus. - We have an encouraging and growth-focused workplace culture. - We value integrity and respect for others, and take the safety and comfort of our employees seriously. - We encourage you to use accrued vacation time and maintain a healthy work-life balance. - We offer excellent benefits.

Skills & requirements

  • Positive and respectful attitude.
  • Strong interest in building reliable services.
  • Ability to efficiently design and develop new technologies in a changing environment.
  • Experience building and maintaining infrastructural distributed systems such as orchestration systems, distributed databases, or network proxies. Experience should include a strong understanding of distributed system fundamentals like consistency and availability, networking subsystems and implementation techniques, as well as security.
  • Excellent collaborative skills, with strong written and verbal communication.

Apply here