Site Reliability Engineer, Ad Platforms

With Apple in Austin TX US

More jobs from Apple

Posted on May 25, 2021

About this job

Job type: Full-time
Role: System Administrator
Industry: Consumer Electronics
Company size: 10k+ people
Company type: Public


java, python

Job description

At Apple, we work every day to create products that enrich people’s lives. Our Advertising Platforms group makes it possible for people around the world to easily access informative and imaginative content on their devices while helping publishers and developers promote and monetize their work. Today, our technology and services power advertising in Search Ads in App Store and Apple News. Our platforms are highly-performant, deployed at scale, and setting new standards for enabling effective advertising while protecting user privacy. We are seeing an experienced and talented Application Site Reliability Engineer with a passion for designing and building reliable systems, and partnering closely across with engineers across platforms, development, quality, and security to deliver advertisements in a reliable and scalable way that results in awesome user experiences. We achieve this mission via tooling, automation, and defining and implementing standards, patterns, processes, and education to our partner engineering teams.

You’ll be part of the team delivering hosting infrastructure for Ad Platforms, supporting the continued growth of Ad Platforms, by helping to ensure that Ad Platforms can continue to scale up and grow. With global deployments, and fast growth, we need solutions to deliver new capabilities for Apple’s customers. You work to ensure high availability/high resiliency patterns for application owners to build on, solve operational problems with our applications and infrastructure, and help drive the continued evolution of our hosting infrastructure. Your duties will include: - Own the end-to-end reliability of Ad Platforms - Implement and improve our infrastructure and application monitoring and observability capabilities that results in improving our reliability - Engage with application engineering teams to improve service operability and reliability, on-call efficiencies, drive incident management, and post-mortem analysis - Drive production readiness, and improve key areas like capacity planning, configuration management, and observability - Design and improve architectures of new and existing systems based on the principles of reliability and high availability with extensive logging and observability - Develop expertise in Apple Infrastructure and best practices and bring that to Ad Platforms to run a world class distributed systems - Create tooling and automation to improve the operations and operability of our infrastructure and applications - Build frameworks that enable engineers to interact with Apple Infrastructure - Provide operational and on-call support for Cloud and on-premises services and infrastructure, collaborating closely with Ad Platforms application and platforms teams as well as other key Apple platforms and infrastructure teams

Skills & requirements

  • 5+ years in reliability engineering, Devops, or systems engineering
  • Excellent experience building and supporting internet-facing production services and distributed systems at scale
  • Automation mindset and solid programming skills in one of Python, Go, Java, or C++
  • Experience designing, building, and operating solutions built in AWS, on-premises, and hybrid
  • Proficiency in infrastructure as code with Terraform and/or CloudFormation
  • In-depth experience with containers technologies, and orchestration platforms such as Kubernetes and Nomad
  • Expertise in deploying, supporting, and monitoring new and existing services, platforms, and application stacks
  • Expertise in operating Linux based systems, with a solid understanding of its internals
  • Excellent problem solving ability across the technology stack--utilizing creative and innovating thinking
  • Strong understanding of security principals and design
  • Significant experience partnering closely with developers on service reliability design and support
  • Extreme sense of ownership, customer service, and integrity
  • Excellent communication skills, partnering and collaboration mindset

Bachelor's degree in Computer Science/Engineering discipline or equivalent. Master's degree preferred

Apply here