System Reliability Engineer- Retail Store Apps

With Apple in Cupertino CA US

More jobs from Apple

Posted on April 24, 2021

About this job

Job type: Full-time
Industry: Consumer Electronics
Company size: 10k+ people
Company type: Public


java, python, sql

Job description

We are a diverse collection of thinkers and doers, continually reimagining our products, systems, and practices to help people do what they love in best user friendly efficient way. Apple is a deeply collaborative place, where everything we create is the result of people in different roles and teams working together to make each others ideas stronger. That same passion for innovation that goes into our products also applies to our practices, strengthening our commitment to leave the world better than we found it. Today, the Retail Engineering provides one of the best Apple Experience to Customers in the world, and operates in multiple countries worldwide. The system reliability engineering team within Apples Retail Store apps organization supports a wide variety of mobile and web apps used in Apples retail stores. The team is dedicated towards providing a world class experience to our end users by minimizing outages and by collaborating with product owners in an effective manner to understand user engagement on our app suites. We are looking for a software engineer with a curious mind and the energy and passion to build the next generation of SRE tools and processes within our organization

The SRE team is a new group formed under the Retail Store Apps organization with the goal of providing stability to users of our apps in apple stores across the world. The candidate would work closely with product owners and app owners to establish SLOs and identify areas of improvement across the app suite and support the roll out of new features across multiple regions and Geo. They would manage offshore and onsite support team in handling queries and problems reported by users from Apple stores. The candidate would also be involved in building robust monitoring and alerting systems for our suite of apps and Create tools to automate troubleshooting processes thereby reducing the downtime of our critical apps We're looking for a hardworking and passionate person to join this amazing team, if you feel this is you, we'd love to hear from you.

Skills & requirements

  • Should be proficient in Java - Oracle based app architecture.
  • Proficient in object-oriented programming and distributed systems
  • Proficient in navigating Linux based applications and networking protocols
  • Experience in Kafka, Cassandra, Couchbase/Elastic search, Redis and other no sql databases
  • Proficient is using logging tools like Splunk to do analysis and create reports
  • Experience in automation scripting using Python and Shell
  • Strong Experience working on Containers/Docker based applications
  • Should have experience in end to end development of Java based applications
  • Should have worked in supporting enterprise applications in a production environment that spans multiple timezones and languages
  • Should have worked with offshore teams - to facilitate round the clock application support
  • Should have experience of supporting and maintaining high impact applications that require high availability and low downtime

BS in Computer Science or equivalent

Apply here