Siri Operations Engineer at Apple (Cambridge, UK)
We are looking for an individual to be a part of a dynamic global operations team focused on automation development to support the highly successful Siri application. This position would support our engineering teams, runtime SRE, as well as our High Performance Compute engineering teams to ensure maximum systems reliability across the Siri stack. This position is specifically for someone experienced in building Operation tools for data analysis, automation, and monitoring coupled with a strong background in systems administration. This fast paced position offers potential job growth opportunities as you continue to develop operational tools and enhance process. You will be able to work with next generation technologies at a very large scale where everyone on this global team is a contributor and your enthusiasm and ideas are welcome. Note - the role can be based remotely but the successful candidate will be required to work from the Cambridge office at least 3 days a week.
Description
The Siri Operations Engineering team is focussed on the delivery of high quality code and application support. This position includes responsibilities for 3 core areas of Siri Operations (Production SRE, Dev Ops and Build, and HPC). The Production SRE duties revolve around supporting the massively scaled production footprint of Siri. For the Build/DevOps role, you will assist in automation tooling and supporting Siri developers to ensure changes are making their way to the various environments we support. The HPC area will involve debugging complex issues on our large grid and our Hadoop cluster which is used for analysis and enhancing the Siri application. You should be confident in the following areas: * Extensive communication and coordination between multiple engineering teams gathering input to design and build operations and automation tools * Multi-task multiple development projects and manage time accordingly - Develop automation solutions for repetitive operational tasks/processes * Support the build process for local engineering teams - Support and trouble shoot issues for data science teams using High Performance Compute resources * Design and develop dashboard and data visualizations for both engineering and management - Identify deficiencies in process/tools/etc. to improve performance and resolution time - 24x7 on-call rotation for issue escalations
Desired Skills and Experience
- Solid Systems administration experience - Linux/Networking/File system
- Proficient in troubleshooting wide range of issues across multiple technologies
- Comfortable writing scripts and applications in a variety of languages
- Experience with big Data technologies like Hadoop, Hbase, SGE, and file systems like GPFS and HDFS
- Experienced with monitoring, log aggregation, and visualization tools (nagios, splunk, logstash, graphite, grafana, Kibana)
- Experience with Puppet/Chef
- Experience using GIT
- Experience with Build Automation (Jenkins, Maven, TeamCity)
- Familiar with Continuous Delivery pipelines and concepts.
- Strong verbal and written communication skills