Site Reliability Engineer, Compute and Storage Group
In this highly visible role, you will have the responsibility of ensuring that Apple’s world class Silicon Engineering Group will have the infrastructure and tools needed to engineer and design the world’s most advanced silicon devices and products. You will utilize your deep understanding of building and maintaining Linux compute clusters, storage systems, web infrastructure & applications, database servers, tool/license management, monitoring systems, work flow optimization, and directory services. You will utilize your extensive communication skills to interface with internal teams, enabling Apple’s world class product development.Key Qualifications
- Typically requires at least 5+ years of experience in Linux or UNIX systems administration in a large engineering or R&D environment and demonstrated skills in the following:
- Linux (RHEL/CentOS preferred)
- NFS and NAS appliances (NetApp preferred)
- Layer 2 / Layer 3 networking (Arista or Cisco preferred)
- Scripting in shell, Perl, Python or Ruby
- Revision control systems (SVN, git, Perforce)
- Centralized configuration management (Puppet, cfengine)
- Software/tool compilation and installation
- Flexlm and similar licensing systems
- Monitoring systems such as Nagios, Zenoss, Groundwork
- LDAP (OpenLDAP, DSEE, OpenDirectory)
- IPAM with DNS (BIND) and DHCP
- Must be analytical and possess strong organizational/problem-solving skillsDescriptionYou will be responsible for supporting internal engineering teams by enhancing, maintaining, performance tuning, and planning capacity of compute clusters. Your role will directly impact the development, enhancement and maintenance of compute cluster queuing, storage systems, network interconnects, monitoring, LAMP stack, and load balancing needs. EducationMS/BS Degree or equivalent
Desired Skills and Experience
See application page for details