Site Reliability Engineer
Key Qualifications
- 4 years of experience with UNIX and TCP/IP network fundamentals
- 1 year of experience developing software, implementing API functionality using REST, Thrift, JSON, or similar
- 2+ years of experience using configuration management software like Chef, Puppet, or Ansible
- 3+ years of work experience with, or strong knowledge of, continuous integration and continuous deployment
- Excellent troubleshooting skills across multiple interdependent services
- 2+ years of experience working with basic large-scale internet service architectures (such as load balancing, LAMP, CDNs)
- 2+ years of experience handling configuration and maintenance of common applications such as Apache, memcached, MySQL/MariaDB, Couchbase and RabbitMQ
- Experience with Logstash/Graphite/InfluxDB/Grafana/Cabot and other diagnostic and alerting tools
- Bias for action; bias for shipping
Duties
- Extend automation of the infrastructure platform and drive the development of new features of our self-service portal.
- Work on bringing Client engineering practices into infrastructure operations.
- Enable services to utilize the vast resources available in our data centers.
- Write and review code, develop documentation and capacity plans.
- Share an on-call rotation and serve as an escalation contact for service incidents.
- Partner with the best engineers in the industry on the coolest code and systems around.
Education
Bachelor’s degree or equivalent in Computer Science, Physics, Mathematics, Engineering, Chemistry or other hard science
Desired Skills and Experience
See application page for details