Desired Skills and Experience

  • Work with other SREs to build a comprehensive set of tools to automate and monitor our production infrastructure
  • Work with Engineering to build resilient, operable, self-healing services
  • Participate in reasonable on-call rotations with the rest of Engineering
  • Managed groups of servers, preferably in AWS, at scale
  • Reasonably deep knowledge of Linux and internet technologies
  • Proficient in modern scripting languages like Python or Ruby
  • Configuration management tools like Ansible or Salt
  • Used advanced metrics to solve hard problems
  • Experience managing Big Data or high-throughput distributed systems like Hadoop and Kafka
  • Experience with continuous integration
  • Contribution to open source projects
  • An active interest in containerization technologies such as Docker and/or Kubernetes
  • Acts like a team
  • Avoids doing things twice
  • Solves hard problems for tomorrow, not just for today
  • Prefers fixing problems to complaining about them
  • Investigates, considers and adopts new technology where it makes sense
  • Doesn’t tolerate brilliant jerks

Apply