Desired Skills and Experience

  • Build platforms upon our cloud based infrastructure for all of engineering to leverage
  • Gather requirements and work to drive the adoption of the tools and services you build
  • Produce robust solutions that handle failure and are self-healing
  • Work directly with engineering teams building tooling ensuring we are fully automated
  • Participate in an on-call rotation with other members of the Site Reliability Team
  • Participate in code reviews and develop and maintain documentation
  • Debug and identify root cause code and systems issues
  • Mid-level to advanced coding skills
  • Ability to learn new languages (Go, Scala, Python, Ruby)
  • Understanding of computational complexity & working understanding of algorithms
  • Experience with Cloud Computing platforms (particularly AWS) a plus
  • Experience interacting with RESTful APIs
  • Ability to decompose complex systems and find failure scenarios
  • Experience with Linux and a good understanding of its fundamentals
  • A good understanding of large-scale distributed systems in practice, including multi-tier architectures, application security, monitoring and storage system