Desired Skills and Experience

  • Design and execute our Kubernetes clusters strategy to help our development teams deliver faster and more reliably
  • Drive adoption of Kubernetes and Kubernetes best practices across the company and industry
  • Create and/or provision reliable tools and infrastructure that enable rapid iteration amongst the product, research and development teams
  • Automate our infrastructure following the pattern Infrastructure as Code
  • Monitor, measure and troubleshoot infrastructure and services
  • Optimize business continuity capabilities and drive down incident recovery times
  • Capacity planning and management
  • Provide support during office hours
  • Mentor other members of the team (both inside and outside the SRE team)
  • At least 5 years of experience deploying, monitoring and troubleshooting multi-tier SOA applications and distributed systems at scale
  • Software development with any or all these programming languages: Ruby, Go, Java, Javascript, Python
  • Instrumentation for status and trend monitoring experience (CloudWatch, Prometheus, Graphite, etc.)
  • Experience with modern application system log management (Syslog, SumoLogic, Fluentd, Loggly, Splunk, etc.)
  • Container or cloud orchestration experience with at least one scheduler (Kubernetes, Docker Swarm, Mesos, etc.)
  • Highly developed cloud literacy with strong knowledge of AWS, GCE and Azure
  • Broad experience with Linux kernel and shell, TCP/IP and HTTP
  • Designing networks and systems for security, encryption, performance and agility
  • Backup and restoration automation, business continuity planning and testing
  • Database administration experience with MySQL replication and high availability
  • Knowledge of networking and security best practices with software defined networks
  • Experience with big data, streaming and search systems like Cassandra, Hadoop, Spark, Kafka and ElasticSearch
  • Competitive salary and stock options
  • Flexible time off policy; we believe everyone needs to recharge
  • Your choice of operating system and hardware
  • Annual trips to Spain (if working remotely)
  • Benefits vary based on location

Apply