Desired Skills and Experience

  • Troubleshoot and debug run-time issues
  • Automate operation, installation and monitoring of the ecosystem components/platforms
  • Implement OS and hardware level optimizations
  • Provide operations documentation to educate peer teams
  • Design and deploy solutions for problems such as high availability, elastic load distribution and high throughput
  • Focus on automation, including deployment and configuration management, quality assurance (functional and capacity testing), and automated response to problems
  • 3+ years of experience programming in Python or Ruby
  • Demonstrated experience working with Linux systems
  • Familiarity with Git
  • Familiarity with configuration management tools such as Chef, Puppet, Ansible or SaltStack
  • Experience programming in C/C++, Java, Go, Perl, Scala or JavaScript
  • Familiarity with monitoring tools such as Splunk, ELK, Grafana, Nagios
  • Practical knowledge of networking protocols such as TCP, UDP and IP
  • Familiarity with virtualization technologies such as Vagrant, Terraform, VMware, KVM
  • Knowledge of cloud technologies such as OpenStack, AWS, Rackspace, Cloud Foundry, OpenShift, WSO2
  • Experience with big data technologies such as Hadoop, Spark, Cassandra
  • Familiarity with containerization technologies such as Docker, Mesos, CoreOS, Kubernetes