Who you are: Got a lifelong passion for all things tech? Would you jump at the chance to see your work directly impact millions of IT pros who use our services to do their jobs? Spiceworks wants you! We’re looking for a Site Reliability Engineer to handle administration, automation, and improvements of an ever-evolving infrastructure. You’ll bridge the gap between development and operations as you help determine what can be launched and when. Think you’re up for the task? It’s time to apply!

Desired Skills and Experience

  • Handle the administration of Spiceworks production and non-production infrastructure located in datacenter, cloud, and in-office facilities
  • Work with the latest technologies and systems engineering concepts to help design and build our new container-based environment
  • Work with the development teams to deploy new services and troubleshoot infrastructure issues
  • Collect and analyze system and application metrics, utilizing StatsD and Graphite
  • Dive into systems administration of high volume, web-based services delivery environments
  • Bachelor’s degree in IS/IT, Computer Science, or equivalent experience
  • Advanced knowledge of Linux operating systems, preferably RedHat-based
  • Expertise configuring and troubleshooting Apache, NGINX, and HAProxy
  • Supporting services such as PostgreSQL, Redis, RabbitMQ, and ElasticSearch
  • Experience scripting in at least one of the following languages: Ruby, Bash, Perl, Python
  • Expertise with configuration management systems such as Puppet (preferred), Ansible, and/or Chef
  • Experience with package management in multi-datacenter environments
  • Experience with virtualized environments (KVM or Xen)
  • Hands-on experience with cloud environments such as AWS, CloudStack, or OpenStack
  • Intermediate knowledge of networking and load-balancing concepts
  • Experience with monitoring systems, such Nagios and Sensu
  • Experience with service discovery tools such as Consul is a plus
  • Experience collecting and aggregating log data in an ELK stack
  • Experience with Cassandra or other NoSQL databases is a plus
  • Experience working with Docker containers and Kubernetes is a plus
  • Be part of weekly on-call rotation
  • Ability to write clear and thorough documentation