Summary:

Come work within the Technology department at the Wikimedia Foundation!  We are administering a public OpenStack cloud (Infrastructure as a Service) with a modern Platform as a Service (Kubernetes) running on it.  We are dedicated to supporting developers within and outside of the Wikimedia Foundation.   Candidates need to be comfortable sharing ideas, providing guidance, following instructions, mentoring volunteers, and communicating in public and asynchronous ways (mailing lists/forums/irc). Our team works remotely and so can you!

 Responsibilities:

Desired Skills and Experience

  • Perform day-to-day operational tasks on Wikimedia’s Cloud Services infrastructure (deployment, maintenance, configuration, troubleshooting)
  • Support volunteer and staff developers using Infrastructure as a Service (IaaS) and Platform as a Service (PaaS) products
  • Implement and utilize configuration management and deployment tools (Puppet, Kubernetes)
  • Assist in the architectural design of new services and making them operate at scale
  • Assist in or lead incident response, diagnosis and followup on system outages or alerts across our stack
  • Bachelor’s degree and 5+ years related work experience; or equivalent work experience; or Master’s degree and 3+ years related work experience
  • Minimum of 5+ years of professional experience with infrastructure support and Linux
  • Solid development history with interpreted languages and web stack technologies.  
  • Experience managing modern distributed container cluster management systems (Primarily Kubernetes but also Docker Swarm, Mesos, …)
  • Minimum of 3 years of experience with Open Source configuration management and orchestration tools (Puppet, Ansible, Chef, SaltStack, …)
  • Experience managing an elastic computing environment (Openstack, Cloudstack, …)
  • On-call support and off-hours coverage in a 24x7 environment
  • Solid understanding of networking and TCP/IP fundamentals
  • Ability and ambition to support staff and volunteer developers inside and outside of the Wikimedia Foundation
  • Strong verbal and written proficiency with the English language
  • Experience interacting with open source software projects and communities
  • Experience with Golang
  • Experience with advanced distributed storage and database systems (Swift, Ceph, Cassandra…)
  • Low level systems troubleshooting and debugging skills (CPU/memory profiling, C/C++ experience, in-depth Linux knowledge)
  • Experience with the use, maintenance and configuration of monitoring, metrics and logging infrastructure (Icinga/Nagios, Prometheus, Grafana, Graphite, Logstash/Kibana, etc.)