Desired Skills and Experience

  • Continuous integration and delivery of the current platform on top of AWS with the aim of improving operations reliability, performance and cost
  • Defining and deploying monitoring and logging systems to help the business and engineering better understand the performance of the live system
  • Encourage best practices and systems operations in an always-up, always-available service
  • Enhance current systems, creating a highly available, scalable, and self-healing infrastructure
  • Take part in automation process, help and coach developers about getting code they’ve written into production safely and reliably
  • Participate in technical meetings to understand business requirements in order to plan and integrate a system that is viable and scalable and ensures business continuity
  • Exercise a high level of autonomy within the problem space, but also feel confident that you have the support you need to deliver
  • At least 1 year of professional experience with site reliability or DevOps
  • Have written some Dockerfiles from scratch
  • Experience or interest in working with Kubernetes
  • Experience with maintaining infrastructure as code
  • Ability to use a wide variety of open source technologies and cloud servicesA working understanding of code and script (PHP, node.js and/or Golang)
  • Application programming experience in PHP/Python/node.js including writing tested code
  • Detailed experience with MySQL and performance profiling of queries
  • Experience with database performance and operations
  • Experience with automation/configuration management using Ansible/Salt/Puppet

Apply