Desired Skills and Experience
- Continuous integration and delivery of the current platform on top of AWS with the aim of improving operations reliability, performance and cost
- Defining and deploying monitoring and logging systems to help the business and engineering better understand the performance of the live system
- Encourage best practices and systems operations in an always-up, always-available service
- Enhance current systems, creating a highly available, scalable, and self-healing infrastructure
- Take part in automation process, help and coach developers about getting code they’ve written into production safely and reliably
- Participate in technical meetings to understand business requirements in order to plan and integrate a system that is viable and scalable and ensures business continuity
- Exercise a high level of autonomy within the problem space, but also feel confident that you have the support you need to deliver
- At least 1 year of professional experience with site reliability or DevOps
- Have written some Dockerfiles from scratch
- Experience or interest in working with Kubernetes
- Experience with maintaining infrastructure as code
- Ability to use a wide variety of open source technologies and cloud servicesA working understanding of code and script (PHP, node.js and/or Golang)
- Application programming experience in PHP/Python/node.js including writing tested code
- Detailed experience with MySQL and performance profiling of queries
- Experience with database performance and operations
- Experience with automation/configuration management using Ansible/Salt/Puppet
Apply