Desired Skills and Experience
- Improve our developer workflows and continuous integration tools and processes
- Identify ways to reduce release cycle times and execute them through automation and process definition
- Define standards and tools for logging, monitoring, alerting and operational metric dashboarding
- Develop a unified process for managing operational incidents in our test and production environments in collaboration with our production support and site reliability engineering teams
- Work with our portfolio management team to identify infrastructure growth needs and decommissioning opportunities
- Build tools to support infrastructure provisioning and lifecycle management needs
- Plan infrastructure migrations and support delivery teams in executing them
- Monitor change management and incident management SLOs and find opportunities to train and support delivery teams in consistently achieving these objectives
- Own disaster recovery planning and work with delivery and production support teams to improve our DR story
- Several years’ work experience as a hands-on DevOps or service engineer, building software with software engineering principles, tools and workflows to solve operational problems
- Proficiency with scripting languages (shell, Perl, Ruby, Groovy, etc.) and system programming on Linux and OS X
- Hands-on experience with Git, Jenkins, XL Release, Splunk, AppDynamics and other key tools of the trade
- Hands-on experience managing deployments in cloud infrastructure platforms
- Exceptional written and verbal communication skills, including the ability to explain technical concepts to product managers and business people in ways that are meaningful to them
- Highly organized and able to multitask effectively
- Able to weigh several, often conflicting, constraints and make rapid decisions in a high-pressure environment
- A thorough understanding of continuous integration and continuous delivery
- Joy in mentoring others and helping them grow in their careers
- The insight to notice problems in how we work, and the initiative to fix them
- The ability to see and to understand the larger context in which your team works and to craft solutions within that context
- Adaptability to changes in processes, organizational structures and business conditions
- A strong belief in your personal responsibility for ensuring quality craftsmanship
- An open, collaborative spirit
- Familiarity with Docker and Kubernetes
- Experience in a fast-paced startup environment