Site Reliability Engineer (m/f) at LOVOO GmbH (Dresden, Germany)
Your tasks
- Create, monitor, and scale our operations efforts through automation procedures and configuration management
- Define and evangelize cloud-related optimizations and best practices to improve reliability and performance
- Collaborate with developers and QA engineers on the release process, deployment, and scaling
- Engage with product engineering teams to triage production outages, and coordinate and resolve action items to improve ongoing reliability
- Research new techniques and explore the latest technologies
- Periodic on-call duty
Desired Skills and Experience
- Good understanding of Linux, networking, and databases, and experience using deployment automation tools
- Proven knowledge of cloud architecture concepts and styles (SaaS, PaaS and IaaS), containerization (Docker), Kubernetes and related concepts
- Comfortable working within a software development lifecycle, including version control (Git), automated testing, and code reviews
- Coding skills in Go or Python
- Ability to stay focused while firefighting, and the ambition to make firefighting a thing of the past
- A transparent, open communication style and a strong team-player mentality
- A very good command of English and basic knowledge of German are preferred