Site Reliability Engineer (f/m) at Wimdu GmbH (Berlin, Germany)
Your Tasks:
Desired Skills and Experience
- Build, automate, and manage key parts of the global Wimdu production environment
- Help define a solid strategy for ensuring future growth of our platform and help with our migration to AWS
- Optimize existing systems and components for performance and stability
- Forge close relationships with development teams, and empower them to take ownership of their own parts of the infrastructure in a SOA environment
- Collaborate with developers and QA engineers on release processes, deployment and scaling
- Build and maintain core infrastructure components like monitoring services and automation recipes that our development teams can use
- Proactively monitor end-to-end system performance (client to database) to identify bottlenecks and potential failures, and help developers improve their code based on your analysis
- Help with maintaining our office infrastructure and lend an occasional hand to colleagues needing IT support
- Assume strong ownership for what you build - including being on-call for it
- 3+ years experience managing a high traffic consumer-facing website
- Expert level Linux ( Ubuntu/Debian) system administration skills, and understanding of TCP/IP and networking concepts (e.g. firewalls, load balancers, routing, DNS)
- Experience with cloud-based and on premise systems, provisioning tools, virtualization and monitoring
- You are very familiar with AWS
- You have first experience with Puppet/Chef
- Experience with Automation test
- Ideally, development experience in Ruby, Java, Python, Shell or other high-level languages
- You are open, communicative, and enjoy working as part of a diverse team