Site Reliability Engineer (f/m) at Wimdu GmbH (Berlin, Germany)

Your Tasks:

Desired Skills and Experience

Build, automate, and manage key parts of the global Wimdu production environment
Help define a solid strategy for ensuring future growth of our platform and help with our migration to AWS
Optimize existing systems and components for performance and stability
Forge close relationships with development teams, and empower them to take ownership of their own parts of the infrastructure in a SOA environment
Collaborate with developers and QA engineers on release processes, deployment and scaling
Build and maintain core infrastructure components like monitoring services and automation recipes that our development teams can use
Proactively monitor end-to-end system performance (client to database) to identify bottlenecks and potential failures, and help developers improve their code based on your analysis
Help with maintaining our office infrastructure and lend an occasional hand to colleagues needing IT support
Assume strong ownership for what you build - including being on-call for it
3+ years experience managing a high traffic consumer-facing website
Expert level Linux ( Ubuntu/Debian) system administration skills, and understanding of TCP/IP and networking concepts (e.g. firewalls, load balancers, routing, DNS)
Experience with cloud-based and on premise systems, provisioning tools, virtualization and monitoring
You are very familiar with AWS
You have first experience with Puppet/Chef
Experience with Automation test
Ideally, development experience in Ruby, Java, Python, Shell or other high-level languages
You are open, communicative, and enjoy working as part of a diverse team