Desired Skills and Experience
- Design, develop and implement solutions that improve the stability, scalability, availability and latency of the IT Infrastructure services;
- Solve problems and incidents that impact our employees' experience, and build solutions and automation that address the root cause to prevent them from happening again;
- Build, enhance and maintain tooling and scripts to automate repetitive tasks;
- Manage issues proactively, monitor user experience and identify opportunities to automate remediation of issues;
- Help create structural solutions instead of workarounds;
- Create integrations between multiple applications and services, both on premises and in the cloud;
- Take ownership of one or more services and have the freedom to do what is best for our business and customers;
- Build effective monitoring of system health and behavior, and jump in to handle outages;
- Create system health and performance dashboards that provide both high-level and detailed views into monitored services;
- Collaborate in the design and development of business-critical systems to meet user needs and to respond to and anticipate technological advancements;
- Mentor, coach and guide more junior colleagues through technical challenges;
- Share the on-call rotation and be an escalation contact for incidents.
- Strong proficiency in at least one programming language, for example .NET/C#, Python and/or Java;
- Proven experience with shell scripting (PowerShell/Bash) and automation;
- Experience with designing, implementing and maintaining complex and scalable system infrastructures;
- Strong Windows and/or Linux administration and troubleshooting skills;
- Ability to understand and formulate meaningful business metrics;
- Creative and not afraid to step outside of your comfort zone;
- Fluent in English, both spoken and written.
- Logging infrastructure and tools such as Logstash, Elasticsearch, Kibana, Graphite, Splunk, and HDFS;
- Configuration management and provisioning tools such as Puppet, Chef and DSC;
- Experience with orchestration tools;
- Experience with RESTful APIs or webhooks;
- Load balancing and identity management solutions;
- Additional experience in Networking, Security or Storage is a plus.