Desired Skills and Experience

  • Design, develop and implement solutions that improve the stability, scalability, availability and latency of the IT Infrastructure services;
  • Solve problems and incidents that impact our employees experience and build solutions and automation to prevent them from happening again (root cause);
  • Build, enhance and maintain tooling and scripts to automate repetitive tasks;
  • Manage issues proactively, monitor user experience and identify opportunities to automate remediation of issues;
  • Support in creating structural solutions instead of workarounds;
  • Create integrations between multiple applications and services, both on premise and in the cloud;
  • Take ownership of one or more services and have the freedom to do what is best for our business and customers;
  • Build effective monitoring to monitor systems health and behavior, and jump in to handle outages;
  • Create system health/performance dashboards to provide both high level and detailed views into monitored services;
  • Collaborate in the design and development of business-critical systems to meet user needs and respond to/anticipate on technological advancements;
  • Mentor, coach and steer more junior colleagues across technical challenges;
  • Share the on-call rotation and be an escalation contact for incidents.
  • Strong proficiency in at least one programming language. For example with .Net/C#, Python and/or Java;
  • Proven experience with Shell scripting (Powershell/Bash) and automation;
  • Experience with designing, implementing and maintaining complex and scalable system infrastructures;
  • Strong Windows and/or Linux administration and troubleshooting skills;
  • Be able to understand and formulate meaningful business metrics;
  • Creative and not afraid to step outside of your comfort zone;
  • Fluent in the English language both spoken and written.
  • Logging infrastructure and tools such as Logstash, Elasticsearch, Kibana, Graphite, Splunk, and HDFS;
  • Configuration management and provisioning tools such as Puppet, Chef and DSC;
  • Experience with orchestration tools;
  • Experience with RESTfull API or Webhook;
  • Load balancing and identity management solutions;
  • Additional experience in Networking, Security or Storage is a plus.