ING is hiring SRE Engineer(s) Site Reliability Engineering - WBS

Your working environment: 

​The Site Reliability Engineering (SRE) teams is a concept that is initially implemented by Google, to improve their reliability. The idea of SRE teams has been replicated in multiple industries around the world. A SRE team is a multidisciplinary team that has as primary goal to continuously and structurally improve the reliability and maintainability of the IT environments in different domains using their software engineering skills. The SRE team also acts as temporary support when a certain Dev/Ops Team is in need. Once the SRE Team’s help is called upon by a Dev/Ops team, the focus and the available capacity of the team automatically shifts to recovering the stressed service(s). In this stage, complete focus is on restoration of the service back to normal. The SRE Team does not take over any of Dev/Ops responsibilities. Once the level of service is structurally restored, the SRE Team terminates their support and moves on to the next Dev/Ops team.In the Netherlands, SRE teams are set up per domain (4 domains in NL) and in Belgium there is one SRE team serving all domains. For Wholesale Banking Services we are looking for a highly skilled engineer.

Activities done by SRE (e.g.):

Your new job!

You are an enthusiastic Software Engineer with a focus on creating amazing code. You will develop consumable, automated end-to-end software solutions for your consumers together with your squad. You are able to move all typical user stories and tasks from “To-Do” to “Done”. You are able to inspire squads to spread SRE through the bank.

You are enthusiastic about transferring your knowledge to others within your team, but also with all DevOps teams in WBS and other SRE teams within ING.

What will you do? (Capabilities)

You recognize yourself in this profile

Competencies - Skills

Your education and background

We offer:

Like it ?

Don’t apply

Love it ?

( hell yes ) apply ….. 

Desired Skills and Experience

  • Prevention of Incident Reoccurrence
  • Improving Incident Mitigation Capabilities
  • Follow up on improvement actions after high impact incidents (root cause)
  • Support in creating structural solutions instead of workarounds
  • Support in automation of the services (create consumable services)
  • Continuous Improvement of Continuous Delivery & Software Engineering Practices
  • Improve the MTTR (Mean Time To Repair) and MTBSF (Mean Time Between Service Failures) of service impacting incidents
  • Create and sustain excellent monitoring of chains
  • Helping teams improve the resilience of their assets
  • Create real-time and standardized insights of production chain for faster incident analysis
  • Help improving teams in data mining- and analysis skills
  • Develop consumable, standardized software solutions (resilient, high quality, highly automated and up-to-date)
  • Develop an ecosystem of tools to provide self-service capabilities to the consumer
  • Manage all resources in version-controlled repositories (incl., code, scripts, configurations, artefacts, static resources)
  • Refactor and reuse existing code/modules/functionality
  • Write fully automated tests (e.g., unit-, functional-, non-functional- and integration testing)
  • Build, enhance and maintain tooling and scripts to automate repetitive or error prone tasks
  • Develop micro services and APIs
  • Get to know the services in WBS through analysis of the current level of service and the underlying causes.
  • Formulate an improvement plan to gain a higher reliability level
  • Prioritise the improvement plan with the responsible devOps teams
  • Recognise shortcomings and deliver the necessary resources and skills to devOps teams where needed.
  • Mastery in at least one programming language, Java and/or .net are a plus;
  • Working knowledge of configuration tools like Puppet, Chef or Ansible
  • Scripting experience in at least one of the following: Ruby, Python, Bash, Powershell
  • Solid foundation in Linux administration and troubleshooting;
  • Proven experience with automation.
  • Be able to understand and formulate meaningful business metrics;
  • Professional and intellectual IT ability at university level - at least Bachelor’s degree
  • Experience in Agile/scrum
  • Certifications and/or proven proficiency in one or more programming/scripting languages
  • Excellent command of spoken and written Dutch and English
  • Great salary and benefits like 13th month, 8% holiday allowance, personal allowances etc.
  • Challenging, professional and fun work environment.