Desired Skills and Experience

  • Managing global HPC systems
  • Participating in the planning and deployment of new machines
  • Hiring and overseeing the HPC support and devops team
  • Setting the monitoring, automation, alerting, and problem response strategy
  • Managing priorities for the team

  • Ensuring an appropriate balance between time spent solving one-off problems vs. long-term solutions and automation
  • Management reporting
  • Day-to-day management of the physical computer room facilities (e.g. power and cooling)
  • Operating the LAN / WAN networks
  • Procurement and asset management of relevant IT equipment
  • Working with the IT Manager to ensure that all IT functions are covered.
  • Linux system administration and architecture experience
  • Management / leadership experience in a role of significant responsibility
  • Demonstrated ability to document and organise complex IT systems
  • Experience managing a significant budget and external vendors
  • Excellent communication skills, including written and spoken business and technical English.
  • Large Ethernet networks
  • The Lustre cluster file system
  • High-performance electrical and cooling systems.
