HPC Manager at DownUnder GeoSolutions (London, UK)
Desired Skills and Experience
- Managing global HPC systems
- Participating in the planning and deployment of new machines
- Hiring and overseeing the HPC support and devops team
- Setting the monitoring, automation, alerting, and problem response strategy
- Managing priorities for the team
Ensuring an appropriate balance between time spent solving one-off problems vs. long-term solutions and automation
- Ensuring an appropriate balance between time spent solving one-off problems vs. long-term solutions and automation
- Management reporting
- Day-to-day management of the physical computer room facilities (e.g. power and cooling)
- Operating the LAN / WAN networks
- Procurement and asset management of relevant IT equipment
- Working with the IT Manager to ensure that all IT functions are covered.
- Linux system administration and architecture experience
- Management / leadership experience in a role of significant responsibility
- Demonstrated ability to document and organise complex IT systems
- Experience managing a significant budget, and external vendors
- Excellent communication skills, including written and spoken business and technical English.
- Large ethernet networks
- The Lustre cluster file system
- High-performance electrical and cooling systems.