The Position
We are seeking an exceptional and visionary Chief Data Center Engineer to lead the design deployment operation and optimization of next-generation High Performance Computing (HPC) Centers mission-critical facilities engineered for complex AI/ML workloads at hyperscale. This role is central to realizing our vision of a world-class AI/ML infrastructure platform.
You will be responsible for overseeing the end-to-end engineering lifecycle of group HPC Centers spanning mechanical electrical controls system and digital solutions. The ideal candidate will have deep expertise in data center engineering with a strong track record of managing interdisciplinary teams and delivering robust scalable and efficient facilities tailored to HPC environments.
Because our projects are schedule driven decisiveness strong instincts assertive leadership and executive communication skills are all key characteristics.
Responsibilities:
- Define the engineering roadmap for our HPC Center buildout aligning with long-term HPC and AI/ML platform growth objectives.
- Serve as the technical authority on infrastructure architecture resiliency and operations for the facility.
- Ensure HPC Center designs meet industry standards and keep pace with the evolution of technology.
- Implement disciplined engineering baseline control and change management processes
- Develop engineering KPIs to drive high probability outcomes and learning curve improvements on future design build iterations to include performance reliability and PUE.
- Oversee all phases of facility development: site assessment conceptual design detailed engineering construction and commissioning.
- Ensure full compliance with safety environmental and industry regulatory requirements.
- Develop and manage a multidisciplinary team of engineers: mechanical electrical plumbing controls ( reliability engineering and more.
- Ensure all systems are optimized for high-density compute liquid/air cooling technologies and energy efficiency.
- Keep a pulse on engineering and technological breakthroughs that can be applied to improve the efficiency effectiveness reliability and/or performance of our HPC infrastructure.
- Interface with external stakeholders such as construction managers general contractors OEM vendors and third-party design engineers.
- Ensure monitoring alerting and fault management systems are in place across HPC infrastructure.
- Maintain engineering documentation and enforce rigorous QA/QC processes.
Requirements:
- Bachelor s or Master s degree in Mechanical Electrical or a related Engineering field.
- 10 years of experience in mission-critical engineering (Data Centers Weapons Systems Air/Space platforms Offshore Oil &Gas) with at least 5 years in a senior leadership capacity.
- Understanding the marriage of mechanical and power systems via controls to meet a dynamic power and cooling environment.
- Proven experience in designing and operating data centers or similar high-performance environments as noted above
- Effective leader & communicator who thrives in a collaborative team environment.
- Bonus: Understanding of AI problem sets and their flow down of requirements to compute and supporting power and cooling
- Bonus: Lead engineer experience through full lifecycle of large scale program(s)