drjobs HPC Kubernetes Engineering Manager

HPC Kubernetes Engineering Manager

Employer Active

1 Vacancy
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

Dallas - USA

Monthly Salary drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Vacancy

1 Vacancy

Job Description

Do you want to tackle the biggest questions in finance with near infinite compute power at your fingertips

G-Research is a leading quantitative research and technology are proud to employ some of the best people in their field and to nurture their talent in a dynamic flexible and highly stimulating culture where world-beating ideas are cultivated and rewarded.

This is a hybrid role based in our new Dallas infrastructure hub where we work on the latest technologies in a cutting-edge environment.

The role

We are seeking a highly skilled Kubernetes Engineering Manager with a focus on HPC to join our Platform Engineering function in Dallas.

Kubernetes underpins all facets of our Research platforms and HPC. As HPC Kubernetes Engineering Manager you will take ownership of the strategic roadmap design and delivery of our Kubernetes platform. In addition you will focus on continuous optimisations and performance enhancements of our kubernetes platform as Research demands augment.

We are looking for a highly experienced technical manager who can lead the significant scaling up of our existing compute platforms. You will excel working on the bleeding edge of technology pushing the boundaries of HPC compute performance and providing an innovative approach to solving complex technical challenges that arise.

Working closely with the Kubernetes Platform Management team you will ensure a smooth transition of new engineering capabilities with a strong focus on operational excellence in all aspects of design and implementation.

Key responsibilities of the role include:

  • Designing deploying and scaling a high-performance Kubernetes platform to meet current and future demands

  • Engaging proactively with stakeholders to ensure the Kubernetes platform aligns with and supports broader business and research demands

  • Driving cross-functional engineering initiatives across the Technology and Research organisations through confident communication and collaboration

  • Managing vendor relationships providing continuous feedback to influence product roadmaps and ensuring efficient deployment support and maintenance of critical platforms

  • Leading and developing a high-performing engineering team across the UK and US fostering technical excellence and professional growth

  • Monitoring and evaluating emerging trends in the Kubernetes ecosystem and working with Architecture and Innovation teams to assess and adopt relevant technologies

  • Ensuring platform reliability availability and security by applying a DevOps mindset and managing infrastructure using Infrastructure-as-Code tools

  • Overseeing budgeting capacity forecasting and resource management for Kubernetes platform operations and future scaling

Who are we looking for

The ideal candidate will have the following skills and experience:

  • Deep technical expertise in designing and scaling high-performance Kubernetes platforms for HPC and ML workloads in distributed environments

  • Strong capability in performance tuning for ML workloads across GPU and CPU clusters including workload scheduling GPU integration and resource optimisation

  • Skilled in managing multi-tenant compute environments and integrating distributed file systems and high-speed interconnects such as InfiniBand and RoCE

  • Strong collaboration and stakeholder management skills aligning engineering outcomes with business value and ensuring smooth capability handover

  • Proven leadership and project management abilities fostering a high-performance culture and accountable engineering teams

  • Advocate of best practices across CI/CD automation and tooling configuration management and Site Reliability Engineering (SRE)

  • Committed to designing and building secure high-integrity systems with a security-first mindset

Why should you apply

  • Sick days military leave and family and medical leave

  • Generous 401(k) plan

  • 16-weeks fully paid parental leave

  • Medical and Prescription Dental and Vision insurance

  • Life and Accidental Death & Dismemberment (AD&D) insurance

  • Employee Assistance and Wellness programs

  • Generous relocation allowance and support

  • Great selection of office snacks and hot and cold drinks

  • Free on-site gym and car parking

This role is employed through our US affiliate.

G-Research is committed to cultivating and preserving an inclusive work environment. We are an ideas-driven business and we place great value on diversity of experience and opinions.

We want to ensure that applicants receive a recruitment experience that enables them to perform at their best. If you have a disability or special need that requires accommodation please let us know in the relevant section


Required Experience:

Manager

Employment Type

Full-Time

Company Industry

About Company

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.