Senior Cluster Engineer High-performance Computing

Not Interested
Bookmark
Report This Job

profile Job Location:

Den Bosch - Netherlands

profile Monthly Salary: Not Disclosed
Posted on: 4 hours ago
Vacancies: 1 Vacancy

Job Summary

The organization

Our client operates one of the largest GPU infrastructures in the world 30000 GPUs and 10InfiniBand fabrics across five global data centers. Our infrastructure doubles in size everyyear. Were looking for engineers who love getting deep into Linux systems pushinghardware and software to their limits and making the worlds fastest AI and HPC workloadsrun even faster

The role

Youll join a small senior team that works between the hardware and Linux OS layers solving performance problems that affect tens of thousands of GPUs. This is hands-on high-impact engineering where microsecond gains matter and every optimization is felt at globalscale.

What youll do
  • Profile and optimize Linux kernel subsystems (CPU scheduling memorymanagement networking stack) for GPU clusters and InfiniBand fabrics

  • Troubleshoot and resolve complex performance bottlenecks

  • Integrate and validate new GPU hardware (KVM/QEMU PCIe devices Kubernetes)

  • Improve monitoring alerting and automation for large-scale distributed systems

  • Occasionally assist customers in optimizing workloads

Your profile
  • Wed love to hear from you if you have

  • Solid Linux internals knowledge ideally with kernel tuning or profiling experience(perf ftrace eBPF sysprof etc.)

  • Experience reading/debugging C or C system-level code

  • Scripting or development skills in Go Python or similar

  • A background in low-level complex environments such as HPC large-scale clustersor high-performance networking

Bonus points for:

  • GPU or HPC cluster experience

  • InfiniBand or other high-performance interconnect knowledge

  • Virtualization stacks (KVM/QEMU) Slurm Kubernetes

This is for you if you

Love solving deep technical challenges care about performance downto the microsecond and want to work on infrastructure that pushes the limits of whats possible.

Whats offered
  • Salary: up to 160k 25% bonus (200k OTE).

  • Flexible working arrangements.

  • A dynamic and collaborative work environment that values initiative and innovation.

  • Location: Amsterdam or remote.

The organizationOur client operates one of the largest GPU infrastructures in the world 30000 GPUs and 10InfiniBand fabrics across five global data centers. Our infrastructure doubles in size everyyear. Were looking for engineers who love getting deep into Linux systems pushinghardware and software...
View more view more

Key Skills

  • JProfiler
  • Splunk
  • Performance Testing
  • Fiddler
  • Apache
  • HP Performance Center
  • LoadRunner
  • New Relic
  • Scalability
  • J2EE
  • Java
  • Scripting

About Company

Company Logo

We focus on job opportunities in The Netherlands for IT and engineering professionals. We share relevant tips and tricks with jobseekers and we can support employers with regards to relocation, work permit rules, 30% ruling et cetera. We value transparancy, honesty and a no-nonsense a ... View more

View Profile View Profile