Staff Engineer AI Compute Blade and Rack Validation

Graphcore

Not Interested
Bookmark
Report This Job

profile Job Location:

Austin, TX - USA

profile Monthly Salary: Not Disclosed
Posted on: 2 days ago
Vacancies: 1 Vacancy

Job Summary

About us

Graphcore is one of the worlds leading innovators in Artificial Intelligence compute.

It is developing hardware software and systems infrastructure that will unlock the next generation of AI breakthroughs and power the widespread adoption of AI solutions across every industry.

As part of the SoftBank Group Graphcore is a member of a best-in-class family of companies responsible for some of the worlds most transformative technologies. Together they share a bold vision: to enable Artificial Super Intelligence and ensure its benefits are accessible to everyone.

Graphcores teams are drawn from a diverse group of backgrounds and bring a broad range of skills and perspectives. A melting pot of AI research specialists silicon designers software engineers and systems architects Graphcore enjoys a culture of continuous learning and constant innovation.

Job Summary

We are seeking a senior validation lead engineer to lead at-scale rack validation efforts for next-generation AI hyperscale systems. This role focuses on post-silicon system validation across the full lifecycle ensuring functional electrical and thermal performance meets product objectives. You will own end-to-end blade and rack validation including planning development execution and debug while collaborating across firmware systems and hardware teams.

The Team

The Rack Validation team is responsible for ensuring system readiness and quality at scale. The team works cross-functionally with firmware silicon and system engineering teams to validate complex AI compute platforms.

Responsibilities and Duties

  • Lead post-silicon validation of AI compute blades and racks including test planning development and automation.
  • Drive provisioning and integration of system components (SoC FW BMC RMC OS) for rack-level readiness.
  • Own execution against program achievements and report validation progress and risks.
  • Triage test failures collect debug data and collaborate on root cause analysis.
  • Track validation coverage and continuously improve test processes and infrastructure.
  • Collaborate with ODM/JDM partners on validation and quality.
  • Mentor engineers and drive engineering excellence.

Candidate Profile

Essential:

  • Bachelors or Masters degree or equivalent experience in Computer Engineering Electrical Engineering Computer Science or related field.
  • Proven track record in system rack or embedded validation with leadership experience.
  • Strong experience in large-scale hardware validation environments.
  • Expertise in CPU/GPU memory IO and firmware validation.
  • Experience with Linux/server OS and automation using Python/Bash.
  • Knowledge of IPMI Redfish PLDM.
  • Experience with CI/CD pipelines and hardware interfaces.

Desirable:

  • Experience in hyperscale environments.
  • Familiarity with OpenBMC and processes for verifying firmware functionality.
  • Knowledge of firmware security and HIL testing.
  • Experience with test management tools.

Required Experience:

Staff IC

About usGraphcore is one of the worlds leading innovators in Artificial Intelligence compute.It is developing hardware software and systems infrastructure that will unlock the next generation of AI breakthroughs and power the widespread adoption of AI solutions across every industry.As part of the S...
View more view more

About Company

Company Logo

GoImplement.it - HL7 v2, v3, FHIR, CDA, IHE and Conecthatons. We know all about healthcare IT standards so you can focus on your core product.

View Profile View Profile