AIML - Staff Software Engineer, ML Compute - ML Platform & Technologies

Job Location

Seattle - USA

Monthly Salary

USD 171600 - 302200

Vacancy

1 Vacancy

Job Description

As a staff software engineer, you have the following responsibilities:

  • Own the architecture, design, development, and operations of large-scale systems designed for machine learning.
  • Develop custom scheduling, resource management solutions, and fleet management for our ML model training compute infrastructure.
  • Collaborate with multi-functional teams, integrate with Kubernetes in on-premises and cloud provider clusters, and enable seamless integration with NVIDIA GPUs and other ML accelerators.
  • Partner with data scientists and machine learning engineers across different Apple organizations to define high-impact product features and deliver them. In this role, you are building the platform upon which other teams will develop data pipelines and machine learning applications.
  • Lead a group of engineers to deliver high-quality products and services. Stay on top of innovative technologies and apply them on the job.
  • Along the way, support junior engineers by providing advice, mentoring, and educational opportunities.

Minimum Qualifications:


  • Bachelor's degree in Computer Science
  • 10 years of industry-related experience working in collaborative environments
  • Experience with public cloud infrastructure such as Kubernetes, Amazon EC2 and EKS, and Google Cloud Platform
  • Strong software development skills with proficiency in relevant languages (e.g., Golang, Python)
  • Strong problem-solving skills and the ability to write performant, high-quality code
  • Proficient understanding of the software development process, including unit testing and release management
  • Excellent interpersonal skills; able to work independently as well as in a team; can take feedback and iterate on a solution in a collaborative setting

Preferred Qualifications:


  • Strong understanding of batch scheduling systems and high-performance computing environments
  • Experience using system monitoring tools, automated testing frameworks, and CI/CD pipelines
  • Experience with GPUs and/or other ML accelerators in the context of machine learning
  • A passion for making simple, robust, and scalable platforms used by other engineering teams
  • Flexibility and adaptability for working in a dynamic environment with different frameworks and requirements
  • Master's or PhD in Computer Science

Required Experience:

Staff IC

Employment Type

Full-Time
