Annapurna Labs builds custom Machine Learning accelerators that are at the forefront of AWS innovation and one of several AWS tools used for building Generative AI on AWS. The Neuron Compiler team is searching for compiler-skilled engineering talent to support the development and scaling of a compiler to enable the world's largest ML workloads to run performantly on these custom Annapurna systems.
The Product: The AWS Machine Learning accelerators represent a pinnacle of AWS technologies, specifically designed for advancing AI capabilities. The Inferentia/Trainium chips offer unparalleled ML inference and training performance. They are enabled through a state-of-the-art software stack, the AWS Neuron Software Development Kit (SDK). This SDK comprises an ML compiler, runtime, and application framework, which seamlessly integrate into popular ML frameworks like PyTorch. AWS Neuron, running on Inferentia and Trainium, is trusted and used by leading customers such as Snap, Autodesk, and Amazon Alexa.
The Team: Annapurna Labs was a startup company acquired by AWS in 2015 and is now fully integrated. If AWS is an infrastructure company, then think of Annapurna Labs as the infrastructure provider of AWS. Our org covers multiple disciplines, including silicon engineering, hardware design and verification, software, and operations. AWS Nitro, ENA, EFA, Graviton and F1 EC2 Instances, AWS Neuron, Inferentia and Trainium ML Accelerators, and, in storage, scalable NVMe are some of the products we have delivered over the last few years.
Within this ecosystem, the Neuron Compiler team is developing a deep learning compiler stack that takes state-of-the-art LLM and Vision models created in frameworks such as TensorFlow, PyTorch, and JAX and makes them run performantly on our accelerators. The team is comprised of some of the brightest minds in the engineering, research, and product communities, focused on the ambitious goal of creating a toolchain that will provide a quantum leap in performance.
You: As a PhD Machine Learning Compiler Engineer on the AWS Neuron Compiler team, you will be supporting the ground-up development and scaling of a compiler to handle the world's largest ML workloads. Architecting and implementing business-critical features, publishing cutting-edge research, and contributing to a brilliant team of experienced engineers excite and challenge you. You will leverage your technical communication skills as a hands-on partner to AWS ML services teams, and you will be involved in pre-silicon design, bringing new products/features to market, and many other exciting projects.
A background in compiler development is strongly preferred. A background in Machine Learning and AI accelerators is preferred but not required.
To be considered for this role, candidates must be currently located in, or willing to relocate to, Cupertino (preferred), Seattle, or Toronto.
To qualify, applicants should have earned (or will earn) a PhD between December 2023 and September 2025.
Proficiency in C and Python programming, applied to compiler or verification projects
Experience developing compiler optimizations or ML framework internals through research projects
Understanding of systems design and distributed architecture principles
Experience optimizing TensorFlow, PyTorch, or JAX deep learning models
Experience with multiple toolchains like LLVM, OpenXLA/XLA, MLIR, TVM
Knowledge of CUDA programming for GPU acceleration
Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status.
Los Angeles County applicants: Job duties for this position include: work safely and cooperatively with other employees, supervisors, and staff; adhere to standards of excellence despite stressful conditions; communicate effectively and respectfully with employees, supervisors, and staff to ensure exceptional customer service; and follow all federal, state, and local laws and Company policies. Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position. These include the duties and responsibilities listed above, as well as the abilities to adhere to company policies, exercise sound judgment, effectively manage stress and work safely and respectfully with others, exhibit trustworthiness and professionalism, and safeguard business operations and the Company's reputation. Pursuant to the Los Angeles County Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit
for more information. If the country/region you're applying in isn't listed, please contact your Recruiting Partner.
Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $129,300/year in our lowest geographic market up to $223,600/year in our highest geographic market. Pay is based on a number of factors, including market location, and may vary depending on job-related knowledge, skills, and experience. Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. For more information, please visit This position will remain posted until filled. Applicants should apply via our internal or external career site.