Software Development Engineer — CICD, Trainium Manufacturing Test Infrastructure
Cupertino, CA - USA
Department:
Job Summary
validation server-level testing and rack-level testing at scale. We directly enable the manufacturing ramp of AWSs custom AI training chips.
We are looking for a Software Development Engineer to own and evolve the CI/CD infrastructure that delivers software to Trainium manufacturing sites worldwide. You will build and maintain deployment pipelines that push tested validated code to production Outpost environments across multiple manufacturing partners. Your work directly impacts how fast Trainium servers move from factory floor to customer every hour of pipeline latency is lost customer revenue.
Key job responsibilities
- Design build and maintain CI/CD pipelines (AWS CDK Pipelines) that deploy containerized services to AWS Outposts at global manufacturing sites
- Extend the manufacturing infrastructure platform (TypeScript CDK Python microservices) to support new workflows for Trainium accelerator cards baseboards and rack-level integration
- Build integration test frameworks and canary systems that validate service health across all production sites before and after deployments
- Develop automated alarming rollback mechanisms and deployment wave strategies to ensure zero-downtime releases to active manufacturing lines
- Develop infrastructure-as-code for containerized services databases artifact storage messaging queues and authentication systems deployed on Outposts
- Collaborate with Test Engineering teams Hardware Engineers and Supply Chain to resolve bottlenecks in the manufacturing process
About the team
Annapurna Labs is a wholly owned subsidiary of AWS focused on developing custom silicon and servers including the Nitro Graviton Inferentia and Trainium families of processors.
Machine Learning Annapurna (MLA) functions as a vertically integrated team including software firmware hardware and silicon design in a single organization.
We are the Training Servers and Systems organization under MLA focused on Hardware Development Software Development Fleet Ops Systems and Manufacturing Quality and Reliability.
This position is in the Manufacturing Quality and Reliability team.
- BS degree in computer science or equivalent
- Experience with at least one general-purpose programming language such as Java Python C C# Go Rust or TypeScript
- Experience with CI/CD pipeline design and implementation (AWS Pipelines CircleCI GitLab CI GitHub Actions Jenkins or similar)
- Experience with cloud services (AWS GCP or Azure) particularly IaC tools such as CDK CloudFormation Terraform or Pulumi
- Experience deploying software to edge/hybrid environments (AWS Outposts on-premises)
- Experience with containerized microservice architectures (Docker ECS/EKS Kubernetes)
- Familiarity with hardware test automation or manufacturing systems
- Experience with setting up CI/CD for system software
- Familiarity with network configuration in constrained environments (VPN CIDR management site connectivity)
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status disability or other legally protected status.
Los Angeles County applicants: Job duties for this position include: work safely and cooperatively with other employees supervisors and staff; adhere to standards of excellence despite stressful conditions; communicate effectively and respectfully with employees supervisors and staff to ensure exceptional customer service; and follow all federal state and local laws and Company policies. Criminal history may have a direct adverse and negative relationship with some of the material job duties of this position. These include the duties and responsibilities listed above as well as the abilities to adhere to company policies exercise sound judgment effectively manage stress and work safely and respectfully with others exhibit trustworthiness and professionalism and safeguard business operations and the Companys reputation. Pursuant to the Los Angeles County Fair Chance Ordinance we will consider for employment qualified applicants with arrest and conviction records.
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process including support for the interview or onboarding process please visit for more information. If the country/region youre applying in isnt listed please contact your Recruiting Partner.
The base salary range for this position is listed below. Your Amazon package will include sign-on payments and restricted stock units (RSUs). Final compensation will be determined based on factors including experience qualifications and location. Amazon also offers comprehensive benefits including health insurance (medical dental vision prescription Basic Life & AD&D insurance and option for Supplemental life plans EAP Mental Health Support Medical Advice Line Flexible Spending Accounts Adoption and Surrogacy Reimbursement coverage) 401(k) matching paid time off and parental leave. Learn more about our benefits at CA Cupertino - 127100.00 - 185000.00 USD annually
Required Experience:
IC
About Company
Free shipping on millions of items. Get the best of Shopping and Entertainment with Prime. Enjoy low prices and great deals on the largest selection of everyday essentials and other products, including fashion, home, beauty, electronics, Alexa Devices, sporting goods, toys, automotive ... View more