drjobs Machine Learning Engineer II AWS Just-Walk-Out Science Team

Machine Learning Engineer II AWS Just-Walk-Out Science Team

Employer Active

1 Vacancy
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

Seattle - USA

Yearly Salary drjobs

$ 129300 - 223600

Vacancy

1 Vacancy

Job Description

As part of the AWS Solutions organization we have a vision to provide business applications leveraging Amazons unique experience and expertise that are used by millions of companies worldwide to manage daytoday operations. We will accomplish this by accelerating our customers businesses through delivery of intuitive and differentiated technology solutions that solve enduring business challenges. We blend vision with curiosity and Amazons realworld experience to build opinionated turnkey solutions. Where customers prefer to buy over build we become their trusted partner with solutions that are nobrainers to buy and easy to use.

The Team

Just Walk Out (JWO) is a new kind of store with no lines and no checkoutyou just grab and go! Customers simply use the Amazon Go app to enter the store take what they want from our selection of fresh delicious meals and grocery essentials and go!

Our checkoutfree shopping experience is made possible by our Just Walk Out Technology which automatically detects when products are taken from or returned to the shelves and keeps track of them in a virtual cart. When youre done shopping you can just leave the store. Shortly after well charge your account and send you a receipt. Check it out at Designed and custombuilt by Amazonians our Just Walk Out Technology uses a variety of technologies including computer vision sensor fusion deep learning and foundation models. Innovation is part of our DNA! Our goal is to be Earths most customer centric company and we are just getting started. We need people who want to join an ambitious program that continues to push the state of the art in computer vision deep learning realtime and distributed systems and hardware design.

Everyone on the team needs to be entrepreneurial wear many hats and work in a highly collaborative environment thats more startup than big company. The team works on designing autonomous AI agents that can make intelligent decisions based on visual inputs understand customer behavior patterns and adapt to dynamic retail environments. This includes developing systems that can perform complex scene understanding reason about object permanence and predict customer intentions through visual cues.


Key job responsibilities
Collaborate with Applied Scientists to integrate stateoftheart model architectures into the training pipeline integrate stateoftheart MLLMs into the autolabelling pipeline.
Collaborate with Applied Scientists to process massive data scale machine learning models while optimizing GPU utilization memory management and the training workflows (like kernel fusion mixedprecision training gradient accumulation offloading optimizer states massive parallelization etc).
Design and maintain largescale distributed training systems to support multimodal foundation models for autonomous retailing. Optimize GPU utilization for efficient model training and finetuning on massive datasets.
Develop robust monitoring and debugging tools to ensure the reliability and performance of training workflows on large GPU clusters. Design and maintain largescale autolabeling pipeline.
Collaborate with Engineers and Applied Scientists to investigate design approaches prototype new technology and evaluate technical feasibility identify and solve complex problems.

A day in the life
As a MLE with the JWO team you will be responsible for leading the development of novel algorithms and modeling techniques to advance the state of the art of model training using hardware like NVDIA GPUs. Your work will directly impact our customers in the form of products and services that make use of JustWalkOut innovations. You will leverage Amazons heterogeneous data sources and largescale computing resources to accelerate development with multimodal Foundation Models and other Artificial Intelligence (AI) applications. As a key player in our team youll have a significant influence on our overall strategy shaping the future direction of JWO at Amazon. Youll be the driving force behind our system architecture and the champion of best practices that will ensure an unparalleled infrastructure of the highest quality. Work in an Agile/Scrum environment to move fast and deliver high quality software.

3 years of noninternship professional software development experience including coding standards code reviews source control management build processes testing and operations.
2 years of noninternship design or architecture (design patterns reliability and scaling) of new and existing systems experience.
Proficient in Python or related language.
Handson model training experience in PyTorch and deep learning frameworks such as MMEngine or MegatronLM; experienced in largescale deep learning or machine learning operations.
Familiar with modern visuallanguage models multimodal AI systems pretraining and posttraining techniques. Proficient in training profilers and performance analysis tools to identify and optimize bottlenecks in model training.

Masters or PhD degree in computer science or equivalent.
1 years of experience in developing deploying or optimizing ML models. Exceptional engineering skills in building testing and maintaining scalable distributed GPU training frameworks. Familiar with HuggingFace Transformers for visionlanguage modeling.
Handson experience in largescale multimodal LLM and generative model training. Contributions to popular opensource LLM frameworks or research publications in toptier AI conferences such as CVPR ECCV ICCV ICLR etc.
Experience in GPU utilization and memory optimization techniques like kernel fusion and custom kernels mixed precision training using lower precision and dynamic loss scaling gradient (activation) checkpointing gradient accumulation offloading optimizer states and smart prefetching Fully Sharded Data Parallel (FSDP) tensor and pipeline model parallelism.
Proven experience in largescale video understanding tasks with a focus on multimodal learning that integrates visual and/or textual information; includes experience designing efficient data preprocessing pipelines building and scaling multimodal model architectures and conducting robust evaluation at scale.

Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status disability or other legally protected status.

Los Angeles County applicants: Job duties for this position include: work safely and cooperatively with other employees supervisors and staff; adhere to standards of excellence despite stressful conditions; communicate effectively and respectfully with employees supervisors and staff to ensure exceptional customer service; and follow all federal state and local laws and Company policies. Criminal history may have a direct adverse and negative relationship with some of the material job duties of this position. These include the duties and responsibilities listed above as well as the abilities to adhere to company policies exercise sound judgment effectively manage stress and work safely and respectfully with others exhibit trustworthiness and professionalism and safeguard business operations and the Companys reputation. Pursuant to the Los Angeles County Fair Chance Ordinance we will consider for employment qualified applicants with arrest and conviction records.

Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process including support for the interview or onboarding process please visit for more information. If the country/region youre applying in isnt listed please contact your Recruiting Partner.

Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $129300/year in our lowest geographic market up to $223600/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on jobrelated knowledge skills and experience. Amazon is a total compensation company. Dependent on the position offered equity signon payments and other forms of compensation may be provided as part of a total compensation package in addition to a full range of medical financial and/or other benefits. For more information please visit
This position will remain posted until filled. Applicants should apply via our internal or external career site.

Employment Type

Full-Time

Department / Functional Area

Software Development

About Company

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.