Senior, Data Scientist (Machine Learning Engineer)

Walmart

Not Interested
Bookmark
Report This Job

profile Job Location:

Sunnyvale, CA - USA

profile Monthly Salary: Not Disclosed
Posted on: Yesterday
Vacancies: 1 Vacancy

Job Summary

Position Summary...

What youll do...

Position Summary...

The Catalog Data Science team at Walmart plays a pivotal role in maintaining and enhancing the data quality of Walmarts massive catalog. We aid supplier onboarding merchandise acquisition inventory management and shopper experience by leveraging cutting-edge technologies in GenAI Machine Learning Deep Learning and Engineering. We tackle complex problems spanning natural language understanding image classification and recommendation to outlier detection visualization and model serving. We take pride in writing solid production code in Python deploying and supporting model services and pipelines and pushing the boundaries in latency throughput and scalability.

Trust and Safety (T&S) is an integral part of the Catalog Data Science Org responsible for maintaining customer trust in the Walmart marketplace. We employ state-of-the-art GenAI and ML models to identify products that violate Walmarts marketplace policies. Our end-to-end ML pipelines are designed to scale and detect policy violations across hundreds of violation classes and billions of catalog items ensuring a safe marketplace for our customers. Our work carries high visibility directly impacting marketplace growth and compliance at Walmart.

As a Senior Data Scientist (Machine Learning Engineer) on the Trust and Safety team you will collaborate with other Data Scientists and ML Engineers to develop deploy and scale machine learning models in production. You will play a key role in building the next generation of our compliance detection platform driving model serving pipeline reliability and the adoption of GenAI-powered solutions to more accurately detect items that violate compliance policies.

What youll do

  • Design and deploy production-grade ML systems for Walmarts Catalog Trust & Safety platform spanning classification detection and segmentation

  • Apply GenAI NLP and Computer Vision techniques to build and continuously improve models for compliance detection content moderation and policy violation classification

  • Own the full model lifecycle from experimentation and offline evaluation through serving monitoring and iterative improvement in production

  • Build and optimize high-throughput batch and real-time inference pipelines using frameworks like Ray Triton and vLLM with a focus on latency cost and reliability

  • Drive ML architecture decisions including model selection distillation quantization and serving strategies

  • Partner with Compliance Product and Operations teams to translate business requirements into model KPIs evaluation frameworks and measurable impact

  • Establish and enforce ML engineering best practices across the team: reproducible training robust evaluation datasets versioned artifacts and production readiness standards

  • Contribute to the broader ML engineering community at Walmart through technical documentation internal talks and cross-team knowledge sharing

What youll bring

  • PhD or Masters in Computer Science or equivalent experience; 3 years building and deploying production ML systems at scale

  • Deep expertise in model serving and inference optimization experience with Triton Inference Server vLLM TorchServe or comparable frameworks

  • Hands-on experience with Generative AI technologies: LLMs multimodal models RAG architectures prompt engineering and fine-tuning (LoRA/QLoRA PEFT)

  • Strong foundation in classical ML deep learning and modern architectures CNNs Transformers and domain-specific variants

  • Proven ability to build and operate large-scale batch and real-time inference pipelines handling high QPS with strict latency and throughput SLAs

  • Proficiency in Python and ML ecosystem tooling PyTorch HuggingFace scikit-learn NumPy; familiarity with distributed compute frameworks (Ray Spark)

  • Experience deploying and managing ML workloads on Kubernetes; solid working knowledge of Docker Helm and container orchestration

  • Familiarity with ML observability model monitoring data drift detection performance degradation alerting and online evaluation strategies

  • Practical experience with MLOps tooling: experiment tracking (MLflow W&B) pipeline orchestration (Airflow Kubeflow) and CI/CD for ML

  • Hands-on with at least one major cloud platform (GCP Azure etc.) and comfort with managed ML services and GPU infrastructure

  • Working knowledge of relational and NoSQL databases

  • Experience with vector databases (Pinecone Weaviate pgvector) and hybrid retrieval systems for GenAI applications

  • Experience with Version Control Systems especially Git

  • Strong verbal and written communication skills; ability to translate complex ML systems into clear technical and business narratives

  • Proactive in tracking the latest AI/ML research and translating advancements into production-grade solutions

Minimum Qualifications...

Outlined below are the required minimum qualifications for this position. If none are listed there are no minimum qualifications.

Option 1- Bachelors degree in Statistics Economics Analytics Mathematics Computer Science Information Technology or related field and 3 years experience in an analytics related field. Option 2- Masters degree in Statistics Economics Analytics Mathematics Computer Science Information Technology or related field and 1 years experience in an analytics related field. Option 3 - 5 years experience in an analytics or related field.

Preferred Qualifications...

Outlined below are the optional preferred qualifications for this position. If none are listed there are no preferred qualifications.

Data science machine learning optimization models Masters degree in Machine Learning Computer Science Information Technology Operations Research Statistics Applied Mathematics Econometrics Successful completion of one or more assessments in Python Spark Scala or R Using open source frameworks (for example scikit learn tensorflow torch) We value candidates with a background in creating inclusive digital experiences demonstrating knowledge in implementing Web Content Accessibility Guidelines (WCAG) 2.2 AA standards assistive technologies and integrating digital accessibility seamlessly. The ideal candidate would have knowledge of accessibility best practices and join us as we continue to create accessible products and services following Walmarts accessibility standards and guidelines for supporting an inclusive culture.

Primary Location...

1375 Crossman Ave Sunnyvale CA 94089-1114 United States of America

Walmart and its subsidiaries are committed to maintaining a drug-free workplace and has a no tolerance policy regarding the use of illegal drugs and alcohol on the job. This policy applies to all employees and aims to create a safe and productive work environment.

Required Experience:

IC

Position Summary...What youll do...Position Summary...The Catalog Data Science team at Walmart plays a pivotal role in maintaining and enhancing the data quality of Walmarts massive catalog. We aid supplier onboarding merchandise acquisition inventory management and shopper experience by leveraging ...
View more view more

About Company

Company Logo

Walmart started with one man. In 1962, Sam Walton began with just one store and one mission: help people save money so they could live better. As a growing global digital enterprise and with over 11,500 stores, we maintain Mr. Sam’s vision, but now, we are able to help more customers ... View more

View Profile View Profile