Employer Active
- USA
Not Disclosed
Salary Not Disclosed
1 Vacancy
Senior Data Scientist
Mandatory Areas
MustHave Skills
Python 6 Yrs of Exp
Pyspark 6 Yrs of Exp
Pytorch 6 Yrs of Exp
GCP 3 Yrs of Exp
Web development Prior experience 3 Years
Docker 4 Years
KubeFlow 4 Years
Domain Experience (If any ) Retail Experience
looking for a highly energetic and collaborative Senior Data Scientist with experience building enterprise level GenAI applications designed and developed MLOps pipelines . The ideal candidate should have deep understanding of the NLP field hands on experience in design and development of NLP models and experience in building LLMbased applications. Excellent written and verbal communication skills with the ability to collaborate effectively with domain experts and IT leadership team is key to be successful in this role. We are looking for candidates with expertise in Python Pyspark Pytorch Langchain GCP Web development Docker Kubeflow etc.
Key Responsibilities:
Work with Walmarts AI/ML Platform Enablement team within the eCommerce Analytics team. The broader team is currently on a transformation path and this role will be instrumental in enabling the broader teams vision.
Work closely with other Data Scientists to help with production models and maintain them in production.
Deploy and configure Kubernetes components for production cluster including API Gateway Ingress Model Serving Logging Monitoring Cron Jobs etc. Improve the model deployment process for MLE for faster builds and simplified workflows
Be a technical leader on various projects across platforms and a handson contributor of the entire platforms architecture
Responsible for leading operational excellence initiatives in the AI/ML space which includes efficient use of resources identifying optimization opportunities forecasting capacity etc.
Design and implement different flavors of architecture to deliver better system performance and resiliency.
Develop capability requirements and transition plan for the next generation of AI/ML enablement technology tools and processes to enable Walmart to efficiently improve performance with scale.
Tools/Skills (handson experience is must):
Ability to transform designs ground up and lead innovation in system design
Deep understanding of GenAI applications and NLP field
Hands on experience in the design and development of NLP models
Experience in building LLMbased applications
Design and development of MLOps pipelines
Fundamental understanding on the data science parameterized and nonparameterized algorithms.
Knowledge on AI/ML application lifecycles and workflows.
Experience in the design and development of an ML pipeline using containerized components.
Have worked on at least one Kubernetes cloud offering (EKS/GKE/AKS) or onprem Kubernetes (native Kubernetes Gravity MetalK8s)
Programming experience in Python Pyspark Pytorch Langchain Docker Kubeflow
Ability to use observability tools (Splunk Prometheus and Grafana ) to look at logs and metrics to diagnose issues within the system.
Experience with Web development
Education & Experience:
6 years relevant experience in roles with responsibility over data platforms and data operations dealing with large volumes of data in cloud based distributed computing environments.
Graduate degree preferred in a quantitative discipline (e.g. computer engineering computer science economics math operations research).
Proven ability to solve enterprise level data operations problems at scale which require crossfunctional collaboration for solution development implementation and adoption.
Notes : We are looking for a data scientist who can contribute to the following domains.Design and development of GenAI applications Deeper understanding of the NLP field. Hands on experience in the design and development of NLP models Experience in building LLMbased applications.Design and development of MLOps pipelines Fundamental understanding on the data science parameterized and nonparameterized algorithms. Knowledge on AI/ML application lifecycles and workflows. Experience in the design and development of an ML pipeline using containerized components.
Skills: Python Pyspark Pytorch Langchain GCP Web development Docker KubeFlow
Full Time