Job Description:
-
Translate data science prototypes into production-grade ML services and pipelines.
-
Build training and inference code with reproducibility versioning and automated testing.
-
Implement scalable model serving (online/offline) batching and latency/throughput optimization.
-
Integrate model lifecycle tooling (tracking registry deployment automation monitoring).
-
Collaborate with Data Engineering on feature pipelines and data contracts.
-
Own production health: drift detection performance regression rollback strategies and incident response.
-
5 years software engineering with 2 years shipping ML models to production.
-
Strong Python skills and experience with ML frameworks (TensorFlow/PyTorch).
-
Experience with containers and orchestration (Docker/Kubernetes) and API development.
-
Understanding of ML system design (data leakage training-serving skew drift).
-
CI/CD and DevOps practices applied to ML workloads (MLOps).
-
Experience with feature stores model registries and model monitoring stacks.
-
GPU optimization and distributed training experience.
-
Experience with responsible AI toolkits and compliance requirements.
-
Python TensorFlow PyTorch Docker REST APIs
Job Description: Translate data science prototypes into production-grade ML services and pipelines. Build training and inference code with reproducibility versioning and automated testing. Implement scalable model serving (online/offline) batching and latency/throughput optimization. Integra...
Job Description:
-
Translate data science prototypes into production-grade ML services and pipelines.
-
Build training and inference code with reproducibility versioning and automated testing.
-
Implement scalable model serving (online/offline) batching and latency/throughput optimization.
-
Integrate model lifecycle tooling (tracking registry deployment automation monitoring).
-
Collaborate with Data Engineering on feature pipelines and data contracts.
-
Own production health: drift detection performance regression rollback strategies and incident response.
-
5 years software engineering with 2 years shipping ML models to production.
-
Strong Python skills and experience with ML frameworks (TensorFlow/PyTorch).
-
Experience with containers and orchestration (Docker/Kubernetes) and API development.
-
Understanding of ML system design (data leakage training-serving skew drift).
-
CI/CD and DevOps practices applied to ML workloads (MLOps).
-
Experience with feature stores model registries and model monitoring stacks.
-
GPU optimization and distributed training experience.
-
Experience with responsible AI toolkits and compliance requirements.
-
Python TensorFlow PyTorch Docker REST APIs
View more
View less