Nextlink Solutions is looking for a skilled ML Platform Engineer to take charge of automating deploying patching and maintaining machine learning platform infrastructure. This role requires hands-on expertise with Cloudera Data Science Workbench (CDSW) Cloudera Data Platform (CDP) Docker Kubernetes and scripting with Python and Ansible. The successful candidate will contribute to building a reliable and scalable ML environment ensure high platform availability apply MLOps best practices and collaborate closely with cross-functional engineering teams to deliver seamless deployments and operational stability.
Key Responsibilities
Automate deployment and management processes for ML platforms using Ansible and Python
Deploy patch and monitor components like CDSW Docker and Kubernetes clusters
Maintain high availability and performance of infrastructure with proactive monitoring
Write and maintain detailed documentation on platform configurations and procedures
Troubleshoot infrastructure issues to ensure minimal downtime
Implement security scalability and automation best practices across the ML platform.
Requirements
8 years of experience in ML platform engineering
5 years Hands-on experience with Cloudera Data Science Workbench
8 years Strong proficiency in Docker and Kubernetes for container orchestration
8 years Expert scripting and automation with Python and Ansible
8 years experience with GitLab for CI/CD and source control
Familiarity with MLOps principles and practices
Proven experience in patching and maintaining infrastructure systems
Excellent problem-solving skills with a strong collaborative mindset
Expertise in Unix
+8 years of experience in ML platform engineering, +5 years Hands-on experience with Cloudera Data Science Workbench +8 years Strong proficiency in Docker and Kubernetes for container orchestration + 8 years Expert scripting and automation with Python and Ansible +8 years experience with GitLab for CI/CD and source control Familiarity with MLOps principles and practices Proven experience in patching and maintaining infrastructure systems Excellent problem-solving skills with a strong collaborative mindset Expertise in Unix