Sr AI DevOps Engineer

Momento USA


Job Location:

San Jose, CA - USA

Monthly Salary: Not Disclosed
Posted on: 9 days ago
Vacancies: 1 Vacancy

Job Summary

Momento USA is a global technology consulting talent acquisition and creative development firm that addresses clients most pressing needs and challenges.

We currently looking for Sr AI DevOps Engineer for a client based out in San Jose CA. Please see the job description below for your reference.

Position: Sr AI DevOps Engineer

Location: San Jose CA (On-site)---Need locals or Nearby states

Duration: 12 Months

Experience with Generative AI LLMOps RAG architectures and AI platform engineering.

Knowledge of NVIDIA GPU infrastructure and CUDA-based deployments.

Experience with Kubernetes-based AI platforms such as Kubeflow and KServe.

Position Summary

We are seeking an experienced AI DevOps Engineer to design implement and maintain scalable infrastructure and deployment pipelines for AI/ML applications. The ideal candidate will have strong expertise in Kubernetes Docker Infrastructure as Code (IaC) cloud platforms and CI/CD automation. This role will be responsible for enabling reliable secure and efficient deployment of AI/ML workloads across development testing and production environments.

Key Responsibilities

Design deploy and manage cloud-native infrastructure supporting AI/ML applications.

Build and maintain Kubernetes clusters for scalable container orchestration.

Develop and manage Docker containers for AI/ML services and microservices.

Implement Infrastructure as Code (IaC) using tools such as Terraform CloudFormation or Pulumi.

Create and optimize CI/CD pipelines for automated deployment of AI/ML models and applications.

Collaborate with Data Scientists ML Engineers and Software Developers to operationalize machine learning workflows.

Monitor system performance availability and security across cloud environments.

Implement logging monitoring and observability solutions using tools such as Prometheus Grafana ELK or Datadog.

Automate infrastructure provisioning configuration management and application deployments.

Manage cloud resources and optimize infrastructure costs.

Ensure compliance with security best practices and organizational standards.

Troubleshoot production issues and provide operational support for AI platforms.

Required Qualifications

Bachelors degree in Computer Science Information Technology Engineering or a related field.

5 years of DevOps Platform Engineering or Cloud Engineering experience.

Hands-on experience with Kubernetes administration and container orchestration.

Strong experience with Docker and containerized application deployment.

Expertise in Infrastructure as Code (IaC) tools such as Terraform CloudFormation or Pulumi.

Experience building and maintaining CI/CD pipelines using Jenkins GitHub Actions GitLab CI Azure DevOps or similar tools.

Proficiency in scripting and automation using Python Bash or PowerShell.

Experience with Linux system administration.

Strong understanding of networking security and cloud architecture principles.

Preferred Qualifications

Experience supporting AI/ML platforms and MLOps workflows.

Hands-on experience with Kubeflow MLflow Airflow or similar MLOps tools.

Experience deploying Large Language Models (LLMs) Generative AI applications or AI inference workloads.

Knowledge of GPU-enabled Kubernetes environments and AI infrastructure.

Experience with vector databases and AI-serving platforms.

Relevant cloud certifications (AWS Azure or GCP).

Kubernetes certifications (CKA CKAD or CKS).

Technical Skills

Containerization & Orchestration

Docker

Kubernetes

Helm

OpenShift (preferred)

Infrastructure as Code

Terraform

CloudFormation

Pulumi

Ansible

Cloud Platforms

AWS

Microsoft Azure

Google Cloud Platform (GCP)

CI/CD & Automation

Jenkins

GitHub Actions

GitLab CI/CD

Azure DevOps

Monitoring & Logging

Prometheus

Grafana

ELK Stack

Datadog

Programming/Scripting

Python

Bash

PowerShell

If interested please share your detailed latest resume along with the required info to speed up the process.


Thanks & Regards

Ahmed Ali

IT Recruiter

Momento USA Exceeding Customer Expectations

440 Benigno Blvd Unit#A-5 2nd Floor Interstate Business Park Bellmawr NJ 08031

Phone No: Ext: 1026 Fax: Email: Ahmed@ Web:

Linkedin : Momento USA is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard to race color religion sex pregnancy sexual orientation gender identity national origin age protected veteran status or disability status.

Momento USA is a global technology consulting talent acquisition and creative development firm that addresses clients most pressing needs and challenges. We currently looking for Sr AI DevOps Engineer for a client based out in San Jose CA. Please see the job description below for your reference. ...