drjobs Staff AI Infrastructure Engineer

Staff AI Infrastructure Engineer

Employer Active

1 Vacancy
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

Mountain View, CA - USA

Monthly Salary drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Vacancy

1 Vacancy

Job Description

What are we looking for

Were seeking a Staff AI Infrastructure Engineer with deep expertise in building automating and managing AI infrastructure at scale. You will be instrumental in designing and maintaining the systems essential for serving and deploying AI models efficiently and securely across diverse cloud environments.

What will you do

As a Staff AI Infrastructure Engineer youll join our globally distributed team to:

  • Architect build and maintain scalable infrastructure to host and serve AI products and models reliably.
  • Automate infrastructure deployment and management using Helm ArgoCD and Terraform.
  • Manage and optimize Kubernetes clusters to support high-performance AI workloads.
  • Implement and manage CI/CD pipelines utilizing GitHub Actions and Jenkins.
  • Ensure infrastructure compliance with security standards including FedRAMP and related guidelines.
  • Collaborate closely with AI engineering product teams and DevOps to meet infrastructure requirements.
  • Monitor infrastructure health and performance implementing optimizations proactively.
  • Drive infrastructure best practices and mentor team members to foster technical excellence.

What skills and experience should you bring

We are looking for an experienced infrastructure engineer who has:

  • A degree in Computer Science Information Technology or related field or equivalent practical experience.
  • 7 years of experience managing scalable secure and resilient infrastructure for AI and machine learning applications.
  • Deep proficiency with infrastructure-as-code tools like Helm Terraform and ArgoCD.
  • Extensive hands-on experience with Kubernetes for deploying containerized workloads.
  • Demonstrated experience with major cloud platforms (AWS GCP Azure) specifically with services related to AI model hosting (e.g. Azure OpenAI).
  • Experience implementing and managing CI/CD pipelines (GitHub Actions Jenkins).
  • Familiarity with compliance frameworks particularly FedRAMP and security best practices.
  • Strong scripting and automation skills using Python Bash or similar languages.
  • Excellent problem-solving skills creativity and self-driven motivation.

Exceptional candidates will also bring expertise in:

  • Previous experience as a Site Reliability Engineer (SRE) particularly in AI or ML contexts.
  • Monitoring and logging tools (Prometheus Grafana Datadog Jaeger).
  • Networking concepts and security best practices within cloud infrastructure.
  • Professional certifications in Kubernetes or cloud platforms (AWS Azure GCP).

Why Us

You will be joining a cutting-edge company where you will tackle extraordinary challenges and work with the very best in the industry.

  • Medical Vision Dental 401(k) Commuter Health and Dependent FSA
  • Unlimited PTO
  • Industry-leading gender-neutral parental leave
  • Paid Company Holidays
  • Paid Sick Time
  • Employee stock purchase program
  • Disability and life insurance
  • Employee assistance program
  • Gym membership reimbursement
  • Cell phone reimbursement
  • Numerous company-sponsored events including regular happy hours and team-building events

Required Experience:

Staff IC

Employment Type

Full-Time

Company Industry

About Company

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.