Infrastructure Engineer
Location: San Francisco, CA. On-site. Full-time.
About the job
Advantra is partnering with an early-stage biotech ML startup in San Francisco to hire an Infrastructure Engineer to scale and maintain its machine learning inference system.
About the role
You will build and maintain infrastructure that serves over 150 biological ML models. You will scale the platform to handle growing workloads while ensuring reliability and high availability. You will work with containerized workloads, optimize resource usage, and collaborate with the founders to meet customer and system requirements.
Responsibilities
- Design and maintain scalable infrastructure for biological ML models.
- Manage containerized workloads using Kubernetes and other orchestration tools.
- Optimize compute, storage, and GPU resource allocation.
- Ensure high availability and reliability of model serving systems.
- Collaborate with founders to design infrastructure for variable workloads.
- Automate deployment, monitoring, and scaling processes.
- Troubleshoot production issues and improve system performance.
- Evaluate and integrate new tools and technologies to support growth.
- Document infrastructure architecture and operational procedures.
Core requirements
- 3 years of programming experience in Python, Go, or similar languages.
- 2 years of experience with containerization and orchestration concepts.
- 1 year of managing cloud infrastructure on AWS, GCP, or Azure.
- Experience automating deployment and monitoring tasks.
- Experience scaling production systems to support 100 models.
- Familiarity with GPU workloads and scheduling.
- Experience with infrastructure-as-code tools.
- Strong problem-solving and troubleshooting skills.
- Experience in fast-paced startup environments or small engineering teams.
- Located in or willing to relocate to the SF Bay Area.
Nice to have
- Advanced Kubernetes deployment and cluster management.
- Experience with Terraform, Pulumi, or similar IaC tools.
- Observability and monitoring tools such as Prometheus or Grafana.
- Experience with Bio-ML models or computational biology workloads.
- Familiarity with CI/CD pipelines for ML systems.
Compensation
Competitive base plus equity.
How to apply
Email with links to shipped work and a short note on what you owned. The company name is shared after the intro.
About Us
Advantra-Upstart Crew is a search program inside Advantra Consulting. We partner with early-stage and high-growth startups to hire the top 2% in Tech and GTM. We run end-to-end searches, from single-role headhunts to full team build-outs, using domain experts and a vetted network to deliver tight shortlists. We stay close to founders and candidates so the relationship lasts beyond the first hire.