Founding LLM Inference Engineer (replacement search, exclusive)

Employer Active

Job Location

San Francisco, CA - USA

Salary

$200K - $300K

Vacancy

1 Vacancy

Job Description

Founding LLM Inference Engineer


Full-time | On-site | San Francisco, CA


Compensation: $200K-$300K + 0.10%-1.00% Equity


About the Role


We're looking for a Founding LLM Inference Engineer to architect and optimize large-scale inference systems powering cutting-edge AI applications. You'll be building the backbone of an AI platform used by top enterprises, with a focus on performance, scalability, and reliability.


This is a hands-on, high-impact role where you'll collaborate closely with research and product teams, moving fast to bring breakthrough model capabilities into production. If you're excited about low-latency systems, high-throughput pipelines, and deploying bleeding-edge LLMs, this role is for you.


Tech stack: Python, CUDA, LLMs, API integrations, TGI, vLLM, TensorRT-LLM


What You'll Do


  • Architect and implement scalable inference systems for state-of-the-art models
  • Optimize infrastructure for high throughput and low latency at scale
  • Develop and integrate advanced inference optimization techniques
  • Collaborate with research teams to productionize new model capabilities
  • Build developer tools and infra to support rapid experimentation and deployment


What We're Looking For


  • Deep expertise in LLM inference optimization and deployment at scale
  • Strong background in Python and GPU programming (CUDA)
  • Experience with serving frameworks (TGI, vLLM, TensorRT-LLM)
  • Proven track record of shipping production-grade AI systems
  • Excitement about building foundational infra at an early-stage AI startup


Benefits


  • Competitive salary + equity (0.10%-1.00%)
  • Health, dental, and vision insurance
  • Daily team lunches and wellness stipend
  • Unlimited PTO, flexible parental leave
  • On-site role in San Francisco (5 days a week)


Ready to take the next step?


Apply now or email Jenn to learn more.


Employment Type

Full Time

