Employer Active
Job Alert
You will be updated with latest job alerts via emailJob Alert
You will be updated with latest job alerts via email$ 200 - 300
1 Vacancy
Founding LLM Inference Engineer
Full-time On-site San Francisco CA
Compensation: $200K $300K 0.10%1.00% Equity
About the Role
Were looking for a Founding LLM Inference Engineer to architect and optimize large-scale inference systems powering cutting-edge AI applications. Youll be building the backbone of an AI platform used by top enterprises with a focus on performance scalability and reliability.
This is a hands-on high-impact role where youll collaborate closely with research and product teams moving fast to bring breakthrough model capabilities into production. If youre excited about low-latency systems high-throughput pipelines and deploying bleeding-edge LLMs this role is for you.
Tech stack: Python CUDA LLMs API integrations TGI vLLM TensorRT-LLM
What Youll Do
What Were Looking For
Benefits
Ready to take the next step
Apply now or email Jenn at to learn more.
Full Time