Founding LLM Inference Engineer (replacement search, exclusive)

Care Dynamics

Posted on : 05-10-2025

Employer Active

1 Vacancy

Job Alert

You will be updated with latest job alerts via email

Valid email field required

Send jobs

Send me jobs like this

Job Alert

You will be updated with latest job alerts via email

Valid email field required

Send jobs

Job Location

San Francisco, CA - USA

Monthly Salary

$ 200 - 300

Vacancy

1 Vacancy

Posted on : 05-10-2025

Job Description

Founding LLM Inference Engineer

Full-time On-site San Francisco CA

Compensation: $200K $300K 0.10%1.00% Equity

About the Role

Were looking for a Founding LLM Inference Engineer to architect and optimize large-scale inference systems powering cutting-edge AI applications. Youll be building the backbone of an AI platform used by top enterprises with a focus on performance scalability and reliability.

This is a hands-on high-impact role where youll collaborate closely with research and product teams moving fast to bring breakthrough model capabilities into production. If youre excited about low-latency systems high-throughput pipelines and deploying bleeding-edge LLMs this role is for you.

Tech stack: Python CUDA LLMs API integrations TGI vLLM TensorRT-LLM

What Youll Do

Architect and implement scalable inference systems for state-of-the-art models
Optimize infrastructure for high throughput and low latency at scale
Develop and integrate advanced inference optimization techniques
Collaborate with research teams to productionize new model capabilities
Build developer tools and infra to support rapid experimentation and deployment

What Were Looking For

Deep expertise in LLM inference optimization and deployment at scale
Strong background in Python and GPU programming (CUDA)
Experience with serving frameworks (TGI vLLM TensorRT-LLM)
Proven track record of shipping production-grade AI systems
Excitement about building foundational infra at an early-stage AI startup

Benefits

Competitive salary equity (0.10%1.00%)
Health dental and vision insurance
Daily team lunches and wellness stipend
Unlimited PTO flexible parental leave
On-site role in San Francisco (5 days a week)

Ready to take the next step

Apply now or email Jenn at to learn more.

Employment Type

Full Time

Company Industry

Key Skills

Apply Now

About Company

Care Dynamics

Report This Job

Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.

Start Now

Dr.Job AutoApply

3X your job search with AutoApply's AI for faster dream job results.

Founding LLM Inference Engineer (replacement search, exclusive)

Care Dynamics

Job Description

Employment Type

Company Industry

Key Skills

About Company

Similar Jobs

Staff Systems Engineer Developer Experience

Sr. Systems Engineer- Developer Experience

Sr. Software Engineer Production Support (Hybrid Onsite)

Sales Engineer Civil Engineer

MECHANICAL ENGINEER

ELECTRICAL ENGINEER

Security engineer

Senior #Go Engineer