AI Inference Engineer - Large Language Models (f/m/d)

Aleph Alpha

Posted on: 06-04-2025

Employer Active

1 Vacancy

Job Location

Heidelberg - Germany

Monthly Salary

Not Disclosed

Job Description

Overview:

You will join our product team in a position that sits at the intersection of artificial intelligence research and real-world solutions. We foster a highly collaborative work culture: you can expect to work closely with your teammates, and teams communicate frequently through practices such as pair or mob programming.

Your responsibilities:

Model Inference: Focus on inference optimization to ensure rapid response times and efficient resource utilization during real-time model interactions.
Hardware Optimization: Run models on various hardware platforms, from high-performance GPUs to edge devices, ensuring optimal compatibility and performance.
Experimentation and Testing: Regularly run experiments, analyze outcomes, and refine the strategies to achieve peak performance in varying deployment scenarios.
Stay up to date with the current literature on MLSys.

Your profile:

You care about making something people want. You want to ship something that will bring value to our users. You want to deliver AI solutions end-to-end, not just finish building a prototype.
Bachelor's degree or higher in computer science or a related field.
You understand how multimodal transformers work.
You understand the characteristics of LLM inference (KV caching, flash attention, and model parallelization).
You have hands-on experience with large language models or other complex AI architectures.
You have experience in system design and optimization, particularly within AI or deep learning contexts.
You are proficient in Python and have a deep understanding of deep learning frameworks such as PyTorch.
You have a deep understanding of the challenges associated with scaling AI models for large user bases.

Nice if you have:

Previous experience in a high-growth tech environment or a role focused on scaling AI solutions.
Expertise with CUDA and Triton programming and GPU optimization for neural network inference.
Experience with Rust.
Experience in adapting AI models to suit a range of hardware, including different accelerators.
Experience in model quantization, pruning, and other neural network optimization methodologies.
A track record of contributions to open-source projects (please provide links).
Some Twitter presence discussing MLSys topics.

What you can expect from us:

Become part of an AI revolution!
30 days of paid vacation
Access to a variety of fitness & wellness offerings via Wellhub
Mental health support through nilo.health
Substantially subsidized company pension plan for your future security
Subsidized Germany-wide transportation ticket
Budget for additional technical equipment
Flexible working hours for better work-life balance and a hybrid working model
Virtual Stock Option Plan
JobRad Bike Lease

Employment Type

Full-Time
