drjobs AI Software Engineer Model Evaluation (fmd)

AI Software Engineer Model Evaluation (fmd)

Employer Active

1 Vacancy
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

Heidelberg - Germany

Monthly Salary drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Vacancy

1 Vacancy

Job Description

Our Mission
Aleph Alpha Researchs mission is to deliver AI innovation that enables open accessible and trustworthy deployment of GenAI in enterprise applications. Our organization develops foundational models and next-generation methods that make it easy and affordable for Aleph Alphas customers to increase productivity in finance administration R&D logistics and manufacturing processes. We do this with a flat hierarchy and IC-driven culture: ideas come from the bottom up and its our shared responsibility to deliver impactful research.

Were looking for skilled Software Engineers to join our research team headquartered in Heidelberg with a focus on evaluating the capabilities safety and trustworthiness of our models. While we highly value in-person work we offer flexibility to work from Berlin or elsewhere in Germany with regular travel to onsite events.

Your responsibilities

As an AI Software Engineer in Model Evaluation you will help design implement and scale the systems that measure our models performance at the cutting edge. You will work closely with researchers to create evaluation benchmarks datasets and environments that test model capabilities safety and reliability across tasks from multilingual understanding to mathematical reasoning and creativity.

You will own significant portions of our evaluation infrastructure including dataset generation pipelines automated benchmarking tools analysis dashboards and large-scale evaluation orchestration on our compute clusters. Youll be building tools and experiments that drive product decisions shape research priorities and guide responsible deployment of our models.

This is high-scale high-impact engineering: youll work with petabyte-scale data run evaluations across large-scale distributed GPU clusters and deliver insights that inform the direction of Aleph Alphas research.

Our current open source eval-framework can be found here.

You can expect to contribute to the following areas:

Your profile

We hire slowly and deliberately. We recognise that we need top talent to deliver top research and we value ability over experience: if you think you would be a good fit for this role wed encourage you to apply even if you do not meet all of the following qualifications.

Basic Qualifications

Preferred Qualifications

We do not require prior experience in AI for this role but we value eagerness to learn. If you have prior experience in AI we will be particularly excited about your ability to translate evaluation insights into actionable improvements for models and systems.

Our tenets

We believe embodying these values would make you a great fit in our team:

What you can expect from us

Employment Type

Full-Time

About Company

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.