Research Engineer In Open-Source Machine Learning

Not Interested
Bookmark
Report This Job

profile Job Location:

Eindhoven - Netherlands

profile Monthly Salary: Not Disclosed
Posted on: 5 days ago
Vacancies: 1 Vacancy

Job Summary

Departments Department of Mathematics and Computer Science

Introduction

Would you like to work on open source software full-time Do you want to be part of a team that does advanced AI research and builds tools to make machine learning research more accessible and reproducible Join us to build the next generation of OpenML an open platform for sharing AI artifacts.

Job Description

OpenML is a popular open science platform for sharing interconnected AI artifacts (e.g. datasets models and benchmarks) using open standards and structured APIs. Hosted at TU Eindhoven OpenML serves over 300000 users has supported over 1600 scientific studies and won the Dutch Data Prize. We are looking for an excellent engineer to significantly redesign it and build the next generation of the OpenML platform.

There are two positions available:

  • Position 1: Improve OpenML itself. This position is for 1.5 years to start as soon as possible.
  • Position 2: Integrate OpenML with other data and compute infrastructure. This position is for 3.5 years to start around June-July 2026.

Position 1: Improve OpenML itself

This project will help strengthen the impact and sustainability of OpenML by:

  • Modernizing Infrastructure to simplify deployment and streamline community contributions.
  • Improving Open Data enriching OpenML metadata with FAIR-aligned elements (e.g. data cards and model cards) supporting diverse data types and modalities and allowing integration with other open science infrastructures.
  • Enhancing User Experience through better interfaces and collaboration features.
  • Engaging the Scientific Community ensuring that the platform meets the evolving needs of the open science community.

You will focus primarily on task 1 and 2 specific responsibilities include:

  • Update the technical infrastructure components of OpenML to more modern and contributor-friendly technologies.
  • Enrich metadata by building on community standards such as Croissant including quality metrics DOI references data provenance (e.g. data cards and model cards) and responsible AI metadata.
  • Extend support for more data modalities (e.g. imagery time series text and multi-modal data) and domain-specific open data formats (e.g. genomic data).
  • Streamline data loading into AI tools to facilitate AI experimentation in many scientific workflows.
  • Update developer tools and documentation to speed up the onboarding of new community contributors and streamline maintenance.
  • Close collaboration with another hire for the project based in Leiden University that focuses on task 3 and 4.

Position 2: Integrate OpenML with other data and compute infrastructure

For this position you will help seamlessly integrate OpenML with other Dutch and international infrastructures (e.g. SURF NLeSC Hugging Face) to transform it into a next-generation unified platform for AI:

  • Allow people to access datasets and models uniformly from various hosting infrastructures.
  • Easily train and benchmark AI models transparently on scalable computational infrastructure.
  • Organize all data and results automatically for easier reuse and better reproducibility.

Specific key tasks include:

  • Redesign the OpenML platform architecture to streamline federation.
  • Update the OpenML backend and APIs to implement integrations with other APIs (e.g. Hugging Face libraries Harvard Dataverse) and compute infrastructure.
  • Improve experiment tracking from AI tools (e.g. PyTorch) using a tracking API or callbacks with a focus on global community collaboration and better reproducibility.
  • Accelerate AI-driven research: support team-based real-time collaboration share reproducible results via permalinks in papers track impact metrics (e.g. dataset reuse) integrate with code repositories (e.g. GitHub) and publishing platforms (e.g. ArXiv).
  • Continuous infrastructure improvement incorporating community feedback improving scalability and simplifying maintenance.

Both positions are hosted by the research group on Advanced Models by Open Research & Engineering (AMOR/e) providing an ideal environment for advanced AI research and collaboration. You will work directly with AMOR/es engineering team including Pieter Gijsbers Subhaditya Mukherjee and Joaquin Vanschoren.

You will also be supported by project partners in Leiden University the SURF supercomputing center and the Dutch e-Science Center. Beyond that you work together with the wider OpenML community especially the core contributors as well as the Croissant community (led by Google and the Open Data Institute) and developers from other AI platforms such as Hugging Face.

For more information on the AMOR/e group visit
For more information on OpenML visit

A more complete description of the positions is also available on the AMOR/e website.

Job Requirements

  • Required: proficiency in Python
  • Required: familiarity with at least some parts of our stack: Linux Docker web services preferably also S3 Kubernetes and Elasticsearch.
  • Experience with open source development is an asset.
  • A collaborative spirit and good communication skills (in English).
  • A passion for AI.

If you are concerned that you dont meet all the requirements sufficiently but the job does excite you then be sure to respond. We would love to look at your profile.

Conditions of Employment

An exciting position within an international yet personal university. You are right in the middle of the students on a green campus within walking distance of the central station. Besides beautiful architecture you will find varied workplaces and excellent sports facilities. We also offer you:

Here you can discover even more information about our conditions of employment. Build on your career at TU/e!

About us

We are a leading international university where scientific curiosity meets a hands-on mindset. We work in an open and collaborative way with high-tech industries to tackle complex societal challenges. Our responsible and respectful approach ensures impact today and in the future. TU/e is home to over 13000 students and more than 7000 staff forming a diverse and vibrant academic community.

Our university is located in Brainport Eindhoven a worldleading tech region with more than 7000 hightech companies and strong R&D activity. Known for breakthroughs in AI photonics semiconductors and advanced manufacturing Brainport is a place where technology serves people and society. Learn more about the Brainport region here.

With over 110 (assistant associate and full) professors almost 300 PhD and EngD students about 1.500 Bachelor students and 1000 Master students the Department of Mathematics and Computer Science (M&CS) is the largest department of the TU/e. By performing top-level fundamental and applied research and maintaining strong ties with industry M&CS aims to contribute to science and innovation in and beyond the region.


Information

Do you recognize yourself in this profile and would you like to have more information about the positionthat is not on our website () please contact the hiring managers at .

For questions about the application process or the conditions of employment please contact .

Are you inspired and would like to know more about working at TU/e Please visit our career page.


Application

If you are interested please use the apply button to send us your CV and letter of application with your motivation and a description of your that you submit all the requested application documents. We give priority to complete applications.

Screening of candidates begins as soon as applications are received and continues until the position is filled. Where applicable internal candidates will be given priority over external candidates where they are equally suitable.

We look forward to receiving your application!

Please note

  • You can apply online. We will not process applications sent by email and/or post.
  • A pre-employment screening (e.g. knowledge security check) can be part of the selection procedure. For more information on the knowledge security check please consult the National Knowledge Security Guidelines.
  • Important for non-EU applicants: Please be aware that for this position specific residence permit requirements apply. If you are a non-EU national you may not be eligible to legally work in this role under current Dutch immigration regulations. We strongly advise you to contact our Staff Immigration Team () before applying to check your eligibility and to receive further guidance.
  • Please do not contact us for unsolicited services.

Share links

Return to job vacancies


Required Experience:

IC

Departments Department of Mathematics and Computer Science IntroductionWould you like to work on open source software full-time Do you want to be part of a team that does advanced AI research and builds tools to make machine learning research more accessible and reproducible Join us to bu...
View more view more