Data Engineer

Owkin

Not Interested
Bookmark
Report This Job

profile Job Location:

Paris - France

profile Monthly Salary: Not Disclosed
Posted on: Yesterday
Vacancies: 1 Vacancy

Job Summary

About us

Owkin is an AI company on a mission to solve the complexity of biology. It is building the first Biology Super Intelligence (BASI) by combining powerful biological large language models multimodal patient data and agentic software. At the heart of this system is Owkin K an AI copilot and its new LLM fine-tuned on biology called Owkin Zero used by researchers clinicians and drug developers to better understand biology validate scientific hypotheses and deliver better diagnostics and therapies faster.

About the role:

As a Data Engineer you will be part of the Engineering team supporting the development and maintenance of data pipelines for scientific processing and quality assurance. You will participate in designing optimizing and maintaining ETL/ELT pipelines using Airflow working within established frameworks to ensure reliability scalability and compliance with data governance standards.

Your primary responsibilities will include organizing and structuring data systems ensuring accurate reporting of pipeline performance and contributing to scientific and healthcare data processing workflows. The role requires attention to detail the ability to manage multiple priorities and strong collaboration skills to work effectively with engineers data scientists and researchers.

You will focus on streamlining production workflows ensuring proper monitoring and operational efficiency and implementing best practices for data governance and security.

  • Operate and optimize ETL/ELT pipelines using Airflow.
  • Support the structuring and organization of data systems in alignment with predefined architectures.
  • Ensure timely and accurate reporting of data pipeline performance and operational issues.
  • Follow data governance security and compliance standards in all data processing activities.
  • Work on containerized data infrastructures using Docker and Kubernetes under supervision.
  • Contribute to operational tasks related to scientific data processing and quality control.
  • Implement optimizations in Python and SQL-based workflows following team guidelines.
  • Work within established frameworks for data lake and data warehouse maintenance.
  • Collaborate with engineers and researchers to define data processing requirements.
  • Contribute to the standardization and monitoring of production data workflows.

In particular you will:

  • Support the design and optimization of data pipelines using Airflow.
  • Develop and operate Python and SQL-based solutions for data processing.
  • Contribute to the development of scalable ETL/ELT pipelines to process and transform datasets.
  • Work closely with data scientists business developers software engineers and biomedical researchers to deliver high-quality data solutions.
  • Contribute to management and monitoring of containerized data infrastructures with Docker Kubernetes and cloud platforms.
  • Follow best practices for data governance security and compliance in all workflows.
  • Operate on the data architectures including data lakes data warehouses and analytical insights platforms.
  • Contribute to the productionization of data processing pipelines ensuring efficiency and scalability in scientific data workflows.

Position is based in our Paris office or remotely in France.

About you

  • Proficiency in Python and SQL.
  • Familiarity with Airflow for workflow orchestration.
  • Familiarity with cloud-based data storage and cloud-native processing concepts.
  • Familiarity with containerization technologies such as Docker and Kubernetes.
  • Knowledge of data governance and security fundamentals.
  • Ability to work with structured and unstructured datasets in predefined formats.

Please submit your CV in English

#LI-MD1

What we offer

  • Flexible work organization
  • Friendly and informal working environment
  • Opportunity to work with an international team with high technical and scientific backgrounds

Recruitment Process & Security

  • Please complete the form and submit your CV.
  • Owkin is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race sex gender sexual orientation age color religion national origin protected veteran status or on the basis of disability.
  • Owkin is a great place to work. As a coveted workplace we are unfortunately vulnerable to recruitment phishing scams. We urge all job seekers and candidates to be wary of potential scams. Most of these have individuals posing as representatives of prominent companies including Owkin with the aim of obtaining personal sensitive or financial information from applicants. These scams prey upon an individuals desire to obtain a job and can sometimes feel like a genuine recruitment process. Some red flags are identified below. Should you encounter a recruitment process that claims to be for Owkin but is not consistent with the below please do not provide any personal or financial information:
  • Legitimate Owkin recruitment processes include communication with candidates through recognized professional networks such as LinkedIn.
  • Communication is always through an official Owkin email address (from the @ domain) over the phone or through our applicant tracking system (Greenhouse).
  • The Owkin talent team do use platforms such as LinkedIn and Job Teaser however if you have any concern or doubt about this contact please ask for them to send an email from @.
  • The Owkin talent team will not solicit personal data from candidates during the application phase including but not limited to date of birth social security numbers or bank account information;
  • Legitimate Owkin interviews may be conducted over the phone in person or via an approved enterprise videoconferencing service (Google Meets). They will not occur via Signal Telegram or Messenger
  • Owkin offers of employment are based on merit and only extended once a candidate has interviewed with members of the talent and hiring team. Offers will be extended both verbally and in written format.

If you think that you have been a victim of fraud

About usOwkin is an AI company on a mission to solve the complexity of biology. It is building the first Biology Super Intelligence (BASI) by combining powerful biological large language models multimodal patient data and agentic software. At the heart of this system is Owkin K an AI copilot and its...
View more view more

Key Skills

  • Apache Hive
  • S3
  • Hadoop
  • Redshift
  • Spark
  • AWS
  • Apache Pig
  • NoSQL
  • Big Data
  • Data Warehouse
  • Kafka
  • Scala