Data Engineer - AI/ML

Job Location

Mississauga - Canada

Monthly Salary

Not Disclosed

Vacancy

1 Vacancy

Job Description

At Roche, you can show up as yourself, embraced for the unique qualities you bring. Our culture encourages personal expression, open dialogue, and genuine connections, where you are valued, accepted, and respected for who you are, allowing you to thrive both personally and professionally. This is how we aim to prevent, stop, and cure diseases and ensure everyone has access to healthcare today and for generations to come. Join Roche, where every voice matters.

The Position

A healthier future. That's what drives us.

Galileo is a strategic Roche Informatics program aiming to enable high-value AI use cases at Roche (initial focus: Generative AI, or GenAI) through fit-for-purpose platforms and services, establishing a foundation for a Center of Excellence in AI. The recently formed Use Case Delivery (UCD) Team, consisting of a number of delivery squads, is tasked with building innovative GenAI applications.

We are looking for a highly skilled and dedicated Data Engineer to join a new AI solutions development squad that will be building cutting-edge applications leveraging Large Language Models (LLMs). We will be building AI solutions end-to-end: from concept through prototyping and productization to operations. The Data Engineer will be responsible for designing, building, and maintaining robust data infrastructure to support AI applications. The ideal candidate will have expertise in handling structured and unstructured data, vector databases, real-time data processing, and cloud-based AI solutions (AWS or Azure).

The Opportunity:

  • Generative AI Application Co-creation: Collaborate with AI engineers, data scientists, product owners, and other developers in Agile teams to integrate LLMs into scalable, robust, fair, and ethical end-user applications, focusing on user experience, relevance, and real-time performance

  • Data Infrastructure Development and Data Integration: Design and implement scalable, high-performance data pipelines for AI/GenAI applications, ensuring efficient data ingestion, transformation, storage, and retrieval; integrate different databases, requiring understanding of data architectures / domain data ecosystem

  • Vector Database Management: Work with vector databases (e.g., AWS OpenSearch or Azure AI Search) to store and retrieve high-dimensional data for Generative AI workloads

  • Cloud-Based Data Engineering: Build and maintain cloud-based data solutions using AWS (OpenSearch, S3) or Azure (Azure AI Search, Azure Blob Storage)

  • Snowflake Implementation: Design and optimize data storage and processing using Snowflake for scalable, cloud-native analytics solutions

  • Data Processing & Transformation: Develop ETL/ELT pipelines to enable real-time and batch data processing

  • Support AI Model Workflows: Collaborate with AI/ML Engineers and Data Scientists to ensure seamless integration of data pipelines with AI fine-tuning, inference, and training workflows

  • Performance Optimization: Optimize data storage, retrieval, and processing strategies for efficiency, scalability, and cost-effectiveness

Who you are:

  • Experience: A minimum of 5-7 years in data engineering, preferably supporting AI/ML applications, and hold . . or higher or equivalent in Computer Science, Data Engineering, or related fields

  • Programming: Proficiency in Python, SQL, and vector database native languages

  • Databases: Experience with relational, NoSQL, and vector databases, and Snowflake in particular

  • Cloud Platforms: Hands-on experience with AWS (OpenSearch, S3, Lambda) or Azure (Azure AI Search, Azure Blob Storage, Azure Automation)

  • ETL/ELT Pipelines: Experience building scalable ETL/ELT workflows using dbt, Apache Airflow, or similar

  • APIs & Microservices: Ability to design and integrate RESTful APIs for data exchange

  • Data Security & Governance: Understanding of encryption and role-based access controls

  • Version Control & DevOps: Familiarity with Git, CI/CD, containerization (Docker, Kubernetes), and Infrastructure as Code (Terraform, CloudFormation)

  • Generative AI Support: Experience working with AI-specific data needs such as embeddings, RAG (Retrieval-Augmented Generation), and LLM fine-tuning data preparation

Relocation benefits are not available for this job posting.


Who we are

A healthier future drives us to innovate. Together, more than 100,000 employees across the globe are dedicated to advancing science, ensuring everyone has access to healthcare today and for generations to come. Our efforts result in more than 26 million people treated with our medicines and over 30 billion tests conducted using our Diagnostics products. We empower each other to explore new possibilities, foster creativity, and keep our ambitions high, so we can deliver life-changing healthcare solutions that make a global impact.


Let's build a healthier future together.

Roche is an Equal Opportunity Employer.

Employment Type

Full-Time
