What awaits you / Job Profile
We are seeking a Data Engineer Ticket & Embedding Pipeline Specialist to join our BMW Techworks teams. You will build, maintain, and scale ingestion pipelines following BMW-standard ingestion patterns for CDH and Octane. The role includes cleaning, validating, and transforming ticket data; implementing robust PII/secret scrubbing; and operating metadata embedding pipelines. You will also support data quality dashboards and drift monitoring, and ensure data compliance with BMW governance and schema guidelines.
What should you bring along
- 3 years of hands-on data engineering experience
- Strong experience with ingestion, ETL/ELT pipelines, and large-scale datasets
- Ability to work with semi-structured/unstructured ticket data
- Experience validating, normalizing, and cleaning data for downstream systems
- Understanding of PII handling, scrubbing logic, and data governance principles
Must-have technical skills
- Ingestion & Data Processing
  - Python (Pandas/NumPy)
  - Building ingestion workflows for CDH / Octane
  - Data cleaning, validation, and normalization
  - Implementing PII and secret scrubbing mechanisms
- Embedding & Metadata Pipelines
  - DynamoDB for metadata embedding storage
  - Managing embedding generation and backfill logic
- Data Storage & Querying
  - S3 data lake organization (Parquet formats)
  - Athena/Glue for querying, transformation, and schema enforcement
- Monitoring & Governance
  - Data drift checks, data quality dashboards
  - Adherence to BMW data governance, schema, and access structures
Good-to-have technical skills
- Experience with workflow orchestration (Step Functions, Airflow, Prefect)
- Understanding of RAG/embedding pipelines in production
- Familiarity with CI/CD for data pipelines
- Exposure to BMW internal data tools and access frameworks
Required Experience:
Senior IC