Data Engineer

Not Interested
Bookmark
Report This Job

profile Job Location:

Bengaluru - India

profile Monthly Salary: Not Disclosed
Posted on: Yesterday
Vacancies: 1 Vacancy

Job Summary

What awaits you/ Job Profile

We are seeking a Data Engineer Ticket & Embedding Pipeline Specialist to join our BMW Techworks teams.
You will build maintain and scale ingestion pipelines following BMW-standard ingestion patterns for CDH and Octane.
The role includes cleaning validating and transforming ticket data implementing robust PII/secret scrubbing and operating metadata embedding pipelines.
You will also support data quality dashboards drift monitoring and ensure data compliance with BMW governance and schema guidelines.

What should you bring along

3 years of hands-on data engineering experience
Strong experience with ingestion ETL/ELT pipelines and large-scale datasets
Ability to work with semi-structured/unstructured ticket data
Experience validating normalizing and cleaning data for downstream systems
Understanding of PII handling scrubbing logic and data governance principles

Must have technical skill

Ingestion & Data Processing

Python (Pandas/Numpy)
Building ingestion workflows for CDH / Octane
Data cleaning validation and normalization
Implementing PII and secret scrubbing mechanisms

Embedding & Metadata Pipelines

DynamoDB for metadata embedding storage
Managing embedding generation and backfill logic

Data Storage & Querying

S3 data lake organization (Parquet formats)
Athena/Glue for querying transformation and schema enforcement

Monitoring & Governance

Data drift checks data quality dashboards
Adherence to BMW data governance schema and access structures

Good to have technical skills

Experience with workflow orchestration (Step Functions Airflow Prefect)
Understanding of RAG/embedding pipelines in production
Familiarity with CI/CD for data pipelines
Exposure to BMW internal data tools and access frameworks


Required Experience:

Senior IC

What awaits you/ Job ProfileWe are seeking a Data Engineer Ticket & Embedding Pipeline Specialist to join our BMW Techworks teams. You will build maintain and scale ingestion pipelines following BMW-standard ingestion patterns for CDH and Octane. The role includes cleaning validating and transformi...
View more view more

Key Skills

  • Apache Hive
  • S3
  • Hadoop
  • Redshift
  • Spark
  • AWS
  • Apache Pig
  • NoSQL
  • Big Data
  • Data Warehouse
  • Kafka
  • Scala