Our client operates a high-scale consumer platform reaching millions of users globally. The company is investing heavily in its data foundation to power programmatic advertising personalized recommendations and marketplace analytics. The data team is expanding to support billions of daily events and petabyte-scale datasets across advertising and ecommerce.
Open Roles & Teams
The client is hiring across several domains and levels:
Data Platform Core lakehouse real-time streaming governance cost optimization
Programmatic Advertising Data DSP/SSP logs conversion modeling attribution incrementality signal quality
ML Data: Feature stores like Feast/Tecton online-offline feature parity point-in-time correctness training data infrastructure
For Architect / Principal Levels
8 years designing distributed data systems with 50 downstream engineers/analysts as customers
Experience leading 01 architecture for critical domains such as real-time advertising attribution or marketplace analytics
Demonstrated impact on data infrastructure cost optimization at $1M annual cloud spend
Knowledge of privacy-enhancing technologies: differential privacy data clean rooms secure multi-party compute
For Team Lead/Head of Data Engineering:
Required Qualifications
Technical
10 years in data engineering with 4 years leading teams of 5 engineers
Proven experience architecting and operating petabyte-scale data platforms in programmatic advertising ecommerce marketplaces or consumer internet
Deep expertise with distributed systems: Spark Flink Kafka and cloud data warehouses/lakes
Hands-on experience with real-time and batch paradigms. Can still read code and dive deep on design
Strong background in data governance and privacy for advertising data: GDPR CCPA SKAN consent frameworks clean rooms
Leadership
Track record hiring and developing senior engineers and tech leads
Experience running an org with multiple product areas: platform ads data ecommerce data ML data
Excellent cross-functional influence with Product Engineering and Executive stakeholders
History of delivering 01 initiatives and scaling them to org-wide standards
Managed budgets and vendor contracts $1M
Domain
Deep understanding of data challenges in programmatic advertising: RTB identity resolution attribution with signal loss incrementality
OR deep understanding of marketplace/ecommerce data: catalog scale pricing experiments inventory seller analytics
Experience balancing speed cost and compliance in a regulated environment
Technology Environment
Cloud & Infra: AWS/GCP Kubernetes Terraform Storage & Query: S3/GCS Apache Iceberg/Delta Lake/Hudi BigQuery Snowflake Trino Processing & Orchestration: Spark Flink Kafka Airflow DBT Real-time Analytics: Druid Pinot ClickHouse ML Infra: Feature Stores Ray Kubeflow Data Observability: DataDog Monte Carlo OpenLineage
Our client operates a high-scale consumer platform reaching millions of users globally. The company is investing heavily in its data foundation to power programmatic advertising personalized recommendations and marketplace analytics. The data team is expanding to support billions of daily events and...
Our client operates a high-scale consumer platform reaching millions of users globally. The company is investing heavily in its data foundation to power programmatic advertising personalized recommendations and marketplace analytics. The data team is expanding to support billions of daily events and petabyte-scale datasets across advertising and ecommerce.
Open Roles & Teams
The client is hiring across several domains and levels:
Data Platform Core lakehouse real-time streaming governance cost optimization
Programmatic Advertising Data DSP/SSP logs conversion modeling attribution incrementality signal quality