o9 D&A – Data lake (RawDomain)

Job Location: Mumbai - India
Monthly Salary: Not Disclosed
Posted on: 2 days ago
Vacancies: 1 Vacancy

Job Description

Are You Ready to Make It Happen at Mondelēz International?

Join our Mission to Lead the Future of Snacking. Make It Uniquely Yours.

Support the day-to-day operations of our GCP-based data pipelines, ensuring data governance, reliability, and performance optimization. Hands-on experience with GCP data services such as Dataflow, BigQuery, Dataproc, Pub/Sub, and real-time streaming architectures is preferred. The ideal candidate is an experienced data pipeline builder and data wrangler who enjoys optimizing data systems and building them from the ground up. The Data Engineer will support our software developers, database architects, data analysts, and data scientists on data initiatives, and will ensure optimal data delivery architecture is consistent throughout ongoing projects. They must be self-directed and comfortable supporting the data needs of multiple teams, systems, and products. The right candidate will be excited by the prospect of optimizing or even re-designing our company's data architecture to support our next generation of products and data initiatives. This role requires a flexible working schedule, including potential weekend support for critical operations, while maintaining a 40-hour work week.

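To make the day-to-day concrete, the following is a minimal, illustrative Apache Beam (Python) sketch of the kind of Pub/Sub-to-BigQuery streaming job that runs on Dataflow. The project, topic, table, and field names are assumptions for the example, not actual MDLZ resources.

    import json
    import apache_beam as beam
    from apache_beam.options.pipeline_options import PipelineOptions


    def parse_event(message: bytes) -> dict:
        """Decode a Pub/Sub message into a flat dict matching the BigQuery schema."""
        record = json.loads(message.decode("utf-8"))
        return {
            "sku": record.get("sku"),
            "store_id": record.get("store_id"),
            "qty": int(record.get("qty", 0)),
            "event_ts": record.get("event_ts"),
        }


    def run():
        # Runner, project, and region are supplied via the standard CLI flags.
        options = PipelineOptions(streaming=True)
        with beam.Pipeline(options=options) as p:
            (
                p
                | "ReadPubSub" >> beam.io.ReadFromPubSub(
                    topic="projects/my-project/topics/pos-events")  # hypothetical topic
                | "Parse" >> beam.Map(parse_event)
                | "WriteBQ" >> beam.io.WriteToBigQuery(
                    "my-project:raw_domain.pos_events",  # hypothetical table
                    write_disposition=beam.io.BigQueryDisposition.WRITE_APPEND,
                    create_disposition=beam.io.BigQueryDisposition.CREATE_NEVER,
                )
            )


    if __name__ == "__main__":
        run()
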
How you will contribute

A key aspect of the MDLZ Data Hub Google BigQuery platform is handling the complexity of inbound data, which often does not follow a global design (e.g., variations in channel inventory, customer PoS hierarchies, distribution, and promo plans). You will assist in ensuring the robust operation of the pipelines that translate this varied inbound data into the standardized o9 global design. This also includes managing pipelines for different data drivers (> 6 months vs. 0-6 months), ensuring consistent input to o9.

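As a sketch of what that harmonization can look like in practice, the Python below maps market-specific field names onto one standard schema and buckets each record by planning horizon (0-6 months vs. > 6 months). All field names and aliases are assumptions for illustration.

    from datetime import date

    # Hypothetical per-market aliases for the same logical columns.
    FIELD_ALIASES = {
        "customer": ["customer", "cust_code", "kunnr"],
        "channel": ["channel", "sales_channel", "vtweg"],
        "qty": ["qty", "quantity", "volume"],
    }


    def standardize(record: dict) -> dict:
        """Map a market-specific record onto a standard input schema."""
        out = {target: next((record[a] for a in aliases if a in record), None)
               for target, aliases in FIELD_ALIASES.items()}
        out["plan_month"] = record["plan_month"]  # assumed ISO "YYYY-MM" strings
        return out


    def horizon_bucket(plan_month: str, today: date) -> str:
        """Classify a record as near-term (0-6 months) or long-term (> 6 months)."""
        year, month = map(int, plan_month.split("-"))
        months_out = (year - today.year) * 12 + (month - today.month)
        return "0-6m" if months_out <= 6 else "6m-plus"


    records = [
        {"kunnr": "C001", "vtweg": "10", "volume": 120, "plan_month": "2025-09"},
        {"cust_code": "C002", "sales_channel": "ecom", "qty": 40, "plan_month": "2026-07"},
    ]
    for r in records:
        std = standardize(r)
        print(horizon_bucket(std["plan_month"], date(2025, 8, 1)), std)
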
What you will bring

A desire to drive your future and accelerate your career. You will bring experience and knowledge in:

  • 6 years of overall industry experience, and a minimum of 6-8 years of experience building and deploying large-scale data processing pipelines in a production environment

  • Focus on excellence: Has practical experience of data-driven approaches; is familiar with the application of data security strategy; is familiar with well-known data engineering tools and platforms

  • Technical depth and breadth: Able to build and operate data pipelines and data storage; has worked on big data architecture within distributed systems; is familiar with infrastructure definition and automation in this context; is aware of technologies adjacent to those they have worked on and can speak to the alternative tech choices to those made on their projects

  • Implementation and automation of Internal data extraction from SAP BW / HANA

  • Implementation and automation of External data extraction from openly available internet data sources via APIs

  • Data cleaning, curation, and enrichment using Alteryx, SQL, Python, R, PySpark, and SparkR (see the PySpark sketch after this list)

  • Preparing consolidated data marts for use by data scientists and managing SQL databases

  • Exposing data via Alteryx and SQL databases for consumption in Tableau

  • Maintaining and updating data documentation

  • Collaboration and workflow using a version control system (e.g., GitHub)

  • Learning ability: Is self-reflective; has a hunger to improve; has a keen interest in driving their own learning; applies theoretical knowledge to practice

  • Build analytics tools that utilize the data pipeline to provide actionable insights into customer acquisition, operational efficiency, and other key business performance metrics.

  • Work with stakeholders, including the Executive, Product, Data, and Design teams, to assist with data-related technical issues and support their data infrastructure needs.

  • Data engineering concepts: Experience working with data lakes, data warehouses, and data marts; has implemented ETL/ELT and SCD concepts (see the SCD sketch after this list).

  • ETL or Data integration tool: Experience in Talend is highly desirable.

  • Analytics: Fluent in SQL and PL/SQL, and has used analytics tools like BigQuery for data analytics

  • Cloud experience: Experienced in GCP services such as Cloud Functions, Cloud Run, Dataflow, Dataproc, and BigQuery.

  • Data sources: Experience working with structured data sources like SAP BW, flat files, RDBMS, etc., and semi-structured data sources like PDF, JSON, XML, etc.

  • Flexible working hours: This role requires the flexibility to work non-traditional hours, including providing support during off-hours or weekends for critical data pipeline job runs, deployments, or incident response, while ensuring the total work commitment remains a 40-hour week.

  • Data processing: Experience working with data processing platforms such as Dataflow or Databricks.

  • Orchestration: Experience orchestrating/scheduling data pipelines using tools such as Airflow or Alteryx (see the Airflow sketch after this list)
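
As referenced in the data cleaning bullet above, here is an illustrative PySpark sketch of that kind of cleaning and enrichment: de-duplicating, type-casting, and deriving a partition column for raw PoS rows. The bucket paths and column names are assumptions for the example.

    from pyspark.sql import SparkSession, functions as F

    spark = SparkSession.builder.appName("pos-cleaning-sketch").getOrCreate()

    # Hypothetical raw landing zone.
    raw = spark.read.option("header", True).csv("gs://example-bucket/raw/pos/*.csv")

    clean = (
        raw.dropDuplicates(["store_id", "sku", "txn_date"])  # drop exact resends
           .withColumn("qty", F.col("qty").cast("int"))       # enforce types
           .filter(F.col("qty") > 0)                          # remove voided lines
           .withColumn("txn_month", F.date_format("txn_date", "yyyy-MM"))
    )

    # Hypothetical curated zone, partitioned for downstream consumers.
    clean.write.mode("overwrite").partitionBy("txn_month").parquet(
        "gs://example-bucket/curated/pos/")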

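As referenced in the data engineering concepts bullet above, here is a hedged sketch of a Type 2 slowly changing dimension (SCD) load in BigQuery, driven from Python with the google-cloud-bigquery client. The project, dataset, table, and column names are assumptions, not an actual MDLZ model.

    from google.cloud import bigquery

    client = bigquery.Client()

    # Step 1: expire the current version of any row whose tracked attribute changed.
    close_old = """
    UPDATE `my-project.dim.customer` tgt
    SET is_current = FALSE, valid_to = CURRENT_DATE()
    WHERE is_current = TRUE
      AND EXISTS (
        SELECT 1 FROM `my-project.staging.customer_delta` src
        WHERE src.customer_id = tgt.customer_id AND src.segment != tgt.segment)
    """

    # Step 2: insert a fresh current version for new keys and just-expired ones.
    insert_new = """
    INSERT `my-project.dim.customer`
      (customer_id, segment, valid_from, valid_to, is_current)
    SELECT src.customer_id, src.segment, CURRENT_DATE(), DATE '9999-12-31', TRUE
    FROM `my-project.staging.customer_delta` src
    LEFT JOIN `my-project.dim.customer` tgt
      ON tgt.customer_id = src.customer_id AND tgt.is_current = TRUE
    WHERE tgt.customer_id IS NULL
    """

    for stmt in (close_old, insert_new):
        client.query(stmt).result()  # block until each DML statement completes
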
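And as referenced in the orchestration bullet, a minimal Airflow (2.x) sketch of a daily DAG chaining an extract step into a transform step. The DAG id and the Python callables are placeholders for illustration.

    from datetime import datetime

    from airflow import DAG
    from airflow.operators.python import PythonOperator


    def extract():
        print("pull inbound files from the landing zone")  # placeholder step


    def transform():
        print("standardize records for downstream loads")  # placeholder step


    with DAG(
        dag_id="raw_domain_daily",        # hypothetical DAG id
        start_date=datetime(2025, 1, 1),
        schedule="0 2 * * *",             # run at 02:00 daily (Airflow 2.4+ argument)
        catchup=False,
    ) as dag:
        extract_task = PythonOperator(task_id="extract", python_callable=extract)
        transform_task = PythonOperator(task_id="transform", python_callable=transform)
        extract_task >> transform_task
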
More about this role

No Relocation support available

Business Unit Summary

Headquartered in Singapore, Mondelēz International's Asia, Middle East and Africa (AMEA) region is comprised of six business units, has more than 21,000 employees, and operates in more than 27 countries, including Australia, China, Indonesia, Ghana, India, Japan, Malaysia, New Zealand, Nigeria, the Philippines, Saudi Arabia, South Africa, Thailand, the United Arab Emirates and the United Kingdom. Seventy-six nationalities work across a network of more than 35 manufacturing plants, three global research and development technical centers, and offices stretching from Auckland, New Zealand to Casablanca, Morocco. Mondelēz International in the AMEA region is the proud maker of global and local iconic brands such as Oreo and belVita biscuits, Kinh Do mooncakes, Cadbury, Cadbury Dairy Milk and Milka chocolate, Halls candy, Stride gum, Tang powdered beverage, and Philadelphia cheese. We are also proud to be named a Top Employer in many of our markets.

Mondelēz International is an equal opportunity employer, and all qualified applicants will receive consideration for employment without regard to race, color, religion, gender, sexual orientation or preference, gender identity, national origin, disability status, protected veteran status, or any other characteristic protected by law.

Job Type: Regular

Digital Strategy & Innovation

Technology & Digital

About Company

Mondelēz International, Inc. empowers people to snack right in over 150 countries around the world. We're leading the future of snacking with iconic brands such as Oreo, belVita and LU biscuits; Cadbury Dairy Milk, Milka and Toblerone chocolate; Sour Patch Kids candy and Trident gum.
