Ready to shape the future of work
At Genpact we dont just adapt to changewe drive it. AI and digital innovation are redefining industries and were leading the charge. Genpacts AI Gigafactory our industry-first accelerator is an example of how were scaling advanced technology solutions to help global enterprises work smarter grow faster and transform at scale. From large-scale models to agentic AI our breakthrough solutions tackle companies most complex challenges.
If you thrive in a fast-moving tech-driven environment love solving real-world problems and want to be part of a team thats shaping the future this is your moment.
Genpact (NYSE: G) is anadvanced technology services and solutions company that deliverslastingvalue for leading ourdeep business knowledge operational excellence and cutting-edge solutions we help companies across industries get ahead and stay by curiosity courage and innovationour teamsimplementdata technology and AItocreate tomorrow to know us onLinkedInXYouTube andFacebook.
Inviting applications for the role of Assistant Vice President Lead Data Engineer
In this role a Lead data engineer will lead the design and optimization of advanced data solutions. This role requires expertise in Databricks Azure Data Factory (ADF) Python PySpark and Unity Catalog to efficiently process and manage large datasets along with a deep understanding of cloud architecture to build scalable secure and reliable data solutions on the Microsoft Azure platform. The primary responsibility of the lead data engineer with Unity Catalogue expertise is to apply advanced data engineering skills to optimize data integration enhance data accessibility and drive strategic decision-making through effective data governance simplification standardization and innovative solutions across all supported units. This role will be implementing DevOps best practices and driving innovation using modern data platform capabilities such as Unity Catalog MLflow and Large Language Models (LLMs).
Responsibilities
Design and development.
oCollaborate with business stakeholders and analysts to understand data requirements. Design develop and test data pipelines and workflows using Unity Catalogue to optimize end-to-end processes. Create reusable components robust exception handling and standardized frameworks for data solutions.
Solution Design
oDevelop and maintain robust data architectures using Lakehouse principles to ensure efficient data processing and storage. Comprehensive data architecture solutions using Databricks and Lakehouse principles to support advanced analytics and machine learning initiatives.
oExplore and integrate Large Language Models (LLMs) and Copilot tools to drive automation and agility.
oLeverage Databricks MLflow for model lifecycle management and operationalization
oLeverage data best practices and tools and assist ML engineer in pulling filtering tagging joining parsing and normalizing data sets for use.
Data Quality and Governance:
oEnsure data quality frameworks lineage and monitoring are in place.
oImplement data quality checks validation rules and governance policies to ensure the accuracy reliability and security of data assets.
oImplement data security and privacy measures to protect sensitive information.
Data Integration and Analytics:
oPull data from different sources transform and stitch it for advanced analytics activities.
oDesign implement and deploy data loaders to load data into the engineering sandbox.
oCollaborate with data scientists and analysts to support their data requirements and prepare machine learning feature stores.
oPull/ingest data from different sources transform and stitch and wrangle it for advanced analytics activities.
Leadership and Mentorship:
oOwn complex cross-functional data projects from ideation to production including defining requirements designing solutions leading development and ensuring successful deployment and long-term maintenance.
oProvide guidance and technical leadership to a team of data engineers through in-depth code reviews mentoring junior and mid-level engineers and fostering a culture of technical excellence.
oMentor mid-level engineers and perform peer reviews.
oProvide input to ML engineer/cloud engineer for the design and implementation of data management and/or architecture solutions
Process improvement and efficiency.
oDrive continuous improvement initiatives in data processes and systems. Promote standardization and automation to enhance efficiency and accuracy. Support regional and global data projects
Qualifications We Seek in You!
Minimum Qualifications / Skills
Bachelors degree in computer science Information Systems or a related field.
Experience in Databricks Azure ADF Python Pyspark and Unity Catalog Dataflow and Lakehouse architecture
Deep hands-on expertise in Azure Data Services (e.g. Azure Data Lake Azure Data Factory Synapse etc.) and Databricks.
Strong experience in data pipeline design ETL/ELT development and data orchestration frameworks.
Proficiency in DevOps tools and practices (CI/CD pipelines IaC monitoring).
Knowledge of data lineage cataloging and enterprise data marketplace concepts.
Familiarity with integrating 3rd party data sources and managing data quality frameworks.
Ability to leverage LLMs and Copilot solutions to enhance data platform productivity.
Experience in building self-healing architecture for data pipelines.
Proven experience in managing data projects in complex environments including global or multinational contexts
Hands-on experience with data pipeline development and optimization
Deep knowledge of data governance frameworks and tools including Databricks Unity Catalog to ensure data security quality and compliance at an enterprise level.
A strong understanding of MLOps for building data foundations that support machine learning.
Experience with DevOps practices to enhance data project delivery efficiency
Preferred Qualifications / Skills
Prior track record of leadingenterprise HR/People platformsa plus
Leads multiple pods mentoring senior and mid-level engineers
Experience in large-scale Lakehouse design data mesh principles and performance optimization
Certifications in Azure data engineering Databricks or related fields
Why join Genpact
Be a transformation leader Work at the cutting edge of AI automation and digital innovation
Make an impact Drive change for global enterprises and solve business challenges that matter
Accelerate your career Get hands-on experience mentorship and continuous learning opportunities
Work with the best Join 140000 bold thinkers and problem-solvers who push boundaries every day
Thrive in a values-driven culture Our courage curiosity and incisiveness - built on a foundation of integrity and inclusion - allow your ideas to fuel progress
Come join the tech shapers and growth makers at Genpact and take your career in the only direction that matters: Up.
Lets build tomorrow together.
Genpact is an Equal Opportunity Employer and considers applicants for all positions without regard to race color religion or belief sex age national origin citizenship status marital status military/veteran status genetic information sexual orientation gender identity physical or mental disability or any other characteristic protected by applicable laws. Genpact is committed to creating a dynamic work environment that values respect and integrity customer focus and innovation.
Furthermore please do note that Genpact does not charge fees to process job applications and applicants are not required to pay to participate in our hiring process in any other way. Examples of such scams include purchasing a starter kit paying to apply or purchasing equipment or training.
Required Experience:
Exec
Artificial Intelligence. Real Outcomes. AI is changing big businesses, and so are we. Discover how cutting-edge AI drives unparalleled value.