Position: Sr. Lead/ Lead Data engineer
Location: Chennai
Who Are We: Purchasing Power () We are an Atlanta-based voluntary benefit company offering an industry-leading employee purchase program for brand-name consumer products online education services and travel offerings through convenient payroll deduction helping employees achieve financial flexibility. Opportunity: Purchasing Power one of Atlantas fastest-growing e-retailers is seeking a Lead Data Engineer to join our team in Chennai this role the selected candidate will lead the design development and orchestration of scalable data pipelines and infrastructure in a cloud-native environment reporting directly to the Data Architect. The Lead Data Engineer will be responsible for architecting and optimizing data workflows across our AWS ecosystem ensuring robust data lineage security and performance. This role demands deep expertise in big data frameworks cloud services and pipeline automation to support enterprise-level analytics and decision-making.
What You Will Do:
Lead the architecture and implementation of robust ETL pipelines using AWS Glue Spark and Hive.
Manage distributed data processing on Amazon EMR ensuring high performance and fault tolerance.
Orchestrate workflows using Apache Airflow or AWS MWAA optimizing for scalability and reliability.
Design and maintain efficient data models and schemas for cloud-based data warehouses and storage systems.
Monitor pipeline health using CloudWatch SNS and other AWS tools ensuring proactive issue resolution.
Secure data infrastructure through proper IAM role and policy configurations.
Collaborate with cross-functional teams including analysts product managers and stakeholders to translate business needs into technical solutions.
Document pipeline architecture data lineage and operational procedures for transparency and maintainability.
Utilize Python and Scala for data processing automation and Spark job development.
Apply advanced SQL skills for data manipulation performance tuning and analytics.
Work with GreenPlum for enterprise data warehousing and reporting. The Experience You Will Bring: Technical Skills:
Proven expertise in AWS services: EMR S3 IAM CloudWatch SNS EC2 and MWAA.
Strong background in big data frameworks like Apache Spark and Hadoop.
Hands-on experience with ETL development data modeling and pipeline orchestration.
Deep understanding of data warehousing principles and cloud-native architecture.
Proficiency in Python Scala and SQL for data engineering tasks.
Experience with GreenPlum or similar MPP databases. Academic Requirements:
Bachelors/ Masters degree in Computer Science or related Field