What will this person be working on
Design and implement ETL pipelines using AWS services Glue EMR DMS S3 Redshift
Orchestrate workflows with AWS Step Functions EventBridge and Lambda
Integrate CICD pipelines with GitHub and AWS CDK for automated deployments
Develop conceptual logical and physical data models for operational and analytical systems
Optimize queries normalize datasets and apply performance tuning techniques
Use Python PySpark and SQL for data transformation and automation
Monitor pipeline performance using CloudWatch and Glue job logs
Troubleshoot and resolve data quality and performance issues proactively
Minimum Experience
8-10 years in Data Engineering or related roles
Proven track record in AWSbased data solutions and orchestration
Integration with ERP systems SAP Homegrown ERP Systems
APIbased Data Exchange between Manufacturing Supply Chain legacy applications and AWS pipelines
Metadata Management for compliance attributes
Audit Trails Reporting for compliance verification
Expertise in cloud to design build and maintain datadriven solutions
Skilled in Data Architecture and Data Engineering with a strong background in Supply Chain domain
Experienced in Data Modeling Conceptual Logical and Physical ETL optimizations Query optimizations and Performance tuning
Technical Skills
Languages Python PySpark SQL
AWS Services Glue EMR EC2 Lambda DMS S3 Redshift RDS
Data Governance Informatica CDGCCDQ
DevOps Tools Git GitHub AWS CDK
Security IAM encryption policies
Monitoring CloudWatch Glue Catalog Athena
Strong integration background with DB2 UDB SQL Server etc
What will this person be working on Design and implement ETL pipelines using AWS services Glue EMR DMS S3 Redshift Orchestrate workflows with AWS Step Functions EventBridge and Lambda Integrate CICD pipelines with GitHub and AWS CDK for automated deployments Develop conceptual logical and physical...
What will this person be working on
Design and implement ETL pipelines using AWS services Glue EMR DMS S3 Redshift
Orchestrate workflows with AWS Step Functions EventBridge and Lambda
Integrate CICD pipelines with GitHub and AWS CDK for automated deployments
Develop conceptual logical and physical data models for operational and analytical systems
Optimize queries normalize datasets and apply performance tuning techniques
Use Python PySpark and SQL for data transformation and automation
Monitor pipeline performance using CloudWatch and Glue job logs
Troubleshoot and resolve data quality and performance issues proactively
Minimum Experience
8-10 years in Data Engineering or related roles
Proven track record in AWSbased data solutions and orchestration
Integration with ERP systems SAP Homegrown ERP Systems
APIbased Data Exchange between Manufacturing Supply Chain legacy applications and AWS pipelines
Metadata Management for compliance attributes
Audit Trails Reporting for compliance verification
Expertise in cloud to design build and maintain datadriven solutions
Skilled in Data Architecture and Data Engineering with a strong background in Supply Chain domain
Experienced in Data Modeling Conceptual Logical and Physical ETL optimizations Query optimizations and Performance tuning
Technical Skills
Languages Python PySpark SQL
AWS Services Glue EMR EC2 Lambda DMS S3 Redshift RDS
Data Governance Informatica CDGCCDQ
DevOps Tools Git GitHub AWS CDK
Security IAM encryption policies
Monitoring CloudWatch Glue Catalog Athena
Strong integration background with DB2 UDB SQL Server etc
View more
View less