AWS Data Engineer

Cloudious LLC

Not Interested
Bookmark
Report This Job

profile Job Location:

Marysville, OH - USA

profile Monthly Salary: Not Disclosed
Posted on: 11 hours ago
Vacancies: 1 Vacancy

Job Summary

What will this person be working on

Design and implement ETL pipelines using AWS services Glue EMR DMS S3 Redshift

Orchestrate workflows with AWS Step Functions EventBridge and Lambda

Integrate CICD pipelines with GitHub and AWS CDK for automated deployments

Develop conceptual logical and physical data models for operational and analytical systems

Optimize queries normalize datasets and apply performance tuning techniques

Use Python PySpark and SQL for data transformation and automation

Monitor pipeline performance using CloudWatch and Glue job logs

Troubleshoot and resolve data quality and performance issues proactively

Minimum Experience

8-10 years in Data Engineering or related roles

Proven track record in AWSbased data solutions and orchestration

Integration with ERP systems SAP Homegrown ERP Systems

APIbased Data Exchange between Manufacturing Supply Chain legacy applications and AWS pipelines

Metadata Management for compliance attributes

Audit Trails Reporting for compliance verification

Expertise in cloud to design build and maintain datadriven solutions

Skilled in Data Architecture and Data Engineering with a strong background in Supply Chain domain

Experienced in Data Modeling Conceptual Logical and Physical ETL optimizations Query optimizations and Performance tuning

Technical Skills

Languages Python PySpark SQL

AWS Services Glue EMR EC2 Lambda DMS S3 Redshift RDS

Data Governance Informatica CDGCCDQ

DevOps Tools Git GitHub AWS CDK

Security IAM encryption policies

Monitoring CloudWatch Glue Catalog Athena

Strong integration background with DB2 UDB SQL Server etc

What will this person be working on Design and implement ETL pipelines using AWS services Glue EMR DMS S3 Redshift Orchestrate workflows with AWS Step Functions EventBridge and Lambda Integrate CICD pipelines with GitHub and AWS CDK for automated deployments Develop conceptual logical and physical...
View more view more

Key Skills

  • Apache Hive
  • S3
  • Hadoop
  • Redshift
  • Spark
  • AWS
  • Apache Pig
  • NoSQL
  • Big Data
  • Data Warehouse
  • Kafka
  • Scala