drjobs Data Engineering Architect ATC

Data Engineering Architect ATC

Employer Active

1 Vacancy
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

Bengaluru - India

Monthly Salary drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Vacancy

1 Vacancy

Job Description

Experience:8 years of experience in data engineering specifically in cloud environments like in PySpark for distributed data processing and experience with AWS Glue for ETL jobs and managing data experience with AWS Data Pipeline (DPL) for workflow experience with AWS services such as S3 Lambda Redshift RDS and Skills:Proficiency in Python and PySpark for data processing and transformation understanding of ETL concepts and best with AWS Glue (ETL jobs Data Catalog and Crawlers).Experience building and maintaining data pipelines with AWS Data Pipeline or similar orchestration with AWS S3 for data storage and management including file formats (CSV Parquet Avro).Strong knowledge of SQL for querying and manipulating relational and semistructured with Data Warehousing and Big Data technologies specifically within Skills:Experience with AWS Lambda for serverless data processing and of AWS Redshift for data warehousing and with Data Lakes Amazon EMR and Kinesis for streaming data of data governance practices including data lineage and with CI/CD pipelines and Git for version with Docker and containerization for building and deploying and Build Data Pipelines: Design implement and optimize data pipelines on AWS using PySpark AWS Glue and AWS Data Pipeline to automate data integration transformation and storage Development: Develop and maintain Extract Transform and Load (ETL) processes using AWS Glue and PySpark to efficiently process large Workflow Automation: Build and manage automated data workflows using AWS Data Pipeline ensuring seamless scheduling monitoring and management of data Integration: Work with different AWS data storage services (e.g. S3 Redshift RDS) to ensure smooth integration and movement of data across and Scaling: Optimize and scale data pipelines for high performance and cost efficiency utilizing AWS services like Lambda S3 and EC2.

Employment Type

Full-Time

Company Industry

About Company

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.