Data Engineer

Not Interested
Bookmark
Report This Job

profile Job Location:

Bengaluru - India

profile Monthly Salary: Not Disclosed
Posted on: 30+ days ago
Vacancies: 1 Vacancy

Job Summary

Job Description:

  • Design develop and maintain scalable ETL / ELT data pipelines using PySpark and SQL on distributed systems.

  • Implement data ingestion transformation and integration workflows from multiple sources (structured and unstructured).

  • Optimize performance of Spark jobs and ensure high-quality reliable data delivery.

  • Collaborate with data analysts data scientists and business teams to understand requirements and deliver efficient data solutions.

  • Manage and automate data workflows using Python scripting and orchestration tools (e.g. Airflow Azure Data Factory AWS Glue).

  • Deploy and manage data solutions on cloud platforms (AWS / Azure / GCP) using native services (e.g. S3 Redshift BigQuery Synapse Databricks).

  • Implement data quality security and compliance controls in alignment with PwCs governance framework.

  • Participate in code reviews documentation and process improvement initiatives.

Job Description: Design develop and maintain scalable ETL / ELT data pipelines using PySpark and SQL on distributed systems. Implement data ingestion transformation and integration workflows from multiple sources (structured and unstructured). Optimize performance of Spark jobs a...
View more view more

Key Skills

  • Apache Hive
  • S3
  • Hadoop
  • Redshift
  • Spark
  • AWS
  • Apache Pig
  • NoSQL
  • Big Data
  • Data Warehouse
  • Kafka
  • Scala