Develop and maintain batch and streaming data pipelines as per platform standards
Implement data workflows using Apache Airflow
Build batch data transformations using Apache Spark (PySpark / Spark SQL)
Develop real-time and streaming pipelines using Apache Flink
Write and manage data in Apache Iceberg tables using Project Nessie
Ensure pipelines meet performance reliability and data quality requirements
Debug and resolve pipeline failures and data inconsistencies
Collaborate with Data Engineering Leads on pipeline design and optimization
Work closely with DevOps Engineers for deployments and runtime support
Follow established coding standards review processes and documentation practices
Job Description: Develop and maintain batch and streaming data pipelines as per platform standards Implement data workflows using Apache Airflow Build batch data transformations using Apache Spark (PySpark / Spark SQL) Develop real-time and streaming pipelines using Apache Flink Write ...
Job Description:
Develop and maintain batch and streaming data pipelines as per platform standards
Implement data workflows using Apache Airflow
Build batch data transformations using Apache Spark (PySpark / Spark SQL)
Develop real-time and streaming pipelines using Apache Flink
Write and manage data in Apache Iceberg tables using Project Nessie
Ensure pipelines meet performance reliability and data quality requirements
Debug and resolve pipeline failures and data inconsistencies
Collaborate with Data Engineering Leads on pipeline design and optimization
Work closely with DevOps Engineers for deployments and runtime support
Follow established coding standards review processes and documentation practices