Data Engineer Batch and Streaming Pipelines

Belvedere

Job Location:

Warsaw - Poland

Monthly Salary: Not Disclosed

Posted on: 2 days ago

Vacancies: 1 Vacancy

Job Summary

Data Engineer - Batch and Streaming Pipelines

Location: Warsaw

Duration(months): 6

Contract Type: Contract

Design and build scalable data pipelines handling both batch and streaming workloads

Work with Apache Spark Apache Beam and Airflow in a hands-on engineering role

Collaborate with Analytics and Platform Engineers on data infrastructure projects

Develop reliable well-modelled datasets using SQL and dbt transformations

Take ownership of data quality observability and pipeline performance

Company Overview

Our client is a forward-thinking data-driven organisation based in Warsaw specialising in building robust data infrastructure and analytics platforms. They work with teams across the business to design and operate scalable data systems that power critical decision-making. The organisation values technical excellence collaboration and continuous improvement attracting experienced engineers who are passionate about solving complex data challenges.

Our client is recruiting a Data Engineer to join their growing data platform team. If you have 8-10 years of experience building and operating distributed data pipelines and are looking to advance your career working with modern data technologies this contract role offers the opportunity to make a direct impact on their data infrastructure.

Position Overview

As a Data Engineer youll design build and operate scalable batch and streaming data pipelines that serve as the foundation for the organisations analytics capabilities. Youll implement distributed data processing solutions orchestrate complex workflows and ensure data quality across the platform. Your work will directly enable Analytics Engineers and Platform Engineers to deliver insights and build on reliable well-modelled datasets.

Responsibilities

Design and build scalable batch and streaming data pipelines using Apache Spark and Apache Beam

Orchestrate complex data workflows using Apache Airflow ensuring reliability and performance

Develop and maintain SQL and dbt transformations to create clean well-modelled datasets

Implement distributed data processing solutions that handle large-scale data volumes

Monitor data quality observability and pipeline performance across all systems

Troubleshoot and optimise pipeline performance to meet service level requirements

Collaborate closely with Analytics Engineers and Platform Engineers on data infrastructure projects

Document data pipelines transformations and processes for team knowledge sharing

Requirements

8-10 years of experience designing and operating data pipelines

Strong proficiency in Python for data engineering tasks

Hands-on experience with Apache Spark and distributed data processing frameworks

Proven experience with Apache Airflow for workflow orchestration

Solid SQL skills and experience with dbt for data transformations

Experience building and maintaining data warehouses

Understanding of data quality principles and observability practices

Ability to collaborate effectively with cross-functional data teams

Experience with Google Cloud Platform or similar cloud data platforms is advantageous

Benefits

Competitive contract rate reflecting your experience and expertise

Opportunity to work with modern data technologies and frameworks

Collaborative environment with experienced Analytics and Platform Engineers

Contract flexibility with potential for extension based on performance

Alongside these benefits youll join a data-focused organisation that values technical expertise and collaboration. Youll work with engineers who are committed to building reliable scalable data systems and have the autonomy to make meaningful technical decisions.

How to Apply

To apply for this role please submit your CV using the form below or email

Required Experience:

Data Engineer - Batch and Streaming PipelinesLocation: WarsawDuration(months): 6Contract Type: Contract Design and build scalable data pipelines handling both batch and streaming workloads Work with Apache Spark Apache Beam and Airflow in a hands-on engineering role Collaborate with Analytics and Pl...