Data Engineer Batch and Streaming Pipelines
Job Summary
Data Engineer - Batch and Streaming Pipelines
Location: Warsaw
Duration (months): 6
Contract Type: Contract
Design and build scalable data pipelines handling both batch and streaming workloads
Work with Apache Spark, Apache Beam and Airflow in a hands-on engineering role
Collaborate with Analytics and Platform Engineers on data infrastructure projects
Develop reliable, well-modelled datasets using SQL and dbt transformations
Take ownership of data quality, observability and pipeline performance
Company Overview
Our client is a forward-thinking, data-driven organisation based in Warsaw, specialising in building robust data infrastructure and analytics platforms. They work with teams across the business to design and operate scalable data systems that power critical decision-making. The organisation values technical excellence, collaboration and continuous improvement, attracting experienced engineers who are passionate about solving complex data challenges.
Our client is recruiting a Data Engineer to join their growing data platform team. If you have 8-10 years of experience building and operating distributed data pipelines and are looking to advance your career working with modern data technologies, this contract role offers the opportunity to make a direct impact on their data infrastructure.
Position Overview
As a Data Engineer, you'll design, build and operate scalable batch and streaming data pipelines that serve as the foundation for the organisation's analytics capabilities. You'll implement distributed data processing solutions, orchestrate complex workflows and ensure data quality across the platform. Your work will directly enable Analytics Engineers and Platform Engineers to deliver insights and build on reliable, well-modelled datasets.
Responsibilities
Design and build scalable batch and streaming data pipelines using Apache Spark and Apache Beam
Orchestrate complex data workflows using Apache Airflow, ensuring reliability and performance
Develop and maintain SQL and dbt transformations to create clean, well-modelled datasets
Implement distributed data processing solutions that handle large-scale data volumes
Monitor data quality, observability and pipeline performance across all systems
Troubleshoot and optimise pipeline performance to meet service level requirements
Collaborate closely with Analytics Engineers and Platform Engineers on data infrastructure projects
Document data pipelines, transformations and processes for team knowledge sharing
Requirements
8-10 years of experience designing and operating data pipelines
Strong proficiency in Python for data engineering tasks
Hands-on experience with Apache Spark and distributed data processing frameworks
Proven experience with Apache Airflow for workflow orchestration
Solid SQL skills and experience with dbt for data transformations
Experience building and maintaining data warehouses
Understanding of data quality principles and observability practices
Ability to collaborate effectively with cross-functional data teams
Experience with Google Cloud Platform or similar cloud data platforms is advantageous
Benefits
Competitive contract rate reflecting your experience and expertise
Opportunity to work with modern data technologies and frameworks
Collaborative environment with experienced Analytics and Platform Engineers
Contract flexibility with potential for extension based on performance
Alongside these benefits, you'll join a data-focused organisation that values technical expertise and collaboration. You'll work with engineers who are committed to building reliable, scalable data systems, and you'll have the autonomy to make meaningful technical decisions.
How to Apply
To apply for this role, please submit your CV using the form below or email
Required Experience:
IC