You will work on designing, building, and optimizing scalable data solutions in a cloud environment, focusing on creating efficient ETL pipelines and data models that support business needs. The role involves close collaboration with both technical and business stakeholders to translate requirements into reliable data products while continuously improving performance, data quality, and development standards across the platform.
Your tasks
Your responsibilities will include performance tuning and optimization of existing solutions, building and maintaining ETL pipelines, as well as testing and documenting current data flows.
You will also be involved in implementing tools and processes to support data-related projects and promoting best development standards across the team.
Design, build, test, and deploy cloud and on-premise data models and transformations using cloud-native or dedicated toolsets.
Optimize data views for specific visualization use cases, making use of schema design, partitions, indexes, down-sampling, archiving, etc., to manage trade-offs such as performance and flexibility.
Review, refine, interpret, and implement business and technical requirements.
Contribute to the team's ongoing productivity and priorities by refining user stories, epics, and backlogs in Jira.
Onboard new data sources: design, build, test, and deploy cloud data ingestion pipelines, warehouses, and data models/products.
Requirements
At least 4-5 years of commercial experience as a Data Engineer.
Strong Python and PySpark skills.
Experience with GCP Cloud toolset.
Strong hands-on experience with SQL and query optimization.
Experience with ETL/ELT pipeline development, testing, and management.
Strong experience with Hadoop.
Understanding of key concepts around Data Warehousing, Data Lakes, and Data Lakehouses.
Openness to work 2 days a week from our client's office (Kraków).
Nice to have
Experience with Java/Scala.
GDPR clause
Based on Regulation (EU) 2016/679 of the European Parliament and of the Council of 27 April 2016 on the protection of natural persons with regard to the processing of personal data and on the free movement of such data, and repealing Directive 95/46/EC (known as the GDPR), I hereby give consent to GFT Poland Sp. z o.o. (90-118 Łódź, ul. Kilińskiego 66) to process my personal data for the purpose of: a) the ongoing recruitment process for the position I am applying for; b) recruitment processes organized in the future.
Required Experience:
Senior IC
Type of contract: B2B contract
Salary range: 125-160 PLN/H
We see opportunity in technology. In domains such as cloud, AI, mainframe modernisation, DLT and IoT, we blend established practice with new thinking to help our clients stay ahead.