Senior Data Engineer


Job Location:

Warsaw - Poland

Monthly Salary: Not Disclosed
Posted on: 2 days ago
Vacancies: 1 Vacancy

Job Summary

  • Design and build scalable cloud-native data platforms from greenfield to production
  • Implement near-real-time ingestion pipelines using event-driven patterns
  • Define and enforce platform standards, including Data Lake / Lakehouse principles, medallion architecture, and data contracts
  • Refactor and optimise existing Spark and PySpark scripts for performance and maintainability
  • Introduce best practices for code quality, testing, and CI/CD across data pipelines
  • Drive adoption of AI tooling and agentic workflows within the data engineering team
  • Ensure data quality, observability, and reliability across all pipelines and platforms
  • Develop self-service tooling and microservices to simplify platform usage for other teams

Qualifications :

  • 5 years of professional experience in Data Engineering
  • Strong Python and SQL development skills for pipeline development and optimisation
  • Proficiency in Apache Spark / PySpark including query optimisation and performance tuning
  • Hands-on experience with Databricks (preferred) or Snowflake
  • Experience with at least one major cloud provider: Azure (preferred), AWS, or GCP
  • Experience with stream processing technologies (Kafka, Spark Structured Streaming)
  • Solid understanding of ETL/ELT patterns, data modelling (dimensional, Data Vault), and data warehousing
  • Experience with orchestration tools (Apache Airflow, Azure Data Factory, or equivalent)
  • Knowledge of Infrastructure as Code (Terraform or equivalent)
  • Understanding of production-grade system requirements: reliability, scalability, observability, and performance
  • Upper-Intermediate English level

WILL BE A PLUS

  • Familiarity with RAG pipeline design and LLM integration patterns
  • Knowledge of data governance frameworks and tools (Unity Catalog, Apache Atlas, or similar)
  • Experience with dbt for data transformation and modelling
  • Familiarity with MLflow, Feature Stores, or ML platform integration

Additional Information :

PERSONAL PROFILE

  • Self-driven and proactive in identifying improvements
  • Comfortable working in a fast-paced, innovative environment
  • Strong problem-solving mindset with attention to detail
  • Open to experimenting with emerging technologies and approaches

Remote Work :

Yes


Employment Type :

Full-time

About Company

At Sigma Software, we are involved with the client’s team to contribute to the design and development of a technical solution for their tokenized domain reservation platform. We started by assigning a software architect to design the smart contracts and integrate blockchain into the s ...
