We are looking for a Data Engineer (m/f) with solid experience in Big Data environments, particularly with Apache Spark. You will help design and implement robust data pipelines and storage solutions that support the data needs of business and technical stakeholders.
Responsibilities:
- Build and optimize data pipelines with Apache Spark (Python and/or Scala)
- Process large-scale batch and streaming datasets
- Work with REST APIs to retrieve and integrate external data
- Collaborate with data scientists and engineers in Agile teams
- Ensure data quality, testing, and monitoring
- Contribute to CI/CD and automation best practices
- Organize and manage data in on-prem object storage
- Promote data governance awareness: data lineage, metadata, PII, data contracts
Profile:
- Bachelor's or Master's degree in Computer Science, Engineering, or a related field
- 2 to 5 years of experience as a Data Engineer in Big Data environments
- Strong skills in Apache Spark (Python and/or Scala), SQL, and data integration
- Comfortable with Git, Airflow, and CI/CD pipelines
- Experience with REST APIs and object storage (S3/MinIO)
- Previous work in on-premises environments (not cloud-based) is appreciated
- Awareness of data governance topics: data lineage, metadata, PII, data contracts
- Fluent in French and English (minimum B2 level)
- Proactive, detail-oriented, and a strong communicator
Remote Work:
No
Employment Type:
Full-time