We are seeking an experienced and dedicated GCP Data Engineer to join our team. You will be responsible for designing, building, and optimizing robust, scalable, and highly available data pipelines and ETL/ELT solutions exclusively within the Google Cloud Platform (GCP). This role requires a strong focus on utilizing GCP's native data services to ensure data quality, automation, and performance optimization across the data lifecycle.
Design, build, and maintain scalable data pipelines and ETL/ELT processes using core GCP data services such as Cloud Dataflow (Apache Beam), Cloud Dataproc, and BigQuery.
Develop and optimize data infrastructure on GCP to ensure reliable, high-speed data ingestion (e.g., using Cloud Pub/Sub and Cloud Storage).
Implement data quality checks, monitoring, and validation to ensure the accuracy and integrity of data across all GCP systems.
Collaborate closely with Data Scientists and Data Analysts to ensure data readiness for reporting, analytics, and Machine Learning initiatives (e.g., integrating with Vertex AI).
Automate deployment, monitoring, and testing of data infrastructure and pipelines using Infrastructure as Code (IaC) tools like Terraform and CI/CD practices.
Manage and optimize GCP data storage solutions, primarily BigQuery (data warehouse) and Cloud Storage (data lake), for performance and cost efficiency.
Provide technical guidance and recommendations on data architecture and technology choices within the GCP ecosystem.
Qualifications:
Mandatory hands-on expertise with Google Cloud Platform (GCP) data services, including BigQuery, Cloud Dataflow, Cloud Storage, and Cloud Pub/Sub.
Strong proficiency in SQL and extensive experience with data warehousing concepts and data modeling techniques.
Expertise in at least one programming language commonly used for data engineering (Python is highly preferred).
Experience with Infrastructure as Code (IaC) tools like Terraform for automating GCP data infrastructure deployment.
Solid understanding of distributed systems and ETL/ELT frameworks.
Excellent analytical and problem-solving skills, with a passion for continuous learning and data governance.
Preferred Skills:
Google Cloud Professional Data Engineer certification.
Experience with streaming data technologies and real-time processing within GCP.
Knowledge of containerization and orchestration (Docker, Kubernetes/GKE).
Additional Information:
The Devoteam Group works for equal opportunities, promoting its employees based on merit, and actively fights against all forms of discrimination. We are convinced that diversity contributes to the creativity, dynamism, and excellence of our organization. All of our vacancies are open to people with disabilities.
Remote Work:
No
Employment Type:
Full-time
Devoteam is an AI-driven tech consulting firm specialised in cloud platforms, cyber, data, and sustainability. Tech native for almost 30 years, Devoteam guides businesses through sustainable digital transformation to deliver value. With over 11,000 tech architects in more than 25 co ...