Job Title: Senior Data Engineer Location: McLean VA Can do Only w2 No C2C
Job Summary:
We are seeking a highly skilled Data Engineer with strong analytics capabilities and expertise in designing scalable reusable and sustainable data platforms. The ideal candidate will own the complete data lifecycle from data ingestion and pipeline development to advanced analytics and Tableau dashboard creation. This role requires deep expertise in Google Cloud Platform (GCP) BigQuery Python SQL Cloud Composer (Apache Airflow) Vertex AI Pipelines and Tableau.
The successful candidate will bring an architectural mindset focusing on modularity efficiency data governance and long-term maintainability while delivering scalable solutions that support enterprise-wide analytics and reporting needs.
Key Responsibilities:
Design develop and maintain scalable end-to-end data pipelines on Google Cloud Platform (GCP).
Build and optimize enterprise-grade datasets using BigQuery with advanced partitioning clustering materialized views and cost-optimization strategies.
Develop and manage production-grade workflows using Cloud Composer (Apache Airflow) including scheduling dependency management monitoring and retry mechanisms.
Build and support Vertex AI Pipelines for machine learning workflows and large-scale data transformation processes.
Design reusable and scalable data models using Medallion Architecture (Bronze/Silver/Gold) Star Schema and Data Vault methodologies.
Create modular and reusable datasets that support multiple business functions and reporting requirements.
Design develop and optimize interactive Tableau dashboards and reporting solutions.
Translate business requirements into intuitive visualizations and self-service analytics capabilities.
Optimize Tableau extracts published data sources and dashboard performance.
Implement data governance standards naming conventions schema management and documentation practices.
Collaborate with business stakeholders analysts data scientists and engineering teams to gather requirements and deliver scalable solutions.
Develop automation tools API integrations and custom data processing solutions using Python.
Write and optimize complex SQL queries involving CTEs window functions recursive queries and large-scale data processing.
Support CI/CD implementation and infrastructure-as-code initiatives for data engineering projects.
Maintain version control and deployment processes using Git-based workflows.
Required Skills:
Google Cloud Platform (GCP)
BigQuery (Advanced)
Python (Advanced)
SQL (Advanced)
Cloud Composer / Apache Airflow
Vertex AI Pipelines
Tableau Desktop
Tableau Server / Tableau Cloud
Data Pipeline Development
Data Modeling and Data Architecture
Medallion Architecture (Bronze/Silver/Gold)
Star Schema Design
Data Vault Modeling
ETL/ELT Development
Data Warehouse Design
Materialized Views
Partitioning and Clustering Strategies
Query Optimization
API Integrations
Data Governance and Documentation
CI/CD Practices
Infrastructure as Code (IaC)
Git / GitLab
Preferred Qualifications:
9 years of experience in Data Engineering.
Strong experience working within enterprise-scale cloud environments.
Hands-on experience with Terraform for infrastructure provisioning and management.
Experience with Google Cloud Storage (GCS).
Experience with Google Cloud Functions.
Exposure to machine learning workflow orchestration and MLOps practices.
Experience designing canonical data models and shared enterprise datasets.
Understanding of scalable cloud-native data architecture patterns
Best Regards:
Bindu M Phone: Email:
Job Title: Senior Data Engineer Location: McLean VA Can do Only w2 No C2C Job Summary: We are seeking a highly skilled Data Engineer with strong analytics capabilities and expertise in designing scalable reusable and sustainable data platforms. The ideal candidate will own the complete data lifecyc...
Job Title: Senior Data Engineer Location: McLean VA Can do Only w2 No C2C
Job Summary:
We are seeking a highly skilled Data Engineer with strong analytics capabilities and expertise in designing scalable reusable and sustainable data platforms. The ideal candidate will own the complete data lifecycle from data ingestion and pipeline development to advanced analytics and Tableau dashboard creation. This role requires deep expertise in Google Cloud Platform (GCP) BigQuery Python SQL Cloud Composer (Apache Airflow) Vertex AI Pipelines and Tableau.
The successful candidate will bring an architectural mindset focusing on modularity efficiency data governance and long-term maintainability while delivering scalable solutions that support enterprise-wide analytics and reporting needs.
Key Responsibilities:
Design develop and maintain scalable end-to-end data pipelines on Google Cloud Platform (GCP).
Build and optimize enterprise-grade datasets using BigQuery with advanced partitioning clustering materialized views and cost-optimization strategies.
Develop and manage production-grade workflows using Cloud Composer (Apache Airflow) including scheduling dependency management monitoring and retry mechanisms.
Build and support Vertex AI Pipelines for machine learning workflows and large-scale data transformation processes.
Design reusable and scalable data models using Medallion Architecture (Bronze/Silver/Gold) Star Schema and Data Vault methodologies.
Create modular and reusable datasets that support multiple business functions and reporting requirements.
Design develop and optimize interactive Tableau dashboards and reporting solutions.
Translate business requirements into intuitive visualizations and self-service analytics capabilities.
Optimize Tableau extracts published data sources and dashboard performance.
Implement data governance standards naming conventions schema management and documentation practices.
Collaborate with business stakeholders analysts data scientists and engineering teams to gather requirements and deliver scalable solutions.
Develop automation tools API integrations and custom data processing solutions using Python.
Write and optimize complex SQL queries involving CTEs window functions recursive queries and large-scale data processing.
Support CI/CD implementation and infrastructure-as-code initiatives for data engineering projects.
Maintain version control and deployment processes using Git-based workflows.
Required Skills:
Google Cloud Platform (GCP)
BigQuery (Advanced)
Python (Advanced)
SQL (Advanced)
Cloud Composer / Apache Airflow
Vertex AI Pipelines
Tableau Desktop
Tableau Server / Tableau Cloud
Data Pipeline Development
Data Modeling and Data Architecture
Medallion Architecture (Bronze/Silver/Gold)
Star Schema Design
Data Vault Modeling
ETL/ELT Development
Data Warehouse Design
Materialized Views
Partitioning and Clustering Strategies
Query Optimization
API Integrations
Data Governance and Documentation
CI/CD Practices
Infrastructure as Code (IaC)
Git / GitLab
Preferred Qualifications:
9 years of experience in Data Engineering.
Strong experience working within enterprise-scale cloud environments.
Hands-on experience with Terraform for infrastructure provisioning and management.
Experience with Google Cloud Storage (GCS).
Experience with Google Cloud Functions.
Exposure to machine learning workflow orchestration and MLOps practices.
Experience designing canonical data models and shared enterprise datasets.
Understanding of scalable cloud-native data architecture patterns