drjobs Python Data Engineer

Python Data Engineer

Employer Active

1 Vacancy
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

San Jose, CA - USA

Monthly Salary drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Vacancy

1 Vacancy

Job Description

We are seeking a highly skilled Python Data Engineer with deep experience in CMS datasets (MOR MMR MAO) and a strong understanding of healthcare regulations and compliance standards (HIPAA). This role is ideal for a datadriven professional who thrives in cloudnative environments and is passionate about building robust scalable and efficient pipelines that drive healthcare innovation.

Key Responsibilities:

  • Design develop and maintain scalable ETL pipelines for CMS datasets using GCP Dataflow (Apache Beam) and Python

  • Architect and manage BigQuery data warehouses ensuring optimal performance and costefficiency

  • Implement and manage Airflow DAGs for workflow orchestration and scheduling

  • Ensure endtoend data quality lineage validation and governance in alignment with HIPAA and CMS standards

  • Optimize largescale healthcare datasets using partitioning clustering sharding and efficient query patterns in BigQuery

  • Collaborate within Agile teams using tools like Jira and Confluence for sprint planning and documentation

  • Monitor troubleshoot and improve pipeline reliability and performance across the full data lifecycle

Qualifications:

  • Bachelors degree in Computer Science Information Systems or related field

  • 3 years of experience in cloudbased data engineering preferably with healthcare datasets

  • Strong proficiency in Python GCP Dataflow and Apache Beam

  • Expertlevel knowledge in BigQuery including schema design performance tuning and advanced SQL

  • Handson experience with Airflow forthe orchestration of complex data workflows

  • Indepth understanding of data warehouse design including star/snowflake schemas normalization and denormalization

  • Strong analytical skills for query and data optimization

  • Familiarity with Agile methodologies and collaboration tools (Jira Confluence)

  • Knowledge of CMS datasets (MOR MMR MAO) and healthcare data privacy/compliance standards (HIPAA)

Employment Type

Full-Time

Company Industry

About Company

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.