drjobs Data Engineer - Tool Abstraction

Data Engineer - Tool Abstraction

Employer Active

1 Vacancy
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

Rockville - USA

Monthly Salary drjobs

$ 115000 - 155000

Vacancy

1 Vacancy

Job Description

(ID: 2025-0413)


Axle is a bioscience and information technology company that offers advancements in translational research biomedical informatics and data science applications to research centers and healthcare organizations nationally and abroad. With experts in biomedical science software engineering and program management we focus on developing and applying research tools and techniques to empower decision-making and accelerate research discoveries. We work with some of the top research organizations and facilities in the country including multiple institutes at the National Institutes of Health (NIH).

Axle is seeking a Data Engineer - Tool Abstraction to join our vibrant team at the National Institutes of Health (NIH) supporting the National Center for Advancing Translation Sciences (NCATS) located in Rockville MD.

Benefits We Offer:

  • 100% Medical Dental & Vision Coverage for Employees
  • Paid Time Off and Paid Holidays
  • 401K match up to 5%
  • Educational Benefits for Career Growth
  • Employee Referral Bonus
  • Flexible Spending Accounts:
    • Healthcare (FSA)
    • Parking Reimbursement Account (PRK)
    • Dependent Care Assistant Program (DCAP)
    • Transportation Reimbursement Account (TRN)

Key Responsibilities:

Design and Automation of Data Pipelines:

  • Build and maintain scalable and efficient data pipelines for clinical and research datasets.

  • Automate the extraction transformation and loading (ETL) processes to ensure timely and reliable data delivery while optimizing workflows for downstream analysis.

Data Ingestion Standardization and Harmonization:

  • Ingest large-scale datasets from diverse clinical and research sources.

  • Collaborate with data science teams to harmonize data across systems

  • Implement best practices for cleaning and standardizing data to enable consistent analytics.

Standards Compliance and Modeling:

  • Ensure datasets meet healthcare and research compliance requirements by aligning data with established Common Data Models such as CDISC and OMOP.

  • Work closely with clinical data teams to maintain integrity and usability of standardized datasets.

Workflow Development and Reproducibility:

  • Develop optimize and automate workflows using tools like Snakemake or Nextflow.

  • Containerize pipelines using Docker to support reproducibility and scalability across research and production environments.

  • Promote continuous integration and deployment within data workflows.

Collaboration and Documentation:

  • Work closely with multidisciplinary teams including data scientists biostatisticians and software engineers to align data infrastructure with project needs.

  • Maintain comprehensive documentation of pipeline architectures and workflow logic to ensure clarity transparency and reproducibility.

Required:

  • Bachelors degree in computer science Data Engineering Bioinformatics or a related field with 5 years of relevant experience; or a Masters degree with 2-3 years of experience.

  • Proven ability to design build and maintain scalable data pipelines and automate ETL processes.

  • Hands-on experience working with clinical or research data and familiarity with healthcare data standards and Common Data Models (e.g. CDISC OMOP).

  • Familiarity with big data frameworks like Apache Spark or Hadoop.

  • Strong skills in Python SQL and shell scripting (e.g. Bash).

  • Experience using Docker to containerize data workflows for reproducibility and scalability.

  • Proficiency with version control systems like Git and continuous integration practices.

Preferred:

  • Experience with cloud platforms (e.g. AWS GCP Azure) for large-scale data processing.

  • Proficiency with workflow management systems such as Snakemake Nextflow or similar tools.

Disclaimer: The above description is meant to illustrate the general nature of work and level of effort being performed by individuals assigned to this position or job description. This is not restricted as a complete list of all skills responsibilities duties and/or assignments required. Individuals may be required to perform duties outside of their position job description or responsibilities as needed.

The diversity of Axles employees is a tremendous asset. We are firmly committed to providing equal opportunity in all aspects of employment and will not tolerate any illegal discrimination or harassment based on age race gender religion national origin disability marital status covered veteran status sexual orientation status with respect to public assistance and other characteristics protected under state federal or local law and to deter those who aid abet or induce discrimination or coerce others to discriminate.

Accessibility: If you need an accommodation as part of the employment process please contact:

This role has a market-competitive salary with an anticipated base compensation range listed below. Actual salaries will vary depending on a candidates experience qualifications skills and location.

#INDPSD

Employment Type

Full Time

About Company

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.