About Definitive Healthcare:
At Definitive Healthcare (NASDAQ: DH) were passionate about turning data analytics and expertise into meaningful intelligence that helps our customers achieve success and shape the future of healthcare. We empower them to uncover the right markets opportunities and peoplepaving the way for smarter decisions and greater impact.
Headquartered just outside of Boston Massachusetts Definitive Healthcare operates across North America Europe and India supporting a growing global client base of more than 2400 customers since our founding in 2011.
Were also a great place to work. In 2024 and 2025 we earned multiple workplace honors including Built Ins 100 Best Places to Work in Boston (both years) a Stevie Bronze Award for Great Employers and recognition as a Great Place to Work in India.
We foster a collaborative inclusive culture where diverse perspectives drive innovation. Through programs like DefinitiveCares and our employee-led affinity groups we strive to promote connection education and inclusion.
We are looking for aData Engineer who is passionate about building scalable data pipelines working with complex healthcare datasets and contributing to a modern cloudnative data architecture.
If you thrive in a fastpaced datadriven environment and have strong experience with Python Spark Databricks AWS SQL and related technologies wed love to hear from you.
Design and Develop Data Pipelines:
Develop and maintain robust data pipelines using Python Spark Databricks SQL and SSIS
Implement and orchestrate ETL/ELT workflows using Apache Airflow and SSIS
Build reliable repeatable processes that support the ingestion and transformation of large healthcare datasets
Data Integration and Management:
Integrate data from diverse sources (AWS onprem thirdparty vendors) into our enterprise data platform
Work with a wide range of file formats including CSV XML Parquet Delta and more
Apply strong data quality cleansing and curation practices to ensure accuracy and consistency
Optimize storage and compute resources for performance cost and scalability
Automate observability and monitoring across data pipelines and workloads
Metadata Management and Governance:
Implement and manage Unity Catalog for metadata lineage and access control
Ensure adherence to data governance security and privacy standards
Maintain clear documentation data dictionaries and lineage tracking
Contribute to automation of data observability and governance workflows
Performance Tuning and Troubleshooting:
Tune and optimize Spark jobs for speed reliability and cost efficiency
Diagnose and resolve performance bottlenecks across distributed systems
Apply JVM tuning and Spark optimization techniques to improve throughput
Data Maturity Lifecycle:
Support and enhance our Medallion architecture (bronze/silver/gold) to improve data quality and usability
Ensure data is processed enriched and validated at each stage of the lifecycle
Collaboration and Continuous Improvement:
Partner with data scientists analysts product teams and business stakeholders to understand data needs
Implement CI/CD pipelines to streamline deployment and testing of data assets
Stay current with emerging technologies and bring forward recommendations to evolve our data platform
Technical Skills:
Strong programming experience in SQL and Python or Scala
Handson experience with Apache Spark and Databricks
Experience with Apache Airflow or similar orchestration tools
Knowledge of data cleansing curation and quality frameworks
Familiarity with Unity Catalog or other metadata management tools
Understanding of data governance security and compliance best practices
Experience working with AWS cloud services
Proficiency with CI/CD tools (Jenkins GitLab CI etc.)
Experience tuning Spark jobs and JVMbased applications
Experience implementing or working within a Medallion architecture
Strong analytical and problemsolving abilities
Excellent communication and crossfunctional collaboration skills
Ability to work independently and within a team environment
High attention to detail and commitment to quality
AWS certifications (e.g. AWS Certified Data Analytics)
Experience with SQL and NoSQL databases
Background in a fastpaced datacentric SaaS or healthcare environment
Compensation and Benefits
The salary range for this position is $69000 $129000 per year which represents the base pay the company reasonably and in good faith expects to pay for this role. Actual pay within this range will be determined based on factors such as relevant experience skills and qualifications.
Depending on the position employees may also be eligible to participate in a company bonus or commission plan. All employees are eligible for a comprehensive benefits package including medical dental and vision coverage unlimited paid time off and participation in the companys 401(k) plan with employer contribution.
Why we love Definitive and why you will too!
What our Employees are saying about us on Glassdoor:
Great Work atmosphere great work life balance excellent company to work for amazing top notch product incredible customer service lots of tools to help you succeed.
-Business Development Manager
Great team. Amazing growth. Employees are treated very well.
-Research Analyst
I have waited 36 years to work at a dream job for a dream company and I am so happy to have finally got there.
-Profile Analyst
If you dont fit all of these qualifications but believe youre still a great fit feel free to apply and tell us why in your cover letter.
If you are a California Colorado New York City or Washington resident and this role is a remote role you can receive additional information about the compensation and benefits for this role which we will provide upon request.
Definitive Hiring Philosophy
Definitive Healthcare is an equal opportunity employer that celebrates diversity and is committed to creating an inclusive workplace with equal opportunity for all applicants and teammates. Our goal is to recruit the most talented people from a diverse candidate pool regardless of race color religion age gender gender identity sexual orientation or any other status. If youre interested in working in a fast growing exciting working environment we encourage you to apply!
Privacy
Your privacy is important to us. Please review our Candidate Privacy Notice which tells you how we use and process your personal information.
Please note: All communications regarding the hiring process at Definitive Healthcare will come directly from one of our corporate recruiters or coordinators with an @ email address. We will never request any money transfer or purchase of equipment with a promise of reimbursement. If you receive any suspicious communications please reach out to to confirm your status in the application process.
Required Experience:
IC
Convert data and analytics into healthcare commercial intelligence to identify markets, opportunities, and key stakeholders for future success.