HealthLeap builds AI that helps clinicians prioritize patients surfaces the right data and gets patients the care they need earlier so they can leave the hospital sooner.
We integrate with hospital electronic health record systems screen 100% of patients daily and risk-rank them in real time. Clinicians at Cedars-Sinai and Penn Medicine start every morning with HealthLeap with Houston Methodist Emory and Intermountain Health deploying now.
Real results: 39% more diagnoses. 4 days earlier detection. $11M/year ROI for our first site at Cedars Sinai. 7 revenue growth in 7 months.
We started with malnutrition. Were expanding to every major condition to ensure no patient falls through the cracks. Sequoia and First Round are backing us to build the platform that screens every patient for everything and drives tangible outcomes.
Were 15 people. >$7M raised. SF-based hybrid-friendly. Early enough to shape the product. Late enough to know it works. Results that are changing lives.
Outcomes Youll Drive
Condition expansion velocity: Idea signal & label viability using current EHR data validated model customer-ready (for viable use cases weeks not months)
Improving patient health outcomes: Quantified length of stay (LOS) reduction readmission reduction mortality reduction with clear confidence intervals and robust counterfactuals.
Pilot production conversion: Run retrospective analyses on hospital data to prove impact then transition validated pilots into live deployments that deliver measurable outcomes.
Role Overview
Were looking to hire a product-minded Data Scientist with a sound theoretical knowledge foundation. You will own end-to-end problem framing timeline scoping experimental design and model iteration. Youll work closely with our CEO and small team to launch new models quickly and safely by leveraging and expanding on our existing feature tables. You will also run retrospective pilots to estimate clinical and financial impact (reimbursement lift LOS reduction mortality reduction) and support pre-sales by meeting AI/Data Science leaders at world-class health systems to share your clinical and financial model assumptions and development methodologies.
Key Responsibilities
Own end-to-end modeling from financial incentives and problem framing to a validated model.
Estimate impact with rigorous retrospective analyses (LOS readmissions mortality reimbursement).
Productionize pipelines and rollouts with reliability.
Monitor & improve: drift calibration/uncertainty and fairness (Independence/Separation/Sufficiency).
Translate research into pragmatic wins for our platform.
Partner with stakeholders: clear visuals crisp narratives and method presentation for analysts clinicians and executives.
Requirements
Passionate about AIs potential in healthcare; outcomes-oriented with a focus on impact not just research.
Statistics: parametric and non-parametric tests hypothesis testing experimental design confidence intervals and causal inference basics.
ML fluency: Python SQL; polars (or pandas) scikit-learn XGBoost/LightGBM (PyTorch/transformers a plus); survival/time-to-event experience is great.
Visualization & storytelling: Expert at turning complex analyses into crisp user visualizations dashboards and narratives for clinicians and executives.
Customer-facing: Comfortable interviewing stakeholders presenting to AI/data science leaders and defending methods.
Read the latest research and rapidly translate new statistical/ML papers into pragmatic wins.
3 - 5 years of relevant experience from a high-growth environment.
BS/MS in Statistics Biostats CS or equivalent experience.
Resourceful fast learner high ownership bias to action fast experimentation cycles and ability to work independently while collaborating in a small team.
Understanding of fairness: Independence Separation and Sufficiency
Nice-to-Haves
Background in applied AI companies with strong product traction (not hype-driven firms).
Interest in healthcare data (e.g. from research labs with practical applications).
Side projects demonstrating productionization (e.g. turning prototypes like landing agents into reliable systems).
Uncertainty quantification
Covariate and prediction drift detection in production
Hands-on experience with LLMs in production; LLMs for clinical text weak/active/semi-supervised learning.
Strong software engineering skills with proven ML experience: Productionizing models (tabular/text data preferred; not pure vision specialists) and building scalable pipelines.
Familiarity with EHR schemas/standards (FHIR/HL7) IRB/validation study workflows and model governance.
We Provide:
Competitive salary with performance-based incentives
Comprehensive Healthcare Benefits - we cover 100% of premiums for employees
Unlimited Paid Time Off - we need you at your best at all times. Our recommended time off of 20 PTO days per year lets you schedule your work around your life.
401K match of up to 4% of employee salary
Laptop and equipment budget to set up your at-home office environment
Lunch snacks and drinks are provided in the office to ensure you never go hungry :)
Opportunity for professional growth in a dynamic fast-paced startup environment
Location: San Francisco (hybrid)
Compensation is dependent on experience overall fit to our role and candidate location.
If youre passionate about applying frontier AI to real-world impact join us in building healthcares future.
HealthLeap continuously screens every patient using live chart data so patients get better faster, and you get reimbursed for the care you deliver.