Job Title: Data Scientist (Healthcare)
Location: 100% Remote
Visa: USC GC H1B Transfer EAD/W2
Interview Process: Recruiter screen (skills & domain fit)
Technical deep dive (SQL modeling case study)
Practical exercise (notebook or take-home)
Stakeholder panel (communication & healthcare use-case discussion
About the Role: Were looking for a hands-on Data Scientist to solve real-world business problems in healthcare/health insurance using machine learning and modern cloud tooling. Youll own the full lifecycle-from problem framing and data wrangling to modeling deployment support and stakeholder storytelling.
Top Skills: Python/R SQL Spark/Databricks Azure ML or Vertex AI MLflow Git/GitHub Jupyter/Databricks notebooks Power BI/Tableau Azure/GCP services.
What Youll Do:
- Partner with business/clinical stakeholders to define measurable use cases (e.g. risk & cost forecasting readmission/LOS prediction member churn fraud/waste/abuse).
- Explore cleanse and engineer tabular data from claims eligibility EMR utilization and care-management sources.
- Build and evaluate models using regression classification time series clustering trees/GBMs; pilot deep-learning where additive value.
- Productionize with data & MLOps teams: feature pipelines (Spark/Databricks) model packaging monitoring drift/decay and A/B testing.
- Write clear analyses dashboards and exec-ready presentations; translate findings into actions and ROI.
- Ensure privacy/compliance (HIPAA/PHI) data governance and reproducible research (git notebooks experiment tracking).
- Contribute to LLM initiatives: prompt engineering RAG basic fine-tuning and experiments with agentic AI frameworks for workflow automation.
Required Qualifications
- 6 8 years of hands-on data science delivering business impact (healthcare/health insurance experience is a big plus).
- Strong tabular data handling/manipulation skills.
- Solid SQL and proficiency in at least one statistical programming language (Python or R).
- Working knowledge of MS Office (Excel/PowerPoint/Access) for quick analyses and stakeholder comms.
- Experience with Big Data tools: HDFS Hive Spark MapR-DB (or equivalent).
- Practical experience with statistical & ML techniques: linear/logistic regression time series clustering decision trees tree ensembles (XGBoost/LightGBM).
- Cloud & ML platforms: Databricks Azure ML and/or Google Vertex AI (pipelines model registry deployment workflows).
- Exposure to prompt engineering LLM fine-tuning (LoRA/PEFT or platform-native) and agentic AI concepts.
- Clear concise communicator; able to convert ambiguity into structured analysis and decisions.
- BS/MS in Computer Science Statistics Applied Math Engineering or related field (or equivalent experience).
#LI-BS1
Job Title: Data Scientist (Healthcare) Location: 100% Remote Visa: USC GC H1B Transfer EAD/W2 Interview Process: Recruiter screen (skills & domain fit) Technical deep dive (SQL modeling case study) Practical exercise (notebook or take-home) Stakeholder panel (communication & healthcare use-case disc...
Job Title: Data Scientist (Healthcare)
Location: 100% Remote
Visa: USC GC H1B Transfer EAD/W2
Interview Process: Recruiter screen (skills & domain fit)
Technical deep dive (SQL modeling case study)
Practical exercise (notebook or take-home)
Stakeholder panel (communication & healthcare use-case discussion
About the Role: Were looking for a hands-on Data Scientist to solve real-world business problems in healthcare/health insurance using machine learning and modern cloud tooling. Youll own the full lifecycle-from problem framing and data wrangling to modeling deployment support and stakeholder storytelling.
Top Skills: Python/R SQL Spark/Databricks Azure ML or Vertex AI MLflow Git/GitHub Jupyter/Databricks notebooks Power BI/Tableau Azure/GCP services.
What Youll Do:
- Partner with business/clinical stakeholders to define measurable use cases (e.g. risk & cost forecasting readmission/LOS prediction member churn fraud/waste/abuse).
- Explore cleanse and engineer tabular data from claims eligibility EMR utilization and care-management sources.
- Build and evaluate models using regression classification time series clustering trees/GBMs; pilot deep-learning where additive value.
- Productionize with data & MLOps teams: feature pipelines (Spark/Databricks) model packaging monitoring drift/decay and A/B testing.
- Write clear analyses dashboards and exec-ready presentations; translate findings into actions and ROI.
- Ensure privacy/compliance (HIPAA/PHI) data governance and reproducible research (git notebooks experiment tracking).
- Contribute to LLM initiatives: prompt engineering RAG basic fine-tuning and experiments with agentic AI frameworks for workflow automation.
Required Qualifications
- 6 8 years of hands-on data science delivering business impact (healthcare/health insurance experience is a big plus).
- Strong tabular data handling/manipulation skills.
- Solid SQL and proficiency in at least one statistical programming language (Python or R).
- Working knowledge of MS Office (Excel/PowerPoint/Access) for quick analyses and stakeholder comms.
- Experience with Big Data tools: HDFS Hive Spark MapR-DB (or equivalent).
- Practical experience with statistical & ML techniques: linear/logistic regression time series clustering decision trees tree ensembles (XGBoost/LightGBM).
- Cloud & ML platforms: Databricks Azure ML and/or Google Vertex AI (pipelines model registry deployment workflows).
- Exposure to prompt engineering LLM fine-tuning (LoRA/PEFT or platform-native) and agentic AI concepts.
- Clear concise communicator; able to convert ambiguity into structured analysis and decisions.
- BS/MS in Computer Science Statistics Applied Math Engineering or related field (or equivalent experience).
#LI-BS1
View more
View less