Senior Data Scientist

Biohub

Not Interested
Bookmark
Report This Job

profile Job Location:

Redwood City, CA - USA

profile Monthly Salary: $ 190000 - 261800
Posted on: 2 days ago
Vacancies: 1 Vacancy

Job Summary

Biohub is leading the new era of AI-powered biology to cure or prevent disease through its 501c3 medical research organization with the support of the Chan Zuckerberg Initiative.

The Team

Biohub supports the science and technology that will make it possible to help scientists cure prevent or manage all diseases by the end of this century. While this may seem like an audacious goal in the last 100 years biomedical science has made tremendous strides in understanding biological systems advancing human health and treating disease.

Achieving our mission will only be possible if scientists are able to better understand human biology. To that end we have identified four grand challenges that will unlock the mysteries of the cell and how cells interact within systems paving the way for new discoveries that will change medicine in the decades that follow:

  • Building an AI-based virtual cell model to predict and understand cellular behavior
  • Developing state-of-the-art imaging systems to observe living cells in action
  • Instrumenting tissues to better understand inflammation a key driver of many diseases
  • Engineering and harnessing the immune system for early detection prevention and treatment of disease

As a Senior Data Scientist youll lead the creation of groundbreaking datasets that power our AI/ML efforts within and across our scientific grand challenges. Working at the intersection of data science biology and AI your work will focus on creating large AI-ready datasets spanning single-cell sequencing immune receptor profiling and mass spectrometry peptidomics data. You will define data needs format standards analysis approaches and quality metrics and build pipelines to ingest transform and validate data products that form the foundation of our experiments.

Our Data Ecosystem:

These efforts will form a part of and interoperate with our larger larger data ecosystem. We are generating unprecedented scientific datasets that drive biological innovation:

  • Billions of standardized cells of single-cell transcriptomic data with a focus on measuring genetic and environmental perturbations
  • 10s of thousands of donor-matched DNA & RNA samples
  • 10s PBs-scale static and dynamic imaging datasets
  • 100s TBs-scale mass spectrometry datasets
  • Diverse large multi-modal biological datasets that enable biological bridges across measurement types and facilitate multi-modal model training to define how cells act.

When analysis of a dataset is complete you will help publish it through public resources like CELLxGENE Discover the CryoET Portal and the Virtual Cell Platform used by tens of thousands of scientists monthly to advance understanding of genetic variants disease risk drug toxicities and therapeutic discovery.

Your Impact

Youll collaborate with cross-functional teams to lead dataset definition ingestion transformation and delivery for AI modeling and experimental analysis. Success means delivering high-quality usable datasets that directly address modeling challenges and accelerate scientific progress. Join us in building the data foundation that will transform our understanding of human biology and move us along the path to curing preventing and managing all disease.

What Youll Do

  • Contribute the tools required for a robust data ecosystem: build single cell data ingestion pipelines select data formats standards and database schemas and write validation tools QC approaches and analysis pipelines.
  • Collaborate with Platform Scientists ML engineers AI Researchers and Data Engineers to iteratively evaluate refine and grow datasets to improve our biological understanding of inflammation.
  • Work closely with Platform Scientists to identify technical variables and devise approaches to harmonize data across generation sites to enable joint analysis.
  • Discover and define new data generation opportunities and manage the delivery of those data products to our scientific teams.

What Youll Bring

  • 10 years of experience with large scale high throughput biological data (single cell sequencing immune receptor profiling mass spectrometry).
  • Demonstrated ability to deliver multiple large biological data products.
  • Experience with big data: extraction transport loading databases standardization validation QC and analysis.
  • Experience with processing and orchestration pipelines such as Argo Workflows Databricks
  • Strong fundamentals in statistical reasoning and machine learning.
  • Experience with biological data analysis and QC best practices
  • Excellent written and verbal communication skills.
  • Enthusiasm to ramp up on technologies and learn new domains.
  • Experience working in a multidisciplinary environment (scientific platforms engineering product AI Research).

Compensation

The Redwood City CA base pay range for this role is $190000 - $261800. New hires are typically hired into the lower portion of the range enabling employee growth in the range over time. Actual placement in range is based on job-related skills and experience as evaluated throughout the interview process.

Work Mode

As we grow were excited to strengthen in-person connections and cultivate a collaborative team-oriented environment. This role is a hybrid position requiring you to be onsite for at least 60% of the working month approximately 3 days a week with specific in-office days determined by the teams manager. The exact schedule will be at the hiring managers discretion and communicated during the interview process.

Benefits for the Whole You

Were thankful to have an incredible team behind our work. To honor their commitment we offer a wide range of benefits to support the people who make all we do possible.

  • Provides a generous employer match on employee 401(k) contributions to support planning for the future.
  • Paid time off to volunteer at an organization of your choice.
  • Funding for select family-forming benefits.
  • Relocation support for employees who need assistance moving

If youre interested in a role but your previous experience doesnt perfectly align with each qualification in the job description we still encourage you to apply as you may be the perfect fit for this or another role.

#LI-Hybrid


Required Experience:

Senior IC

Biohub is leading the new era of AI-powered biology to cure or prevent disease through its 501c3 medical research organization with the support of the Chan Zuckerberg Initiative.The TeamBiohub supports the science and technology that will make it possible to help scientists cure prevent or manage al...
View more view more

Key Skills

  • Laboratory Experience
  • Mammalian Cell Culture
  • Biochemistry
  • Assays
  • Protein Purification
  • Research Experience
  • Next Generation Sequencing
  • Research & Development
  • cGMP
  • Cell Culture
  • Molecular Biology
  • Flow Cytometry