Member of Technical Staff Safety Lead

San Francisco, CA - USA

Monthly Salary: Not Disclosed

Posted on: 30+ days ago

Vacancies: 1 Vacancy

Department:

Engineering

Job Summary

Our Mission

Reflections mission is to build open superintelligence and make it accessible to all.

Were developing open weight models for individuals agents enterprises and even nation states. Our team of AI researchers and company builders come from DeepMind OpenAI Google Brain Meta Anthropic and beyond.

About the Role

Own the red-teaming and adversarial evaluation pipeline for Reflections models continuously probing for failure modes across security misuse and alignment gaps.
Work hand-in-hand with the Alignment team to translate safety findings into concrete guardrails ensuring models behave reliably under stress and adhere to deployment policies.
Validate that every release meets the labs risk thresholds before it ships serving as a critical gatekeeper for our open weight releases.
Develop scalable automated safety benchmarks that evolve alongside our model capabilities moving beyond static datasets to dynamic adversarial testing.
Research and implement state-of-the-art jailbreaking techniques and defenses to stay ahead of potential vulnerabilities in the wild.

About You

Graduate degree (MS or PhD) in Computer Science Machine Learning or related discipline or equivalent practical experience in AI Safety.
Deep technical understanding of LLM safety including adversarial attacks red-teaming methodologies and interpretability.
Strong software engineering capabilities with experience building automated evaluation pipelines or large-scale ML systems.
Experience with Reinforcement Learning (RLHF/RLAIF) and how it impacts model safety and alignment is a strong plus.
Thrive in a fast-paced high-agency startup environment with bias toward action.
Willing to make high-stakes decisions regarding model release and safety thresholds.
Passionate about advancing the frontier of intelligence.

What We Offer:

We believe that to build superintelligence that is truly open you need to start at the foundation. Joining Reflection means building from the ground up as part of a small talent-dense team. You will help define our future as a company and help define the frontier of open foundational models.

We want you to do the most impactful work of your career with the confidence that you and the people you care about most are supported.

Top-tier compensation: Salary and equity structured to recognize and retain the best talent globally.
Health & wellness: Comprehensive medical dental vision life and disability insurance.
Life & family: Fully paid parental leave for all new parents including adoptive and surrogate journeys. Financial support for family planning.
Benefits & balance: paid time off when you need it relocation support and more perks that optimize your time.
Opportunities to connect with teammates: lunch and dinner are provided daily. We have regular off-sites and team celebrations.

Required Experience:

Staff IC

Our MissionReflections mission is to build open superintelligence and make it accessible to all.Were developing open weight models for individuals agents enterprises and even nation states. Our team of AI researchers and company builders come from DeepMind OpenAI Google Brain Meta Anthropic and bey...

Our Mission

Reflections mission is to build open superintelligence and make it accessible to all.

About the Role

Own the red-teaming and adversarial evaluation pipeline for Reflections models continuously probing for failure modes across security misuse and alignment gaps.
Work hand-in-hand with the Alignment team to translate safety findings into concrete guardrails ensuring models behave reliably under stress and adhere to deployment policies.
Validate that every release meets the labs risk thresholds before it ships serving as a critical gatekeeper for our open weight releases.
Develop scalable automated safety benchmarks that evolve alongside our model capabilities moving beyond static datasets to dynamic adversarial testing.
Research and implement state-of-the-art jailbreaking techniques and defenses to stay ahead of potential vulnerabilities in the wild.

About You

Graduate degree (MS or PhD) in Computer Science Machine Learning or related discipline or equivalent practical experience in AI Safety.
Deep technical understanding of LLM safety including adversarial attacks red-teaming methodologies and interpretability.
Strong software engineering capabilities with experience building automated evaluation pipelines or large-scale ML systems.
Experience with Reinforcement Learning (RLHF/RLAIF) and how it impacts model safety and alignment is a strong plus.
Thrive in a fast-paced high-agency startup environment with bias toward action.
Willing to make high-stakes decisions regarding model release and safety thresholds.
Passionate about advancing the frontier of intelligence.

What We Offer:

We want you to do the most impactful work of your career with the confidence that you and the people you care about most are supported.

Top-tier compensation: Salary and equity structured to recognize and retain the best talent globally.
Health & wellness: Comprehensive medical dental vision life and disability insurance.
Life & family: Fully paid parental leave for all new parents including adoptive and surrogate journeys. Financial support for family planning.
Benefits & balance: paid time off when you need it relocation support and more perks that optimize your time.
Opportunities to connect with teammates: lunch and dinner are provided daily. We have regular off-sites and team celebrations.

Required Experience:

Staff IC

Key Skills

ICT
ASP.NET
Gas
Field

Apply Now

About Company

Reflection

Building frontier open intelligence.

View Profile View Profile

AI AutoApply

Apply to 100+ jobs with one click

AI Resume Builder

Create an ATS-ready CV in minutes

AI Cover Letter

Write a personalized letter instantly

Member of Technical Staff Safety Lead

San Francisco, CA - USA

Department:

Job Summary

Our Mission

About the Role

About You

What We Offer:

Our Mission

About the Role

About You

What We Offer:

Key Skills

About Company

Related Jobs