Research Scientist - Speech Synthesis - Seattle WA (Onsite)
This role offers the rare opportunity to shape foundational technology in a space where the boundaries are still being defined.
Youll be building the first human foundation model that operates across text speech facial expression and body language in real time. This unified model understands fine-grained human signals from a quirked eyebrow to a subtle change in voice and infers meaning in context
You will generate lifelike responsive avatars whose expressions gestures and tone evolve frame-by-frame to deliver genuine responses.
Seniority
At least 1 year of hands-on experience building and deploying AI/ML systems in real-world environments.
2 - 5 years of experience in full-stack software engineering with ownership of production-grade systems (frontend backend and scalable infrastructure)
Work experience
Built complex production-grade systems across the full stack ideally including AI-driven evaluation or scoring infrastructure.
Experience with simulation design (e.g. game development physics simulations).
Experience building or working on interactive simulation systems (e.g. browser-based tools game/physics engines or real-time user workflows).
Education
Degree in Engineering Computer Science or a related hardtech field from a rigorous program
Hard skills
Strong experience in AI/ML particularly building or deploying multi-modal systems (e.g. video telemetry artifact ingestion).
Experience working with CAD/ECAD data simulation tooling or video processing pipelines in production environments.
Updated
Have deep experience applying AI to human evaluation problems scoring open-ended responses inferring latent behavioral traits modeling cognitive patterns or building systems that assess how people think rather than just what they know
Soft skills
Highly autonomous able to own features end-to-end.
Demonstrates passion for the field e.g. works on side projects.
Miscellaneous
Based in San Francisco and can work hybrid (3 days/week in-office).
Willing to work flexible hours including occasional weekends when necessary.
Research Scientist - Speech Synthesis - Seattle WA (Onsite) This role offers the rare opportunity to shape foundational technology in a space where the boundaries are still being defined. Youll be building the first human foundation model that operates across text speech facial expression and body l...
Research Scientist - Speech Synthesis - Seattle WA (Onsite)
This role offers the rare opportunity to shape foundational technology in a space where the boundaries are still being defined.
Youll be building the first human foundation model that operates across text speech facial expression and body language in real time. This unified model understands fine-grained human signals from a quirked eyebrow to a subtle change in voice and infers meaning in context
You will generate lifelike responsive avatars whose expressions gestures and tone evolve frame-by-frame to deliver genuine responses.
Seniority
At least 1 year of hands-on experience building and deploying AI/ML systems in real-world environments.
2 - 5 years of experience in full-stack software engineering with ownership of production-grade systems (frontend backend and scalable infrastructure)
Work experience
Built complex production-grade systems across the full stack ideally including AI-driven evaluation or scoring infrastructure.
Experience with simulation design (e.g. game development physics simulations).
Experience building or working on interactive simulation systems (e.g. browser-based tools game/physics engines or real-time user workflows).
Education
Degree in Engineering Computer Science or a related hardtech field from a rigorous program
Hard skills
Strong experience in AI/ML particularly building or deploying multi-modal systems (e.g. video telemetry artifact ingestion).
Experience working with CAD/ECAD data simulation tooling or video processing pipelines in production environments.
Updated
Have deep experience applying AI to human evaluation problems scoring open-ended responses inferring latent behavioral traits modeling cognitive patterns or building systems that assess how people think rather than just what they know
Soft skills
Highly autonomous able to own features end-to-end.
Demonstrates passion for the field e.g. works on side projects.
Miscellaneous
Based in San Francisco and can work hybrid (3 days/week in-office).
Willing to work flexible hours including occasional weekends when necessary.
View more
View less