Employer Active
Job Alert
You will be updated with latest job alerts via emailJob Alert
You will be updated with latest job alerts via emailNot Disclosed
Salary Not Disclosed
1 Vacancy
About TaskUs: TaskUs is a provider of outsourced digital services and next-generation customer experience to fast-growing technology companies helping its clients represent protect and grow their brands. Leveraging a cloud-based infrastructure TaskUs serves clients in the fastest-growing sectors including social media e-commerce gaming streaming media food delivery ride-sharing HiTech FinTech and HealthTech.
The People First culture at TaskUs has enabled the company to expand its workforce to approximately 45000 employees globally. Presently we have a presence in twenty-three locations across twelve countries which include the Philippines India and the United States.
It started with one ridiculously good idea to create a different breed of Business Processing Outsourcing (BPO)! We at TaskUs understand that achieving growth for our partners requires a culture of constant motion exploring new technologies being ready to handle any challenge at a moments notice and mastering consistency in an ever-changing world.
What We Offer: At TaskUs we prioritize our employees well-being by offering competitive industry salaries and comprehensive benefits packages. Our commitment to a People First culture is reflected in the various departments we have established including Total Rewards Wellness HR and Diversity. We take pride in our inclusive environment and positive impact on the community. Moreover we actively encourage internal mobility and professional growth at all stages of an employees career within TaskUs. Join our team today and experience firsthand our dedication to supporting People First.
Our researchers invent safety methods and our delivery teams deploy them but we need data to prove they work fast. As our Data Scientist AI Safety Services youll bring statistical rigor and analytical firepower to every benchmark redteam and client rollout. Your models dashboards and experiments will show customers where their risks lie and how our interventions move the needle.
The impact youll make:
Quantify safety: Transform evaluation outputs into clear metrics confidence intervals and risk scores that guide client decisions.
Accelerate research: Provide rapid statistical feedback that helps researchers iterate on alignment robustness and interpretability ideas.
Strengthen delivery: Build data pipelines and visualizations that give Solutions and Operations teams realtime visibility into project health.
What youll do:
Own analytics pipelines for jailbreak detection rates toxicity scores drift signals and other safety KPIs - ETL through presentation.
Design A/B and sequential tests to measure the impact of prompt tweaks finetuning or policy changes on model behavior.
Develop risk models & dashboards in Python (Pandas NumPy Plotly Streamlit) backed by scalable storage (BigQuery Redshift or similar).
Collaborate with Researchers to choose statistical methods validate assumptions and publish reproducible notebooks alongside papers.
Partner with Solutions Engineering to embed metrics in client reports and scope data requirements for new engagements.
Automate reporting: Build CI hooks or Airflow jobs so safety scores refresh with each new model drop or data batch.
Stay current: Evaluate new safety benchmarks opensource metric libraries and MLOps best practices; recommend adoption where useful.
Experiences youll bring:
35 years in data science analytics engineering or applied statisticsideally in an AI/ML or risk domain.
Proven track record turning messy highvolume data into actionable insights and visualizations for stakeholders.
Solid understanding of machinelearning evaluation workflows; exposure to large language model outputs a plus.
Hands-on experience with Python data stack (Pandas NumPy SciPy scikitlearn) and SQLor equivalents in Spark/BigQuery.
Familiarity with experiment design hypothesis testing and causal inference techniques.
Core skills youll need:
Statistical rigor: power analysis significance testing bootstrap / Bayesian methods.
Data engineering basics: ETL data validation versioning and scalable storage.
Visualization & storytelling: interactive dashboards and clear narratives for technical & executive audiences.
Scripting & automation: write maintainable code schedule pipelines and integrate with CI/CD.
Collaboration: work fluidly with researchers engineers and clientsfacing teams under tight timelines.
Nice to have: Experience with MLflow or Weights & Biases exposure to privacypreserving analytics or familiarity with NIST RMF / EU AI Act metrics.
Why youll love this role:
Remote-first flexibility: Work where youre most productive!
Immediate impact: Your analyses steer decisions for frontier AI deployments weeks after you write them.
Breadth of problems: From toxicity scoring to drift detection no two problems are the same.
Growth runway: Define the data science discipline for AI Safety at TaskUs and grow into a senior or lead role as the team scales.
Mission focus: Help ensure the worlds most powerful models behave safely and fairly.
Ready to turn safety data into decisive action
Apply today and bring clarity to the front lines of trusted AI.
How We Partner To Protect You: TaskUs will neither solicit money from you during your application process nor require any form of payment in order to proceed with your application. Kindly ensure that you are always in communication with only authorized recruiters of TaskUs.
DEI: In TaskUs we believe that innovation and higher performance are brought by people from all walks of life. We welcome applicants of different backgrounds demographics and circumstances. Inclusive and equitable practices are our responsibility as a business. TaskUs is committed to providing equal access to opportunities. If you need reasonable accommodations in any part of the hiring process please let us know.
We invite you to explore all TaskUs career opportunities and apply through the provided URL
Full Time