We are building a rigorous, verifiable evaluation suite of Terminal-Bench tasks designed to test the limits of large language models on multilingual software challenges. Our goal is to measure multilingual robustness across prompt-language effects, non-English data processing, and comple
Overview
LILT is building a global network of domain experts to support high-quality AI evaluation across training, benchmarking, red-teaming, and ongoing model optimization. We are seeking legal and compliance professionals to contribute expert judgment to human-in-the-loop AI evaluation.
Overview
LILT is building a global network of domain experts to support high-quality AI evaluation across training, benchmarking, red-teaming, and ongoing model monitoring. We are seeking software engineering and DevOps professionals to contribute expert judgment to human-in-the-loop AI evaluation.
Overview
LILT is building a global network of domain experts to support high-quality AI evaluation across training, benchmarking, red-teaming, and ongoing model monitoring. We are seeking finance and investment professionals to contribute expert judgment to human-in-the-loop AI evaluation.