AI Benchmark Engineer Jobs, International

18 Jobs Found

AI Performance Engineer

Mbr Partners

Dubai - UAE

AI Performance Engineer Description: Our client is a leading technology company specialising in the design and development of cutting-edge customised server hardware solutions optimised for artificial intelligence and machine learning applications. Their mission is to empower businesses and researchers...

Yesterday
Full Time

AI Benchmark Engineer | Native Language Specialist...

Lilt

New Delhi - India

About the Opportunity: We are building a rigorous, verifiable evaluation suite of Terminal-Bench tasks designed to test the limits of large language models on multilingual software challenges. Our goal is to measure multilingual robustness across prompt language effects, non-English data processing, and...

7 days ago
Contract

AI Benchmark Engineer | Native Language Specialist...

Lilt

Beijing - China

About the Opportunity: We are building a rigorous, verifiable evaluation suite of Terminal-Bench tasks designed to test the limits of large language models on multilingual software challenges. Our goal is to measure multilingual robustness across prompt language effects, non-English data processing, and...

7 days ago
Contract

AI Benchmarking Lead, Performance Benchmarking Eva...

Amazon

Hyderabad - India

Join our mission-critical team supporting Seller Assistant, Amazon's Gen-AI-powered copilot that helps sellers navigate Amazon's complex ecosystem and grow their businesses. As a Quality Assurance Specialist, you'll play a pivotal role in ensuring the reliability and accuracy of AI model evaluations as w...

18 days ago
Full Time

AI Benchmarking Spec. Chinese, International Selle...

Amazon

Shanghai - China

The Seller AI team within the International Seller Services organization is focused on helping sellers with the right set of Gen-AI/LLM-powered tools and agentic solutions that can enable them to accelerate business growth on Amazon. Our primary focus lies in handling annotations for training, measuring...

30+ days ago
Full Time

AI Benchmarking Team Lead, International Seller Gr...

Amazon

Bengaluru - India

The Seller AI team within the International Seller Services organization is focused on helping sellers with the right set of Gen-AI/LLM-powered tools and agentic solutions that can enable them to accelerate business growth on Amazon. Our primary focus lies in handling annotations for training, measuring...

30+ days ago
Full Time

AI Benchmarking Specialist, Sp Support Spanish, In...

Amazon

Bengaluru - India

The Seller AI team within the International Seller Services organization is focused on helping sellers with the right set of Gen-AI/LLM-powered tools and agentic solutions that can enable them to accelerate business growth on Amazon. Our primary focus lies in handling annotations for training, measuring...

30+ days ago
Full Time

AI Benchmarking Specialist, Sp Support German, Int...

Amazon

Bengaluru - India

The Seller AI team within the International Seller Services organization is focused on helping sellers with the right set of Gen-AI/LLM-powered tools and agentic solutions that can enable them to accelerate business growth on Amazon. Our primary focus lies in handling annotations for training, measuring...

30+ days ago
Full Time

AI Benchmarking Specialist, Sp Support Italian, In...

Amazon

Bengaluru - India

The Seller AI team within the International Seller Services organization is focused on helping sellers with the right set of Gen-AI/LLM-powered tools and agentic solutions that can enable them to accelerate business growth on Amazon. Our primary focus lies in handling annotations for training, measuring...

30+ days ago
Full Time

AI Benchmark Engineer Native Language Specialist |...

Lilt

Cairo - Egypt

We are building a rigorous, verifiable evaluation suite of Terminal-Bench tasks designed to test the limits of large language models on multilingual software challenges. Our goal is to measure multilingual robustness across prompt language effects, non-English data processing, and complex locale/encodi...

30+ days ago
Contract

AI Benchmark Engineer Native Language Specialist |...

Lilt

New Delhi - India

We are building a rigorous, verifiable evaluation suite of Terminal-Bench tasks designed to test the limits of large language models on multilingual software challenges. Our goal is to measure multilingual robustness across prompt language effects, non-English data processing, and complex locale/encodi...

30+ days ago
Contract

AI Benchmark Engineer Native Language Specialist |...

Lilt

Madrid - Spain

We are building a rigorous, verifiable evaluation suite of Terminal-Bench tasks designed to test the limits of large language models on multilingual software challenges. Our goal is to measure multilingual robustness across prompt language effects, non-English data processing, and complex locale/encodi...

30+ days ago
Contract

AI Benchmark Engineer Native Language Specialist |...

Lilt

Tokyo - Japan

We are building a rigorous, verifiable evaluation suite of Terminal-Bench tasks designed to test the limits of large language models on multilingual software challenges. Our goal is to measure multilingual robustness across prompt language effects, non-English data processing, and complex locale/encodi...

30+ days ago
Contract

AI Benchmark Engineer Native Language Specialist |...

Lilt

Abuja - Nigeria

We are building a rigorous, verifiable evaluation suite of Terminal-Bench tasks designed to test the limits of large language models on multilingual software challenges. Our goal is to measure multilingual robustness across prompt language effects, non-English data processing, and complex locale/encodi...

30+ days ago
Contract

AI Benchmark Engineer Native Language Specialist |...

Lilt

Berlin - Germany

We are building a rigorous, verifiable evaluation suite of Terminal-Bench tasks designed to test the limits of large language models on multilingual software challenges. Our goal is to measure multilingual robustness across prompt language effects, non-English data processing, and complex locale/encodi...

30+ days ago
Contract

AI Benchmark Engineer Native Language Specialist |...

Lilt

Prague - Czech Republic

We are building a rigorous, verifiable evaluation suite of Terminal-Bench tasks designed to test the limits of large language models on multilingual software challenges. Our goal is to measure multilingual robustness across prompt language effects, non-English data processing, and complex locale/encodi...

30+ days ago
Contract

AI Benchmark Engineer | Native Language Specialist...

Lilt

Berlin - Germany

About the Opportunity: We are building a rigorous, verifiable evaluation suite of Terminal-Bench tasks designed to test the limits of large language models on multilingual software challenges. Our goal is to measure multilingual robustness across prompt language effects, non-English data processing, and...

30+ days ago
Contract

AI Benchmark Engineer | Native Language Specialist...

Lilt

Tokyo - Japan

About the Opportunity: We are building a rigorous, verifiable evaluation suite of Terminal-Bench tasks designed to test the limits of large language models on multilingual software challenges. Our goal is to measure multilingual robustness across prompt language effects, non-English data processing, and...

30+ days ago
Contract