AI Benchmark Engineer Jobs in International
18 jobs found
AI Performance Engineer
MBR Partners
AI Performance Engineer Description: Our client is a leading technology company specialising in the design and development of cutting-edge customised server hardware solutions optimised for artificial intelligence and machine learning applications. Their mission is to empower businesses and researchers...
AI Benchmark Engineer | Native Language Specialist...
LILT
About The Opportunity: We are building a rigorous, verifiable evaluation suite of Terminal-Bench tasks designed to test the limits of large language models on multilingual software challenges. Our goal is to measure multilingual robustness across prompt language effects, non-English data processing and...
AI Benchmark Engineer | Native Language Specialist...
LILT
About The Opportunity: We are building a rigorous, verifiable evaluation suite of Terminal-Bench tasks designed to test the limits of large language models on multilingual software challenges. Our goal is to measure multilingual robustness across prompt language effects, non-English data processing and...
AI Benchmarking Lead, Performance Benchmarking Eva...
Amazon
Join our mission-critical team supporting Seller Assistant, Amazon's Gen-AI powered copilot that helps sellers navigate Amazon's complex ecosystem and grow their businesses. As a Quality Assurance Specialist, you'll play a pivotal role in ensuring the reliability and accuracy of AI model evaluations as w...
AI Benchmarking Spec. Chinese, International Selle...
Amazon
The Seller AI team within the International Seller Services organization is focused on helping sellers with the right set of Gen-AI/LLM-powered tools and agentic solutions that can enable them to accelerate business growth on Amazon. Our primary focus lies in handling annotations for training, measuring...
AI Benchmarking Team Lead, International Seller Gr...
Amazon
The Seller AI team within the International Seller Services organization is focused on helping sellers with the right set of Gen-AI/LLM-powered tools and agentic solutions that can enable them to accelerate business growth on Amazon. Our primary focus lies in handling annotations for training, measuring...
AI Benchmarking Specialist, Sp Support Spanish, In...
Amazon
The Seller AI team within the International Seller Services organization is focused on helping sellers with the right set of Gen-AI/LLM-powered tools and agentic solutions that can enable them to accelerate business growth on Amazon. Our primary focus lies in handling annotations for training, measuring...
AI Benchmarking Specialist, Sp Support German, Int...
Amazon
The Seller AI team within the International Seller Services organization is focused on helping sellers with the right set of Gen-AI/LLM-powered tools and agentic solutions that can enable them to accelerate business growth on Amazon. Our primary focus lies in handling annotations for training, measuring...
AI Benchmarking Specialist, Sp Support Italian, In...
Amazon
The Seller AI team within the International Seller Services organization is focused on helping sellers with the right set of Gen-AI/LLM-powered tools and agentic solutions that can enable them to accelerate business growth on Amazon. Our primary focus lies in handling annotations for training, measuring...
AI Benchmark Engineer Native Language Specialist |...
LILT
We are building a rigorous, verifiable evaluation suite of Terminal-Bench tasks designed to test the limits of large language models on multilingual software challenges. Our goal is to measure multilingual robustness across prompt language effects, non-English data processing, and complex locale/encodi...
AI Benchmark Engineer Native Language Specialist |...
LILT
We are building a rigorous, verifiable evaluation suite of Terminal-Bench tasks designed to test the limits of large language models on multilingual software challenges. Our goal is to measure multilingual robustness across prompt language effects, non-English data processing, and complex locale/encodi...
AI Benchmark Engineer Native Language Specialist |...
LILT
We are building a rigorous, verifiable evaluation suite of Terminal-Bench tasks designed to test the limits of large language models on multilingual software challenges. Our goal is to measure multilingual robustness across prompt language effects, non-English data processing, and complex locale/encodi...
AI Benchmark Engineer Native Language Specialist |...
LILT
We are building a rigorous, verifiable evaluation suite of Terminal-Bench tasks designed to test the limits of large language models on multilingual software challenges. Our goal is to measure multilingual robustness across prompt language effects, non-English data processing, and complex locale/encodi...
AI Benchmark Engineer Native Language Specialist |...
LILT
We are building a rigorous, verifiable evaluation suite of Terminal-Bench tasks designed to test the limits of large language models on multilingual software challenges. Our goal is to measure multilingual robustness across prompt language effects, non-English data processing, and complex locale/encodi...
AI Benchmark Engineer Native Language Specialist |...
LILT
We are building a rigorous, verifiable evaluation suite of Terminal-Bench tasks designed to test the limits of large language models on multilingual software challenges. Our goal is to measure multilingual robustness across prompt language effects, non-English data processing, and complex locale/encodi...
AI Benchmark Engineer Native Language Specialist |...
LILT
We are building a rigorous, verifiable evaluation suite of Terminal-Bench tasks designed to test the limits of large language models on multilingual software challenges. Our goal is to measure multilingual robustness across prompt language effects, non-English data processing, and complex locale/encodi...
AI Benchmark Engineer | Native Language Specialist...
LILT
About The Opportunity: We are building a rigorous, verifiable evaluation suite of Terminal-Bench tasks designed to test the limits of large language models on multilingual software challenges. Our goal is to measure multilingual robustness across prompt language effects, non-English data processing and...
AI Benchmark Engineer | Native Language Specialist...
LILT
About The Opportunity: We are building a rigorous, verifiable evaluation suite of Terminal-Bench tasks designed to test the limits of large language models on multilingual software challenges. Our goal is to measure multilingual robustness across prompt language effects, non-English data processing and...