Work location: On-site, Paris (France)
Engagement model: Temporary Employment (approx. 2 months with possible extension)
Workload: 25 hours/week
Required Start: Immediately
Languages required: Hindi and English
DataForce by TransPerfect is seeking a detail-oriented and linguistically skilled professional to design and evaluate user prompts for AI language models. The primary focus of this role is to create high-quality prompts, define grading criteria, and review model outputs to ensure accuracy, relevance, and adherence to guidelines. You will critically assess model generations, identify failure cases, and provide clear, well-justified feedback to improve system performance.
Key Responsibilities:
Develop user prompts and establish robust evaluation criteria for AI model testing.
Review and rate model responses, ensuring compliance with project guidelines and quality standards.
Investigate and document failure cases, providing actionable insights for improvement.
Communicate observations, feedback, and rating justifications effectively to stakeholders.
Apply guidelines rigorously to maintain excellent data quality.
Additional Responsibilities (as needed):
Identify and source Hindi-language audio content (e.g., podcasts, videos) from online platforms.
Transcribe Hindi audio recordings to create high-quality datasets for language model training.
Annotate and label audio and text data according to project specifications.
Evaluate the performance of Speech Recognition and Text-to-Speech systems.
Curate and maintain golden datasets for model development.
Requirements:
Idiomatic fluency in Hindi
Advanced level of English
Based in Paris (on-site position)
Strong attention to detail and critical thinking skills
Basic programming knowledge and ability to review code are a plus
Rigorous analytical and methodical approach
Efficient execution of assigned tasks and resilience in a dynamic working environment
Familiarity with AI language models and evaluation processes is a plus
DataForce by TransPerfect is part of the TransPerfect family of companies, the world's largest provider of language and technology solutions for global business, with offices in more than 100 cities worldwide.
We offer high-quality data for Human-Machine Interaction to some of the most prestigious technology companies in the world. Our department focuses on gathering, enriching, and processing data for Machine Learning in different AI domains.