Freelance Gen AI Testing QA (Senior)

WILLWARE TECHNOLOGIES PRIVATE LIMITED

Job Location:

Bengaluru - India

Monthly Salary: Not Disclosed

Posted on: 30+ days ago

Vacancies: 1 Vacancy

Job Summary

Role: Freelance Gen AI Testing QA Evaluation Engineer (Senior)

Company Name : WillWare Technologies

Work Model : Remote / Contract / Fulltime

Experience : 5 Years

Work Location :Chennai/Bangalore/Kochi/Jaipur/Coimbatore/Remote

Job description:

Experience:
5-8 years in QA automation with 1-3 years in GenAI / API-based testing.

Key Responsibilities:

Develop and maintain automated evaluation pipelines.
Implement evaluation scripts using Python frameworks (e.g. DeepEval custom frameworks)
Integrate LLM/Chatbot APIs and agent workflows into evaluation pipelines
Execute dataset-driven evaluations and capture and process responses.
Support manual test scenario execution and validation
Assist in dataset creation and enrichment
Generate evaluation reports and logs
Debug and troubleshoot execution issues.
Enable CI/CD integration for continuous evaluation.

Key Skills :

Core GenAI Evaluation Skills:
Experience with evaluation frameworks (e.g. DeepEval or Arize)
Understanding of LLM-as-a-Judge (G-Eval) methodology
Strong prompt engineering and evaluation design skills
Experience in manual evaluation of LLM outputs.

Technical Skills:

Strong programming in Python
Experience in API testing and integration
Proficiency in JSON handling parsing and data processing
Automation framework development/integration.
Knowledge of logging reporting and debugging tools

Agent Manual Testing & Dataset Skills:

Experience in:
o Test scenario creation for GenAI use cases
o Manual validation of LLM responses (qualitative assessment)
o Dataset creation and curation
o Writing expected outputs or golden answers.
Ability to design edge cases negative scenarios and adversarial inputs (prompt injection jailbreaks)

Domain & QA Skills:

Strong foundation in software testing principles:
o Functional integration regression testing
Experience in test design defect tracking and reporting.
Strong analytical and problem-solving skills.
Conversational AI testing experience.
Understanding of AI agent behavior workflows and edge cases.

Required Skills:

QA EngineeringGen AI TestingGenAIAPI TestingPython

Role: Freelance Gen AI Testing QA Evaluation Engineer (Senior) Company Name : WillWare Technologies Work Model : Remote / Contract / Fulltime Experience : 5 Years Work Location :Chennai/Bangalore/Kochi/Jaipur/Coimbatore/Remote Job description: Experience: 5-8 years in QA automation with 1-3 years i...