We are seeking a highly skilled Quality Assurance Engineer with expertise in testing AI models, Large Language Models (LLMs), AI agents, and Generative AI applications. The ideal candidate will have experience in validating AI-driven systems, ensuring model accuracy, performance, fairness, explainability, and robustness. Additionally, the candidate will be responsible for developing automated tests alongside their AI testing responsibilities.
The successful candidate will be self-motivated and proactive, demonstrating a strong sense of ownership and accountability while requiring minimal supervision.
What You Will Do
Develop and implement QA strategies for AI-powered applications, focusing on accuracy, bias, fairness, robustness, and performance.
Design and execute automated and manual test cases to validate AI agents/LLMs, APIs, and data pipelines, drawing on a good understanding of data integrity, data models, and related concepts.
Assess AI models using quality metrics such as precision/recall and hallucination detection.
Test AI models for bias, fairness, explainability (XAI), drift, and adversarial robustness.
Validate prompt engineering, fine-tuning techniques, and model-generated responses for accuracy and ethical AI considerations.
Conduct scalability, latency, and performance testing for AI-driven applications.
Collaborate with data engineers to validate data pipelines, feature engineering processes, and model outputs.
Design, develop, and maintain automation scripts using Selenium and Playwright for API and web testing.
Work closely with cross-functional teams to integrate automation best practices into the development lifecycle.
Identify, document, and track bugs while conducting detailed regression testing to ensure product quality.
What You Will Bring
Proven expertise in testing AI models, LLMs, and Generative AI applications, with hands-on experience in AI evaluation metrics and testing tools like Arize, MAIHEM, and LangTest.
Strong proficiency in Python for writing test scripts and automating model validation, along with a deep understanding of AI bias detection, adversarial testing, model explainability (XAI), and AI robustness.
Strong SQL expertise for validating data integrity and backend processes, particularly in PostgreSQL and MySQL.
Strong analytical and problem-solving skills with keen attention to detail, along with excellent communication and documentation abilities to convey complex testing processes and results.