Line of Service
AdvisoryIndustry/Sector
Not ApplicableSpecialism
Emerging TechnologiesManagement Level
Senior AssociateJob Description & Summary
At PwC our people in software and product innovation focus on developing cutting-edge software solutions and driving product innovation to meet the evolving needs of clients. These individuals combine technical experience with creative thinking to deliver innovative software products and solutions.Why PWC
At PwC you will be part of a vibrant community of solvers that leads with trust and creates distinctive outcomes forour clients and communities. This purpose-led and values-driven work powered by technology in an environment that drives innovation will enable you to make a tangible impact in the real world. We reward your contributions support your wellbeing and offer inclusive benefits flexibility programmes and mentorship that will help you thrive in work and life. Together we grow learn care collaborate and create a future of infinite experiences foreach other. Learn more about us.
At PwC we believe in providing equal employment opportunities without any discrimination on the grounds of gender ethnic background age disability marital status sexual orientation pregnancy gender identity or expression religion or other beliefs perceived differences and status protected by law. We strive to create an environment where each one of our people can bring their true selves and contribute to their personal growth and the firms growth. To enable this we have zero tolerance for any discrimination and harassment based on the above considerations.
Job Description & Summary:We areseekinga highly skilled AI Quality & Performance Test Engineer to design implement and execute test strategies for AI-powered chatbot and Generative AI solutions. This role is a unique blend of QA Automation API Testing Performance Engineering and Ethical AI Testing withstrongfocus on RAG evaluation automated test generation fairness testing and flow-based conversational validation.
Responsibilities:
Design and implementautomated test frameworks(PythonPytest/Robot Framework) for chatbot and AI applications.
ConductAPI testingfor chatbot backends RAG pipelines and third-party integrations (REST/GraphQL).
Validatechatbot workflowsincludingflow-based conversations contextual continuity fallback handling and error reportingfor end-to-end customer scenarios.
ImplementRAG testingusingRAGADeepEval and LLM-as-a-Judgeto evaluateretrievalaccuracy and hallucination risks.
Build anAI-driven Testcase Generator(using GPT-4 Omni) to process enterprise documents (doc pdf xml jpg excel) and auto-generate Q&A test datasets.
Define and trackcomprehensive AI evaluation metrics:
Quantitative Metrics Accuracy Latency Throughput.
Qualitative MetricsAnswer Relevancy Faithfulness Hallucination Contextual Relevancy.
Business MetricsCustomer feedback (thumbs up & thumbs down) resolution rateRacialBiasand Diversity (using embedding models to detect bias/fairness issues).
Conductperformance load and stress testingusingLocust Grafana and K6ensuring scalability under peak citizen traffic.
Ensureethical AI complianceby testing fairness inclusivity transparency and privacy.
Integrate automated test suites intoCI/CD pipelines(Jenkins GitHub Actions GitLab CI).
Collaborate with AI/ML engineers developers and stakeholders to ensureresponsible AI deployment.
Generate structured reports/dashboards (Allurepytest-html Grafana) capturingquality performance and ethical AI insights.
Mandatory skill sets:
Strong experience inQA Automationwith Python (Pytest Robot Framework).
Handson knowledge in Mobile testing.
ExpertiseinAPI testing(RESTGraphQL) with tools like PostmanPytest or Rest Assured.
Hands-on withRAG testing tools(RAGADeepEval LLM-as-a-Judge).
Proficiencyinautomated Q&A generationusing GPT-based models (GPT-4 Omni).
Strong knowledge ofperformance/load testing tools(Locust Grafana K6).
Familiarity withAI evaluation metrics(Quantitative Qualitative Business).
Experience inchatbot/NLP testing(flow-based conversations contextual relevance).
Knowledge ofbias/fairness validationusing embedding models for diversity/racial bias detection.
Strong debugging log analysis and test reporting skills.
Preferred skill sets:
Exposure toGenerative AI evaluation frameworksfor hallucination and relevance scoring.
Experience withCI/CD automationin AI testing workflows.
Awareness ofdata privacybestpracticesin test data handling.
Domain knowledge incitizen services banking or telecom.
Years of experience:
4 to 6 years of relevant job experience
Education qualification:
Bachelors degree in Computer Science Engineering ora relatedfield or equivalent work experience.
Minimum of 4 years of professional experience in Java development.
Education (if blank degree and/or field of study not specified)
Degrees/Field of Study required: Bachelor of Technology Bachelor of EngineeringDegrees/Field of Study preferred:Certifications (if blank certifications not specified)
Required Skills
Generative AIOptional Skills
Accepting Feedback Accepting Feedback Active Listening Analytical Thinking Artificial Intelligence Business Planning and Simulation (BW-BPS) Communication Competitive Advantage Conducting Research Creativity Digital Transformation Embracing Change Emotional Regulation Empathy Implementing Technology Inclusion Innovation Processes Intellectual Curiosity Internet of Things (IoT) Learning Agility Optimism Product Development Product Testing Prototyping Quality Assurance Process Management 10 moreDesired Languages (If blank desired languages not specified)
Travel Requirements
Available for Work Visa Sponsorship
Government Clearance Required
Job Posting End Date
Required Experience:
Senior IC
At PwC, our purpose is to build trust in society and solve important problems. We’re a network of firms in 155 countries with over 284,000 people who are committed to delivering quality in assurance, advisory and tax services. Find out more and tell us what matters to you by vis ... View more