Opportunity Description
Get AI-powered advice on this job and more exclusive features. We’re looking for AI QA trainers who specialize in model evaluation, LLM safety, prompt robustness, data quality assurance, multilingual and domain-specific testing, grounding verification, and compliance readiness checks. You’ll evaluate advanced language models on tasks such as hallucination detection, factual consistency, prompt-injection and jailbreak resistance, bias/fairness audits, chain-of-reasoning reliability, tool-use correctness, retrieval-augmentation fidelity, and end-to-end workflow validation. You will document every failure mode to raise the bar for quality.
On a typical day, you will converse with the model on real-world scenarios and evaluation prompts, verify factual accuracy and logical soundness, design and run test plans and regression suites, build clear rubrics and pass/fail criteria, capture reproducible error traces with root-cause hypotheses, and suggest improvements to promp...
Ready to Apply?
Submit your application for Ai qa trainer - llm evaluation - freelance project at Invisible Expert Marketplace
Apply for this Position