Opportunity Description
**Role Overview**
We are seeking a **Senior AI Scientist** to lead the design, development, and operationalization of evaluation frameworks for Generative AI systems, with a primary focus on Large Language Models (LLMs) and agentic AI solutions.
This role will be responsible for defining and implementing robust methods to assess quality, safety, reliability, and business impact across LLM-powered applications and multi-agent workflows. The position operates within regulated environments such as life sciences, clinical research, and regulatory domains, ensuring that AI systems meet enterprise and compliance standards.
**Key Responsibilities**
**1. LLM Evaluation & Benchmarking**
+ Design and implement scalable evaluation frameworks for LLMs across use cases including:
+ Question answering, summarization, information extraction, and reasoning
+ Clinical and regulatory document generation (e.g., ICFs, CSRs, protocols)
+ Develop both ...
We are seeking a **Senior AI Scientist** to lead the design, development, and operationalization of evaluation frameworks for Generative AI systems, with a primary focus on Large Language Models (LLMs) and agentic AI solutions.
This role will be responsible for defining and implementing robust methods to assess quality, safety, reliability, and business impact across LLM-powered applications and multi-agent workflows. The position operates within regulated environments such as life sciences, clinical research, and regulatory domains, ensuring that AI systems meet enterprise and compliance standards.
**Key Responsibilities**
**1. LLM Evaluation & Benchmarking**
+ Design and implement scalable evaluation frameworks for LLMs across use cases including:
+ Question answering, summarization, information extraction, and reasoning
+ Clinical and regulatory document generation (e.g., ICFs, CSRs, protocols)
+ Develop both ...