Reasoning with Machines Lab

Research Themes

Benchmarks and Evaluation

We develop the science of LLM evaluation, setting the standard for rigorous assessment and identifying hidden risks before they matter.

AI Safety and Security

From bias and toxicity to agentic misalignment, we study the full spectrum of AI risk and develop the technical and governance tools to address it.

Agentic AI for Science

We build agentic systems that automate scientific knowledge synthesis and discovery, with a focus on agents that are reliable, transparent and domain-grounded.

Human-AI Interaction

We run large-scale empirical studies on how people use AI for high-stakes decisions, from healthcare and law to policy and beyond.

Welcome to the Reasoning with Machines Lab @ University of Oxford
