Redwood Research blog | Buck Shlegeris | Substack

Recent posts

Recent LLMs can use filler tokens or problem repeats to improve (no-CoT) math performance

AI can sometimes distribute cognition over many extra tokens

Dec 22, 2025 • Ryan Greenblatt

BashArena and Control Setting Design

We’ve just released BashArena, a new high-stakes control setting we think is a major improvement over the settings we’ve used in the past.

Dec 18, 2025 • Adam Kaufman and James Lucassen

The behavioral selection model for predicting AI motivations

The basic arguments about AI motivations in one causal graph

Dec 4, 2025 • Alex Mallen and Buck Shlegeris

Will AI systems drift into misalignment?

A reason alignment could be hard

Nov 15, 2025 • Josh Clymer

What's up with Anthropic predicting AGI by early 2027?

I operationalize Anthropic's prediction of "powerful AI" and explain why I'm skeptical

Nov 3, 2025 • Ryan Greenblatt

Sonnet 4.5's eval gaming seriously undermines alignment evals

And this seems caused by training on alignment evals.

Oct 30, 2025 • Alexa Pan and Ryan Greenblatt

The latest from

Buck Shlegeris

The inaugural Redwood Research podcast

Ryan Greenblatt

Recent LLMs can do 2-hop and 3-hop latent (no CoT) reasoning on natural facts

Alex Mallen

The behavioral selection model for predicting AI motivations

Josh Clymer

Will AI systems drift into misalignment?

Julian Stastny

Prospects for studying actual schemers

Vivek Hebbar

Recent Redwood Research project proposals

Redwood Research blog

We research catastrophic AI risks and techniques that could be used to mitigate them.

Recommendations

The Power Law

Peter Wildeford

ForeWord

Forethought

AI Futures Project

AI Futures Project

Daniel Kokotajlo
