The latest from
Buck Shlegeris
The inaugural Redwood Research podcast
Ryan Greenblatt
Recent LLMs can do 2-hop and 3-hop latent (no CoT) reasoning on natural facts
Alex Mallen
The behavioral selection model for predicting AI motivations
Josh Clymer
Will AI systems drift into misalignment?
Julian Stastny
Prospects for studying actual schemers
Vivek Hebbar
Recent Redwood Research project proposals
Redwood Research blog
We research catastrophic AI risks and techniques that could be used to mitigate them.
Recommendations
The Power Law
Peter Wildeford
ForeWord
Forethought
AI Futures Project
Daniel Kokotajlo