Joe Carlsmith's Substack

Joe Carlsmith's Substack

Home
My website
Archive
About
58:45
Video and transcript of talk on human-like-ness in AI safety
From a talk at Constellation in December 2025.
Dec 17, 2025 • Joe Carlsmith
How human-like do safe AI motivations need to be?
AIs with alien motivations can still follow instructions safely on the inputs that matter.
Nov 12, 2025 • Joe Carlsmith
Leaving Open Philanthropy, going to Anthropic
On a career move, and on AI-safety-focused people working at AI companies.
Nov 3, 2025 • Joe Carlsmith
Controlling the options AIs can pursue
On blocking paths to power, and on making deals.
Sep 29, 2025 • Joe Carlsmith
Video and transcript of talk on giving AIs safe motivations
From a talk at UT Austin in September 2025.
Sep 22, 2025 • Joe Carlsmith
Giving AIs safe motivations
A four-part picture.
Aug 18, 2025 • Joe Carlsmith
Video and transcript of talk on "Can goodness compete?"
From a public talk on long-term equilibria post-AGI, given at Mox in SF in July 2025.
Jul 17, 2025 • Joe Carlsmith
Video and transcript of talk on AI welfare
An overview of my take on AI welfare as of May 2025, from a talk at Anthropic.
May 22, 2025 • Joe Carlsmith
Joe Carlsmith's Substack
Joe Carlsmith's Substack
Philosophy, futurism, and other topics
Recommendations
Sasha's 'Newsletter'
Sasha's 'Newsletter'
Sasha Chapin
Astral Codex Ten
Astral Codex Ten
Scott Alexander
Good Thoughts
Good Thoughts
Richard Y Chappell
User's avatar
world spirit sock stack
Katja Grace
Also Recommended
Cold Takes
Minding Our Way

Joe Carlsmith's Substack

AboutArchiveRecommendationsSitemap
© 2026 Joe Carlsmith · Privacy ∙ Terms ∙ Collection notice
Start your SubstackGet the app
Substack is the home for great culture