Jesse Mu (@jayelmnop) / X

Jesse Mu

666 posts

Jesse Mu

@jayelmnop

computational linguistics

Joined May 2010

Jesse Mu
@jayelmnop
Jan 9, 2023
I've found the killer app of large language models.
741K
Jesse Mu
@jayelmnop
Feb 24, 2025
SOTA on the only eval that matters
190K
Jesse Mu
@jayelmnop
May 7, 2020
The machine learning research process
Jesse Mu
@jayelmnop
Feb 16, 2023
Since prompting, instruction tuning, RLHF, ChatGPT etc are such new and fast-moving topics, I haven't seen many university course lectures covering this content. So we made some new slides for this year's CS224n: NLP w/ Deep Learning course at @Stanford! web.stanford.edu/class/cs224n/s…
339K
Jesse Mu
@jayelmnop
Mar 9, 2023
PSA to anyone who wants to write an op-ed criticizing LLMs (yes, including Noam Chomsky): if you're going to come up with hypothetical failure cases for LLMs, at a minimum, please actually check that your case fails with a modern LLM nytimes.com/2023/03/08/opi…
232K
Jesse Mu
@jayelmnop
Jun 27, 2022
I am announcing the Perverse Scaling Prize: a $1.14 USD prize for tasks which exhibit any of the following scaling curves
Ethan Perez
@EthanJPerez
Jun 27, 2022
We’re announcing the Inverse Scaling Prize: a $100k grand prize + $150k in additional prizes for finding an important task where larger language models do *worse*. Link to contest details: github.com/inverse-scalin… 🧵
Jesse Mu
@jayelmnop
May 22, 2025
I recently moved to the Code RL team at Anthropic, and it’s been a wild and insanely fun ride. Join us! We are singularly focused on solving SWE. No 3000 elo leetcode, competition math, or smart devices. We want Claude n to build Claude n+1, so we can go home and knit sweaters.
Anthropic
@AnthropicAI
May 22, 2025
Introducing the next generation: Claude Opus 4 and Claude Sonnet 4. Claude Opus 4 is our most powerful model yet, and the world’s best coding model. Claude Sonnet 4 is a significant upgrade from its predecessor, delivering superior coding and reasoning.
157K
Jesse Mu
@jayelmnop
Nov 21, 2022
Excited to present 3 #NeurIPS2022 papers on a trend I've been very excited about recently: blurring the boundaries between language models and RL agents (+a bonus 4th paper on active learning!) 🧵(0/7) PS: I'm on the industry job market!
Jesse Mu
@jayelmnop
Apr 18, 2023
Prompting is cool and all, but isn't it a waste of compute to encode a prompt over and over again? We learn to compress prompts up to 26x by using "gist tokens", saving memory+storage and speeding up LM inference: arxiv.org/abs/2304.08467 (w/ @XiangLisaLi2 and @noahdgoodman) 🧵
00:00
161K
Jesse Mu
@jayelmnop
Jun 2, 2022
TIL in 2009 two Berkeley undergrads flipped a coin *40,000* times (1hr/day for a semester) to see whether a coin flip was truly random (it's biased towards the side facing up pre-flip!) Gives a new meaning to the term "undergraduate research project"... stat.berkeley.edu/~aldous/Real-W…
Jesse Mu
@jayelmnop
Apr 7, 2023
Life update: this week I joined the Alignment team @AnthropicAI! I’m starting part-time for now as I finish up my PhD at Stanford. Excited to work on making large language models safer and more aligned!
70K
Jesse Mu
@jayelmnop
Mar 20, 2024
We’re hiring for the adversarial robustness team @AnthropicAI! As an Alignment subteam, we're making a big effort on red-teaming, test-time monitoring, and adversarial training. If you’re interested in these areas, let us know! (emails in 🧵)
73K
Jesse Mu
@jayelmnop
Jun 26, 2020
New preprint with @jacobandreas: we generate explanations of the individual neurons inside deep neural networks by identifying *compositional logical concepts* that closely approximate neuron behavior (e.g. "water that isn't blue") arxiv.org/abs/2006.14032 (1/5)
Jesse Mu
@jayelmnop
Sep 21, 2023
My lecture on prompting, instruction tuning, and RLHF for Stanford's CS224n course is (finally!) available online: youtube.com/watch?v=SXpJ9E…
Jesse Mu
@jayelmnop
Feb 16, 2023
Since prompting, instruction tuning, RLHF, ChatGPT etc are such new and fast-moving topics, I haven't seen many university course lectures covering this content. So we made some new slides for this year's CS224n: NLP w/ Deep Learning course at @Stanford! web.stanford.edu/class/cs224n/s…
91K