Pinned
👉 New preprint: how do we make LMs more reliable once there's no more training data?
Enforcing *consistency* of LM predictions across inputs lets us unsupervisedly optimize for factual accuracy & faithful explanation (& get a unifying view on many existing post-training algs)
New paper: It's time to optimize for 🔁self-consistency 🔁
We’ve pushed LLMs to the limits of available data, yet failures like sycophancy and factual inconsistency persist.
We argue these stem from the same assumption: that behavior can be specified one I/O pair at a time. 🧵













