Our Q1 2026 newsletter is out: deception detection research, alignment workshops, technical AI policy, and new hiring.
Highlights: models learning to evade lie detectors, a new method for tracing misbehavior to training data, and prefill attacks that broke every open-weight model.

