Great collaborating with @ScaleAILabs on pushing evaluations and benchmarks in drug discovery. One clear takeaway from this work and many others: different frontier models show different strengths across biomedical task categories.
Thanks @afeyzaakyurek, @TuXinming, and the
Excited to share new @ScaleAILabs research, in collaboration with @phylo_bio, on coding agents for drug-discovery research! 💊
We ran Claude Code, Codex, and Gemini on 60+ expert-curated drug-discovery tasks inside a shared Biomni-powered biomedical research environment and the















