Scott Wu (@ScottWu46) / X

Scott Wu

271 posts

Scott Wu

@ScottWu46

Building @cognition

Joined February 2018

Pinned
Scott Wu
@ScottWu46
Jun 8
SWE-Bench style grading has been the standard for years now - you ask the agent to solve an issue and then run its code on a pre-constructed unit test. The problem is that passing a unit test is only one part of writing production-ready code. You also want to evaluate agents on
Cognition
@cognition
Jun 8
Introducing FrontierCode: a coding eval that raises the bar for difficulty & quality. Each task took 40+ hrs of work by leading open-source maintainers. Models write sloppy code that works but isn’t maintainable. Our eval is first to measure: would you actually merge this code?
85K
Scott Wu
@ScottWu46
Aug 5, 2025
People have asked about our culture and recent employee communications. Cognition has an extreme performance culture, and we’re upfront about this in hiring so there are no surprises later. We routinely are at the office through the weekend and do some of our best work late into
568K
Scott Wu
@ScottWu46
Nov 5, 2025
As an engineer, you spend a lot more time reading and understanding code than you do writing code. But codebases aren't necessarily optimized for this; scrolling through a ton of CRUD functions tells you a lot less about what a file does than a clear explanation of why the code
Cognition
@cognition
Nov 4, 2025
Replying to @cognition @windsurf and @paulg
our mental model for why codebase understanding is valuable and why vibe coding has limits to scale is expressed by this chart: manual understanding is the bottleneck. Codemaps understanding scales with model intelligence x.com/windsurf/statu…
555K
Scott Wu
@ScottWu46
Jul 14, 2025
It’s a privilege to welcome Windsurf to Cognition. Here are more details in the note I sent to our Cognition team this morning: Team, As discussed during our all-hands, we are acquiring Windsurf. We have now signed a definitive agreement and we couldn’t be more excited. Here’s
Cognition
@cognition
Jul 14, 2025
Cognition has signed a definitive agreement to acquire Windsurf. The acquisition includes Windsurf’s IP, product, trademark and brand, and strong business. Above all, it includes Windsurf’s world-class people, whom we’re privileged to welcome to our team. We are also honoring
00:00
896K
Scott Wu
@ScottWu46
Apr 3, 2025
I'm often asked what programming will look like in 10 years or if it's even worth studying CS anymore. The future of software engineering is the single biggest question we think about in building Devin: 🧵
502K
Scott Wu
@ScottWu46
Sep 17, 2025
so insane. you guys have no idea how hard this is
Mark Chen
@markchen90
Sep 17, 2025
We wrapped up this year's competition circuit with a full score on the ICPC, after achieving 6th in the IOI, a gold medal at the IMO, and 2nd in the AtCoder Heuristic contest!
309K
Scott Wu
@ScottWu46
Mar 12, 2024
I first learned to program when I was 9 years old and fell in love with the ability to turn my ideas into reality. Now, teaching AI to code at @cognition_labs has been a dream come true. We've still got a long way to go; if this sounds like something you'd like to work on, please
Cognition
@cognition
Mar 12, 2024
Today we're excited to introduce Devin, the first AI software engineer. Devin is the new state-of-the-art on the SWE-Bench coding benchmark, has successfully passed practical engineering interviews from leading AI companies, and has even completed real jobs on Upwork. Devin is
00:00
288K
Scott Wu
@ScottWu46
Feb 27, 2025
GPT-4.5 has been awesome to work with. On our agentic coding benchmarks it already shows massive improvements over o1 and 4o. Excited to see the models' continued trajectory on code! One interesting data point: though GPT-4.5 and Claude 3.7 Sonnet score similarly on our overall
OpenAI
@OpenAI
Feb 27, 2025
GPT-4.5 has entered the Chat. openai.com/live/
296K
Scott Wu
@ScottWu46
Dec 2, 2024
Devin has saved companies millions of dollars and has shown as much as an 8x productivity boost in engineering time. Great to chat with Forbes about the work Devin has been doing with customers like Nubank, Ramp, and MongoDB!
191K
Scott Wu
@ScottWu46
Jun 10, 2025
The new o3 price drop makes it 15x cheaper than gpt-4-32k, the SOTA model from two years ago. Meanwhile the number of use cases is probably up 1,000,000x. Kudos to the openai team!
63K
Scott Wu
@ScottWu46
Jul 17, 2025
am i hallucinating or did we just go from "openai acquiring windsurf" to "cog+ws team shipping wave 11" in seven days
Devin Desktop
@devindesktop
Jul 17, 2025
Wave 11 is live! Seven big upgrades to Windsurf 🧵
00:00
144K
Scott Wu
@ScottWu46
Sep 8, 2025
When we started Cognition, we were a small group of engineers who shared a lifelong love of coding. We hunkered down in an apartment in NYC and built the product we always wanted for ourselves. A lot has changed over the last 20 months but the core vision has not. We see a
Cognition
@cognition
Sep 8, 2025
We’ve raised over $400M at a $10.2B post-money valuation to advance the frontier of AI coding agents. The round was led by Founders Fund with other existing investors including Lux, 8VC, Neo, Elad Gil, Definition Capital, and Swish VC all doubling down. We’re also joined by new
114K
Scott Wu
@ScottWu46
Aug 27, 2025
No better company for my first ever beer than John and Patrick. Unfortunately was non-alcoholic since I had to go back to work after.
John Collison
@collision
Aug 27, 2025
Maths savant turned @cognition founder @ScottWu46 joins me to discuss their AI software engineer, acquiring Windsurf over a weekend, the Moneyball-ification of everything, math contests with @alexandr_wang in 6th grade, the future of independent coding tools, and why he thinks we
00:00
116K
Scott Wu
@ScottWu46
Jun 27, 2025
Slack came out in 2014. At the time it wasn't obvious why you'd want to use it. We already had email for professional conversations; quick group messages felt too unrefined to use at work. Coding agents are at a similar inflection point today. Most engineers are used to writing
devin.ai
Coding Agents 101: The Art of Actually Getting Things Done
Coding Agents 101: The Art of Actually Getting Things Done
125K