Cerebras
2,658 posts
The world's fastest AI inference and training.
Try the latest open models at: inference.cerebras.ai
- Cerebras reposted: one of the things I loved about my convo with @sarahookr is how she frames chaos > "no one knows the answer" = fun time > a million different approaches to memory = super interesting > agentic workflows breaking everything = a huge opportunity her take: the bigger the problem,
- Cerebras reposted: The internet was supposed to summon what you need. Instead: hundreds of tabs. AI should fix that. Co-founder @sarahookr explains the interface should adapt to you, not the other way around.
- TSMC is perhaps the greatest manufacturing company in the world. They have been our partner from our inception. In August of 2017, we met with TSMC’s senior leadership in Taipei. They were one of the largest companies in the world. We had an idea written on PowerPoint. We thought
- Google just released their fastest model: Gemini 3.5 Flash. We ran it head-to-head against Kimi K2.6 on Cerebras. The two are neck-and-neck on intelligence, but what about speed? Full benchmark results: cerebras.ai/blog/which-is-…
- We are in the recursive age of AI. Faster inference => faster AI development. Our take on this from the hardware perspective. Quoted post: "Our internal data shows Claude is accelerating AI development - a possible path to recursive self-improvement, or AI autonomously building a more capable successor. It’s happening faster than we thought, and the implications deserve greater attention." anthropic.com/institute/recu…
- Cerebras reposted: How did @cerebras arrive at their blockbuster IPO? "It's the largest chip ever made. [It's] the fastest AI processor ever built. We solved the problem that had been open in the compute industry for 75 years." @andrewdfeldman #BloombergTech @tsgiles ⏯️ bloom.bg/4ucEPfi
- Replying to @cerebras @sarahookr and @adaptionlabs
- "If you can co-locate your memory, you're getting a lot more bang for your buck because you're avoiding this costly memory transfer." @sarahookr (author of The Hardware Lottery, founder of @adaptionlabs) on why inference is forcing a new chip paradigm - one that wafer-scale was