We built the region’s largest inference cluster in Saudi Arabia in 51 days and we just announced a $1.5B agreement for Groq to expand our advanced LPU-based AI inference infrastructure.
Build fast.
What are we doing with this capital? Originally, we intended to raise $300M, which was going to allow us to deploy 108,000 LPUs into production by the end of Q1 2025. We raised 2x that, so we're also expanding our cloud and core engineering teams.
We're hiring!
What can you do with Llama quality and Groq speed? Instant. That's what.
3 months back: Llama 8B running at 750 Tokens/sec
Now: Llama 70B model running at 3,200 Tokens/sec
We're still going to get a liiiiiiittle bit faster, but this is our V1 14nm LPU - how fast will V2 be? 😉
Prediction: AI will displace social drinking within 5 years
Just as alcohol is a social disinhibitor, people will use AI-powered earbuds to help them socialize, like in the Steve Martin movie Roxanne. At first we'll view it as creepy, but it will quickly become superior to alcohol
(1/5) Everyone at Groq has one of these challenge coins on them. It’s how we create alignment
One side says 25 million, because we're going to get to 25 million tokens per second by the end of the year
On the other side, it says, “Make it real. Make it now. Make it wow.”
When you make compute cheaper do people buy more?
Yes. It's called Jevons Paradox and it's a big part of our business thesis.
In the 1860s, an Englishman wrote a treatise on coal where he noted that every time steam engines got more efficient, people bought more coal.
🧵(1/3)
What do @GroqInc's LPUs cost? So much curiosity!
We're very comfortable with this pricing and performance - and no, the chips/cards don't cost anywhere near $20,000 😂
#Groqspeed
Jonathan Ross, Founder & CEO, Groq: “Open-source wins. Meta is building the foundation of an open ecosystem that rivals the top closed models and at Groq we put them directly into the hands of the developers—a shared value that’s been fundamental at Groq since our beginning. To