Self-care life hack: if you feel a bit down/tired, paste the URL of your website/LinkedIn/bio in Google's NotebookLM to get 8 min of realistic-sounding deep congratulations for your life and achievements from a duo of podcast experts 😂
After 6+ months in the making and burning over a year of GPU compute time, we're super excited to finally release the "Ultra-Scale Playbook"
Check it out here: hf.co/spaces/nanotro…
A free, open-source book to learn everything about 5D parallelism, ZeRO, fast CUDA kernels,
we've seen nothing yet! hosted a 9-13 yo vibe-coding event w. @robertkeus this w-e (h/t @antonosika @LovableBuild)
takeaway? AI is unleashing a generation of wildly creative builders beyond anything I'd have imagined
and they grow up *knowing* they can build anything!
Thrilled to finally share what we've been working on for months at @huggingface 🤝@pollenrobotics
Our first robot: Reachy Mini
A dream come true: cute and low-priced, hackable yet easy to use, powered by open-source and the infinite community.
Tiny price, small size, huge
There was a super impressive AI competition last week that many people missed in the noise of the AI world. I happen to know several participants, so let me tell you a bit of this story over Sunday morning coffee.
You probably know the Millennium Prize Problems
Finally took time to go over Dario's essay on DeepSeek and export control and to be honest it was quite painful to read. And I say this as a great admirer of Anthropic and big user of Claude*
The first half of the essay reads like a lengthy attempt to justify that closed-source
I shared a controversial take the other day at an event and I decided to write it down in a longer format: I’m afraid AI won't give us a "compressed 21st century".
The "compressed 21st century" comes from Dario's "Machines of Loving Grace" and if you haven’t read it, you probably
🚀 Day 5 of #OpenSourceWeek: 3FS, Thruster for All DeepSeek Data Access
Fire-Flyer File System (3FS) - a parallel file system that utilizes the full bandwidth of modern SSDs and RDMA networks.
⚡ 6.6 TiB/s aggregate read throughput in a 180-node cluster
⚡ 3.66 TiB/min
Everyone wants to get into robotics.
No one knows where to start.
LeRobot's Francesco just dropped a 70-page crash course that takes you from zero to cutting-edge:
- RL sim/real
- ACT, Diffusion policies
- VLAs, SmolVLA, Pi-0
Absolute gold if you want to catch up fast.
A comprehensive, hands-on tutorial on the most recent advancements in robotics 🤟
...with self-contained explanations of modern techniques for end-to-end robot learning & ready-to-use code examples using @LeRobotHF and @huggingface. Now available everywhere! 🤗
Today's fully end-to-end audio model demo from @kyutai_labs is a huge deal that many people in the room missed
Mostly irrelevant are the facts that:
- they come a few weeks after OpenAI's ChatGPT-4o
- the demo was less polished than the 4o one (in terms of voice quality, voice
🔥Pytorch-Transformers 1.0🔥
Six NLU/NLG architectures: BERT, GPT, GPT-2, Transfo-XL, XLNet, XLM
Total: 27 pretrained models
Still the same
-Superfast onboarding
-SOTA scripts: GLUE, SQuAD, Text generation
New
-Unified API
-Access hidden-states, attentions...
-TorchScript
-...
Authors have no say in the animal O'Reilly chooses for the cover of their book
But I'm really happy that they chose a parrot🦜 for the cover of the book on Transformers we are finalising with Lewis and Leandro
It's a Coconut Lorikeet parrot (a very stochastic Coconut Lorikeet😉)