Pinned
Modal
1,502 posts
AI infrastructure that developers love 💚
Run inference, sandboxes, batch processing, training, and many other things on Modal
- Modal repostedTried to squeeze the most important bits about the entire stack for cloud deployment of transformer inference, from application layer concerns to hardware, debugging, and o11y, into one talk. Had to operate at a very high tok/s! youtube.com/watch?v=ZUdIsR…
- With today's launch of Nemotron 3 Ultra, @nvidia continues to expand its investment in open-source AI. Their flagship frontier-reasoning model, built for long-running autonomous agents, is available Day 0 on Modal. - 550B with 55B active parameters - Hybrid Transformer-Mamba MoE
- Modal repostedwe're hosting some parties to celebrate our C 💚 exclusive swag at both ofc
00:00
00:15We're bringing together our friends and community to celebrate our Series C. Join us at Noguchi's Sunken Garden in NYC on June 16th or at the Legion of Honor in SF on June 25th. Invites are limited, apply here: modal.com/c-function - Modal repostedI’m so excited about the launch of ESMFold2, ESMC, and the new ESM Atlas. This was a massive team effort, and I’m grateful to have worked with such an incredible group @biohub. A headline result I’m especially excited about: ESMFold2 can design minibinders and antibodies with
- We're bringing together our friends and community to celebrate our Series C. Join us at Noguchi's Sunken Garden in NYC on June 16th or at the Legion of Honor in SF on June 25th. Invites are limited, apply here: modal.com/c-function
00:00 - Modal repostedAt @modal, we're working to make sure OSS RL frameworks have all the techniques necessary to train frontier open-weights models. Delta compression is key, but the job's not done. There are still lots of open problems around weight sync, auto-scaling, & cross-cluster training.@FireworksAI_HQ + @cursor_ai highlighted why delta-compressed weight sync matters for RL at frontier scale. slime brings this capability to OSS: lossless delta sync for Megatron ↔ SGLang disaggregation — ship deltas, not full checkpoints. This is another step toward a fully
- Cyber attackers don't wait for you to spin up infrastructure. How @DoppelHQ uses Modal's elastic compute to scale inference, cut training overhead, and parallelize experimentation 👇
- Day 0 support for Step 3.7 Flash on Modal. - 198B parameter MoE with 11B active - 256K context - 3 reasoning levels - Native image & video understanding Great to work with @StepFun_ai and @sgl_project on this one.














