- You either die making fun of SF billboards, or live long enough to be on one of them.
- We at @modal_labs just released QuiLLMan: An open-source chat app that lets you interface with Vicuna-13B using your voice. All serverless. Fork the repo to deploy and start building your own LLM-based app in less than a minute! github.com/modal-labs/qui…
- The first time I tried Devin, it: - navigated to the @modal_labs docs page I gave it - learned how to install - handed control to me to authenticate - spun up a ComfyUI deployment - interacted with it in the browser to run stuff 🤯
  Today we're excited to introduce Devin, the first AI software engineer. Devin is the new state-of-the-art on the SWE-Bench coding benchmark, has successfully passed practical engineering interviews from leading AI companies, and has even completed real jobs on Upwork. Devin is
- I wonder what they used for code execution (🧵)
  Today, we release Meta Code World Model (CWM), a 32-billion-parameter dense LLM that enables novel research on improving code generation through agentic reasoning and planning with world models. ai.meta.com/research/publi…
- It's true: @modal has raised an $87M Series B at a $1.1B valuation to advance the future of AI infrastructure. Thank you to @Lux_Capital, @Redpoint, @AmplifyPartners, and others. Now more than ever, AI demands a complete reinvention of traditional compute infrastructure
- The most valuable thing I got from my time at @scale_AI was the opportunity to work with incredible people. This is why I tell everyone early in their career to think hardest about who they want to be working long hours in the trenches with. It matters more than anything else,
- Big day for Python tooling enjoyers: @modal_labs finally has first-class uv support! Use: • `uv_sync` to sync your Modal image with your local project • `uv_pip_install` to install packages lightning fast ⚡️
- At @modal we've built every layer of the AI infra stack from scratch: from filesystems and networking to our own async queues and multi-cloud GPU orchestration. I sat down with @narayanarjun from @AmplifyPartners to go into depth on all of this, including the fun ways the
- Spend all your time writing Slack messages? We're releasing DoppelBot, a Slack app that lets you fine-tune an LLM to answer messages like yourself. Install the app now, or fork and host yourself: modal.com/docs/guide/sla… 100% serverless, running on @modal_labs.
- Claude's law: If Claude Code starts to hallucinate that a feature exists in a product, there's a good chance it will actually exist soon.
- We're thrilled to share that @JamsocketHQ is joining @modal_labs! We've known @paulgb and @taylorbaldwin for years now, and deeply admire their technical vision and execution in everything they've built. They're cooking up exciting things at Modal already, more on this soon :)
- Weekend hack: got two @modal_labs containers to UDP hole-punch and communicate over QUIC. In theory could be used to connect your non-Modal service to a GPU for low latency inference without all the hassle of WebRTC! Here's two containers in 🇯🇵 getting 2ms round-trip time to
- You can now cold-start vLLM in 5s on @modal_labs. GPU snapshotting is a primitive that unlocks a whole world of possibilities we're only beginning to unlock. If you're interested in working on the frontiers of what's possible with AI infra, please reach out :)
  We just launched GPU memory snapshotting on @modal_labs in alpha. Speed up cold boots by up to 12x. If you're deploying AI models, a huge amount of cold boot time comes from loading model weights into GPU memory. This makes it difficult to scale GPU resources up and down

















