Michael Carbin (@mcarbin) / X

Michael Carbin

@mcarbin

Cambridge, MA

people.csail.mit.edu/mcarbin

Joined September 2007

Pinned
Michael Carbin
@mcarbin
Oct 23, 2025
If you're interested in emerging modeling approaches (like these and others), then reach out!
Naveen Rao
@NaveenGRao
Oct 23, 2025
At Unconventional, we’re building the computational substrate for the AI era. Scientists and SWEs interested in dynamical systems (Diffusion, Neural ODEs, Deep Equilibrium Models, and Energy-based models) DM or email [email protected] (subject: dynamics). Things are getting really
Michael Carbin
@mcarbin
Mar 27, 2024
“How’s your sabbatical?” Well…DBRX is GREAT at RAG! If you’ve been using Mixtral/Llama2/GPT3.5, then try DBRX! The combination of RAG with its SoTA capabilities on knowledge/code/reasoning will unlock new CompoundAI opportunities. databricks.com/blog/introduci…
Michael Carbin
@mcarbin
Jun 26, 2023
Boom! We at @MosaicML plan to unite with an amazing group of colleagues at @Databricks! And don’t worry, still the same great @MosaicML taste: our brand, products, and mission remain. But, going bigger, much bigger. So watch out for more from a truly amazing team! Bravo team!
Ali Ghodsi
@alighodsi
Jun 26, 2023
Big news: we've agreed to acquire @MosaicML, a leading generative AI platform. I couldn’t be more excited to join forces once the deal closes. databricks.com/mosaic-news
Michael Carbin
@mcarbin
Feb 12, 2020
That is indeed me. :-) #SloanFellow. Thank you, mentors. But, importantly, the award reflects my work with fantastic students: @jfrankle, @alex_renda_, Eric Atkinson, Ben Sherman, Cambridge Yang, @charith_mendis, @TomChen17, @JesseMMichel, James Gilles sloan.org/fellowships/20…
Michael Carbin
@mcarbin
Mar 16, 2022
Proud of the @MosaicML team’s continued push for open science and efficient ML with another Composer release: github.com/mosaicml/compo…... 🧵
Michael Carbin
@mcarbin
Oct 13, 2021
Finally! I'm incredibly proud of the amazing team we have at @mosaicml. Together, we are out to reduce the costs of ML training with openly released tools and methodologies. I also desire our work to make strong, reproducible baselines more accessible to the research community.
Databricks AI Research
@DbrxMosaicAI
Oct 13, 2021
Hello World! Today we come out of stealth to make ML training more efficient with a mosaic of methods that modify training to improve speed, reduce cost, and boost quality. Read our founders' blog by @NaveenGRao @hanlintang @mcarbin @jefrankle mosaicml.com/blog/founders-… (1/4)
Michael Carbin
@mcarbin
Jul 9, 2020
We and the community are still on the hunt for why lottery tickets exist. Our (@jefrankle, @KDziugaite, @roydanroy) work here develops a powerful microscope to assess neural net training behavior, particularly, when lottery tickets emerge. One more step towards an understanding.
Jonathan Frankle
@jefrankle
Jul 8, 2020
At ICML next week, @KDziugaite @roydanroy @mcarbin and I will present Linear Mode Connectivity and the Lottery Ticket Hypothesis. We study the effect of SGD noise (like data order) on neural net optimization. Those results shed new light on lottery tickets arxiv.org/abs/1912.05671
Michael Carbin
@mcarbin
Sep 21, 2020
Our new work demonstrating there's still a ways to go on pruning at initialization: the community seems to only know which layers -- but not which individual weights -- to prune. With a flurry of activity around these ideas, I look forward to other teams' findings as well!
Jonathan Frankle
@jefrankle
Sep 21, 2020
Several methods have recently been proposed for pruning neural networks at initialization. In our new paper (@KDziugaite, @roydanroy, @mcarbin), we rigorously study these methods to determine why they "miss the mark" and underperform pruning after training arxiv.org/abs/2009.08576
Michael Carbin
@mcarbin
May 6, 2019
Congrats @jefrankle! I'm very privileged to work with fantastic students like Jonathan.
ICLR
@iclr_conf
May 6, 2019
Best Paper Award 1: The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks Jonathan Frankle · Michael Carbin
Michael Carbin
@mcarbin
Oct 13, 2020
Last year we showed an NN can learn to model the performance of code on a CPU. But, the NN was opaque. Now, we (@alex_renda_ @TomChen17 @charith_mendis) show how to learn 11k parameters of an otherwise hand-configured 10kLOC simulator to get an accurate *and* interpretable model
Alex Renda
@alex_renda_
Oct 13, 2020
New paper at MICRO: “DiffTune: Optimizing CPU Simulator Parameters with Learned Differentiable Surrogates”. DiffTune learns CPU simulator parameters from scratch, leading the simulator to higher accuracy than with expert-provided parameters. arxiv.org/abs/2010.04017. 🧵1/12
Michael Carbin
@mcarbin
Oct 4, 2019
I'm always briefly puzzled when people ask, "Your group is doing ML now?" because our work is still just Approximate Computing to me. Here are my thoughts on the connections.
Mike Hicks
@michael_w_hicks
Oct 3, 2019
Want to address the issues with overparameterization in deep learning? The PL/Systems/Architecture communities exploring Approximate Computing have some answers, says @mcarbin in his PL Perspectives post. blog.sigplan.org/2019/10/03/mac…
Michael Carbin
@mcarbin
Mar 27, 2024
The eagle has landed
Cody Blakeney
@code_star
Mar 27, 2024
It’s finally here 🎉🥳 In case you missed us, MosaicML/ Databricks is back at it, with a new best in class open weight LLM named DBRX. An MoE with 132B total parameters and 32B active 32k context length and trained for 12T tokens 🤯
Michael Carbin
@mcarbin
May 6, 2019
Learn about our work on Lottery Tickets -- small neural networks that train from scratch on big problems -- from @jefrankle today @ 3:45pm (#ICLR2019).
Michael Carbin
@mcarbin
Oct 21, 2022
🚨MLSys 2023 paper deadline in 1 week🚨 @tqchenml and I look forward to your submissions! Key Dates: - Paper submission and co-author registration: October 28, 2022, 4pm ET - Author response: Jan 16 to Jan 20, 2023 - Author notification: Feb 17, 2023