Yang Song (@DrYangSong) / X

Yang Song

242 posts

Yang Song

@DrYangSong

Research Principal of MSL

Joined July 2014

Pinned
Yang Song
@DrYangSong
Apr 8
Today we’re excited to release Muse Spark. It’s our first end-to-end test of the new stacks we’ve built at MSL, and a true testament to this incredible team. We’re eager to learn from your feedback!
ai.meta.com
Introducing Muse Spark: Scaling Towards Personal Superintelligence
14K
Yang Song
@DrYangSong
Jul 17, 2019
Excited to share our paper on a different approach to generative modeling. We can estimate gradients of the data distribution and sample with Langevin dynamics. No adversarial method and no approximation for tractable training. Record-breaking inception score of 8.91 on CIFAR-10.
GIF
Yang Song
@DrYangSong
Nov 20, 2023
OpenAI is nothing without its people.
70K
Yang Song
@DrYangSong
Dec 1, 2020
Happy to announce our new work on score-based generative modeling: high quality samples, exact log-likelihoods, and controllable generation, all available through score matching and Stochastic Differential Equations (SDEs)! Paper: arxiv.org/abs/2011.13456
Yang Song
@DrYangSong
Feb 13, 2020
Excited to share our paper on accelerating feedforward computations in ML — such as evaluating a DenseNet or sampling from autoregressive models — via parallel computing. Speedup factors are around 1.2–33 under various conditions and computation models.
GIF
Yang Song
@DrYangSong
May 6, 2021
Checkout my new blog post on generative modeling by score matching and score-based models. I introduce the intuition behind these methods, their pros and cons, and also discuss the close connection to diffusion probabilistic models. yang-song.github.io/blog/2021/scor…
Yang Song
@DrYangSong
Oct 29, 2025
Applications change, but the principles are enduring. After a year's hard work led by @JCJesseLai, we are really excited to share this deep, systematic dive into the mathematical principles of diffusion models. This is a monograph we always wished we had.
Chieh-Hsin (Jesse) Lai
@JCJesseLai
Oct 29, 2025
Tired to go back to the original papers again and again? Our monograph: a systematic and fundamental recipe you can rely on! 📘 We’re excited to release 《The Principles of Diffusion Models》— with @DrYangSong, @gimdong58085414, @mittu1204, and @StefanoErmon. It traces the core
58K
Yang Song
@DrYangSong
Jun 17, 2020
Do not estimate the probability density of data. Instead, estimate its gradient! We provide improved techniques for training score-based generative models, enabling effortless generation of high resolution images. Comparable quality to GANs yet no need of adversarial training!
Yang Song
@DrYangSong
Nov 22, 2023
We explored Jacobi iteration for accelerating sequential computation in a previous work (arxiv.org/abs/2002.03629), with success in PixelCNN decoding, DenseNet evaluation, and RNN training. It's gratifying to see that an improved method can now significantly speed up LLM decoding.
Arena.ai
@arena
Nov 21, 2023
Introduce lookahead decoding: - a parallel decoding algo to accelerate LLM inference - w/o the need for a draft model or a data store - linearly decreases # decoding steps relative to log(FLOPs) used per decoding step. Blog: lmsys.org/blog/2023-11-2… Code: github.com/hao-ai-lab/Loo…
GIF
arxiv.org
Accelerating Feedforward Computation via Parallel Nonlinear...
Feedforward computation, such as evaluating a neural network or sampling from an autoregressive model, is ubiquitous in machine learning. The sequential nature of feedforward computation, however,...
87K
Yang Song
@DrYangSong
Dec 10, 2023
Our Strategic Explorations team @OpenAI is seeking hardcore researchers to develop fundamental methodologies for advancing image and text generation. We focus on exploratory research with long term impacts. Let’s connect and discuss at NeurIPS if you are interested in joining us!
69K
Yang Song
@DrYangSong
Apr 1, 2021
Thrilled to share that our paper "Score-Based Generative Modeling through Stochastic Differential Equations" has won an Outstanding Paper Award at ICLR 2021! Huge shoutouts to my awesome collaborators: @jaschasd @dpkingma @studentofml @StefanoErmon @poolio!
ICLR
@iclr_conf
Apr 1, 2021
We are thrilled to announce the #ICLR2021 Outstanding Paper Awards! Out of 860 excellent papers, the award committee identified 8 that are especially noteworthy: iclr-conf.medium.com/announcing-icl… Congratulations to the authors!! @shakir_za @iatitov @aliceoh @NailaMurray @katjahofmann
Yang Song
@DrYangSong
Feb 17, 2025
This is the most educative post on diffusion models I’ve seen—perfect for beginners. Thanks @slaterstich for the amazing work!
Slater Stich
@slaterstich
Feb 14, 2025
Diffusion Without Tears is our attempt to make the score-matching + SDE interpretation of diffusion geometrically intuitive. If you're interested in our upcoming interview with @DrYangSong, I recommend reading this first! Link below.
28K
Yang Song
@DrYangSong
Jul 19, 2019
Releasing our paper on MintNet! It's a new flow model built by replacing normal convolutions in ResNets with masked convolutions. It has exact likelihood, fast sampling with fixed-point iteration, and better performance than published results on MNIST, CIFAR-10 and small ImageNet
Yang Song
@DrYangSong
Oct 23, 2024
Thrilled to share our latest work on consistency models! We simplified the math behind continuous-time consistency models, stabilized their training, and scaled them up to 1.5B parameters. We are now one step closer to real-time multimodal generation!
Cheng Lu
@clu_cheng
Oct 23, 2024
Excited to share our latest research progress (joint work with @DrYangSong ): Consistency models can now scale stably to ImageNet 512x512 with up to 1.5B parameters using a simplified algorithm, and our 2-step samples closely approach the quality of diffusion models. See more
30K