David Duvenaud (@DavidDuvenaud) / X

David Duvenaud

1,363 posts

David Duvenaud

@DavidDuvenaud

Machine learning prof @UofT. Former team lead at Anthropic. Working on generative models, inference, & latent structure.

cs.toronto.edu/~duvenaud/

Joined July 2013

David Duvenaud
@DavidDuvenaud
Jul 8, 2020
We just open-sourced differentiable SDE solvers in PyTorch: github.com/google-researc… Now you can put stochastic differential equations in your deep learning models, and neural nets in your SDEs! Credit to @lxuechen.
GIF
David Duvenaud
@DavidDuvenaud
Feb 27, 2025
LLMs have complex joint beliefs about all sorts of quantities. And my postdoc @jamesrequeima visualized them! In this thread we show LLM predictive distributions conditioned on data and free-form text. LLMs pick up on all kinds of subtle and unusual structure: 🧵
00:00
194K
David Duvenaud
@DavidDuvenaud
Dec 9, 2019
Classifiers are secretly energy-based models! Every softmax giving p(c|x) has an unused degree of freedom, which we use to compute the input density p(x). This makes classifiers into generative models without changing the architecture. arxiv.org/abs/1912.03263
David Duvenaud
@DavidDuvenaud
Nov 14, 2018
We just open-sourced a suite of ODE solvers in PyTorch: github.com/rtqichen/torch… Everything happens on the GPU and is differentiable. Now you can use ODEs in your deep learning models! Credit to @rtqichen.
00:00
David Duvenaud
@DavidDuvenaud
Jan 30, 2025
New paper: What happens once AIs make humans obsolete? Even without AIs seeking power, we argue that competitive pressures will fully erode human influence and values. gradual-disempowerment.ai with @jankulveit @raymondadouglas @AmmannNora @degerturann @DavidSKrueger 🧵
400K
David Duvenaud
@DavidDuvenaud
Jun 3, 2021
Gradient descent in differentiable games rotates around solutions instead of converging. For instance, in GANs. We solve this with a simple trick: complex momentum damps the oscillations. arxiv.org/abs/2102.08431 With @jonLorraine9 @davidjesusacu @PaulVicol
GIF
David Duvenaud
@DavidDuvenaud
Sep 27, 2021
I propose we rename "epistemic uncertainty" to "model uncertainty", and "aleatoric uncertainty" to "per-measurement uncertainty". More generally, one can refer to uncertainty at any level, such as per-pixel, per-image, or per-patient uncertainty. No need for obscure jargon.
David Duvenaud
@DavidDuvenaud
Mar 24, 2023
I should have announced this before, but a year ago I switched my research focus to AI existential risk reduction and governance. I think the risk of bad outcomes for humanity due to AGI is substantial, and that coordinating a slowdown in AGI development is probably a good idea.
210K
David Duvenaud
@DavidDuvenaud
Jan 9, 2020
Training Neural SDEs: We worked out how to do scalable reverse-mode autodiff for stochastic differential equations. This lets us fit SDEs defined by neural nets with black-box adaptive higher-order solvers. arxiv.org/pdf/2001.01328… With @lxuechen, @rtqichen and @wongtkleonard.
00:00
David Duvenaud
@DavidDuvenaud
Jun 20, 2018
Neural ODEs: Instead of updating hiddens layers by layer, we specify their derivative wrt depth with a neural network. An ODE solver adaptively computes the output. By amazing students @rtqichen @YuliaRubanova @jessebett. arxiv.org/abs/1806.07366
David Duvenaud
@DavidDuvenaud
Mar 20, 2022
Replying to @lmrwanda
Here's an attempt to translate the pun in the last line: "A friendly dog walks into a bar. His eyes do not see anything. He should crack one open."
David Duvenaud
@DavidDuvenaud
Jul 17, 2020
Neural ODEs are slow. We speed them up by regularizing their higher derivatives, learning ODEs that are easy to solve: arxiv.org/pdf/2007.04504… with @jacobjinkelly @jessebett @SingularMattrix
00:00
David Duvenaud
@DavidDuvenaud
Oct 3, 2019
I heard you like graphs, so we put a graph neural net in your graph generative model, so you can be invariant to order while you add edges to your graph. Scales to 5000 nodes. Paper: arxiv.org/abs/1910.00760 Code: github.com/lrjconan/GRAN
GIF
David Duvenaud
@DavidDuvenaud
Jan 31, 2019
What if your favorite classifier was also a generative model? We show that ResNets can be made invertible, giving a scalable density model with unrestricted layer architectures. With @JensBehrmann @wgrathwohl @rtqichen @jh_jacobsen arxiv.org/abs/1811.00995