Aäron van den Oord (@avdnoord) / X

Aäron van den Oord

365 posts

@avdnoord

GenMedia lead at DeepMind: Gemini Omni, Veo, Genie, Nano Banana. Research Scientist

London, England

avdnoord.github.io/homepage

Joined January 2013

Aäron van den Oord
@avdnoord
Jun 4, 2019
VQVAE-2 finally out! Powerful autoregressive models in a hierarchical compressed latent space. No modes were collapsed in the creation of these samples ;) arXiv: arxiv.org/abs/1906.00446 With @catamorphist and @vinyals More samples and details 👇 [thread]
Aäron van den Oord
@avdnoord
Dec 9, 2019
Unsupervised pre-training now outperforms supervised learning on ImageNet for any data regime (see figure) and also for transfer learning to Pascal VOC object detection arxiv.org/abs/1905.09272…
Aäron van den Oord
@avdnoord
Jul 11, 2018
Our latest work is out! Representation Learning with Contrastive Predictive Coding (CPC). Autoregressive modeling meets contrastive losses in the latent space. Learn useful representations in an unsupervised way. -> On Audio, Vision, NLP and RL. Arxiv: arxiv.org/abs/1807.03748
Aäron van den Oord
@avdnoord
Nov 22, 2017
Introducing Parallel WaveNet, or how to generate 500,000 audio samples per second :). This is our generative Text-To-Speech model that made it into the #Google Assistant. deepmind.com/blog/high-fide…
Aäron van den Oord
@avdnoord
May 30, 2018
VQ-VAE (arxiv.org/abs/1711.00937 and avdnoord.github.io/homepage/vqvae/) is now open source in DM-Sonnet! Here's an example iPython notebook on how to use it for images: github.com/deepmind/sonne…
Aäron van den Oord
@avdnoord
Jun 30, 2021
Honored to have received the MIT TR 35 innovators award. Very grateful to have been able to work with amazing colleagues on this research! @techreview
Aäron van den Oord
From technologyreview.com
Aäron van den Oord
@avdnoord
Jul 25, 2025
We updated our Imagen 4 models and Ultra is tied for #1 on the lmarena leaderboard! The models are available in Google AI Studio and the Gemini API - try them out and let us know what you think.
Arena.ai
@arena
Jul 25, 2025
Exciting Text-to-Image leaderboard update! Two new Imagen 4.0 models from @GoogleDeepMind just dropped: 🥇 Imagen 4.0 Ultra (v2) ties at #1 with @OpenAI’s GPT-Image-1 🥉 Imagen 4.0 (v2) lands strong at #3 Congrats to the Google Imagen team!
Aäron van den Oord
@avdnoord
May 23, 2019
Excited to share our latest results on Contrastive Predictive Coding! -A linear classifier on CPC features yields 61% accuracy, outperforming the original AlexNet result with unsupervised learning. -New state of the art in semi-supervised learning with 1% labels. arxiv.org/abs/1905.09272
Aäron van den Oord
@avdnoord
Jan 22, 2025
Our image model is on LMSYS : ) It's been an amazing effort by the team, I'm very proud of what we achieved over the last year! Try it out now on ImageFX, and soon available on AI Studio
Arena.ai
@arena
Jan 22, 2025
Breaking news from Text-to-Image Arena! 🖼️✨ @GoogleDeepMind’s Imagen 3 debuts at #1, surpassing Recraft-v3 with a remarkable +70-point lead! Congrats to the Google Imagen team for setting a new bar! Try the best text2image at LMArena and cast your vote! More analysis👇
Aäron van den Oord
@avdnoord
Nov 6, 2017
VQ-VAE: our paper on learning discrete representations! Unsupervisedly discovers phonemes and voice style transfer arxiv.org/abs/1711.00937
Aäron van den Oord
@avdnoord
Oct 4, 2017
WaveNet is now on your phone :). We have made it 1000x faster since the original paper one year ago. deepmind.com/blog/wavenet-l…
Aäron van den Oord
@avdnoord
Nov 15, 2017
Slides from my SANE 2017 talk "Neural Discrete Representation Learning". avdnoord.github.io/homepage/slide…
Aäron van den Oord
@avdnoord
Mar 30, 2019
Excited to announce our #ICML2019 Workshop on Self-Supervised Learning! Covering: Vision, NLP, Audio, Robotics, RL ... sites.google.com/view/self-supe… Submissions now open - deadline April 25! Speakers: @ylecun, @chelseabfinn, Andrew Zisserman, Alexei Efros, Jacob Devlin, Abhinav Gupta
Aäron van den Oord
@avdnoord
Aug 26, 2025
After Veo 3, Genie 3, Imagen 4, ... we present nano-banana*! 🍌🚀 (*aka Gemini-2.5-Flash-Image-Preview)
Google DeepMind
@GoogleDeepMind
Aug 26, 2025
Image generation with Gemini just got a bananas upgrade and is the new state-of-the-art image generation and editing model. 🤯 From photorealistic masterpieces to mind-bending fantasy worlds, you can now natively produce, edit and refine visuals with new levels of reasoning,