Oliver Wang
682 posts
Principal Scientist at Google DeepMind
Joined February 2013
- I'm glad to see that mountain goats are starting to take their safety more seriously. #Veo2
- In order to edit an image, you need to know something about its perspective. In this work, we propose "Perspective Fields", a new image space representation of camera perspective consisting of an up vector and latitude value per-pixel.
- I love it when I run into a mushroom migration in the woods. #Veo2
- Check out what you can do when you mix Gemini's world knowledge with the ability to show things visually. Multimodal communication abilities unlock new use cases!
- 6 years ago onboard my flight to CVPR in Vegas I tried a new method called COLMAP on a video I captured out of my window and was amazed at how well it worked! The fact that COLMAP remains invaluable today is a testament to its success. Thanks for all the point clouds Johannes!
- We are hiring! Looking for research engineers to build cool stuff with our SOTA foundation models (e.g., Imagen 3 and Veo 2). Google DeepMind is a fantastic place to be with amazing colleagues and huge potential for impact. Please reach out: boards.greenhouse.io/deepmind/jobs/…
- 🍌🍌It's finally here! In addition to the largest ELO lead in lmarena history, I'm most excited about the fact that people really loved using the model. QPS was way above what we expected, and the model racked up 2.5M votes (also a record)! Amazing job team banana 🚀🚀🍌🍌
  🚨🍌Big Reveal: who was "Nano Banana?" The anonymous model, “nano-banana,” that caught the world's attention with its ability to follow complex instructions, preserve character identity, and maintain contextual details was: Gemini-2.5-Flash-Image-Preview by @GoogleDeepMind 🍌✨
- Wow, 42 papers with Adobe co-authors at CVPR this year! Nearly all of these are the result of our summer internship program or university collaborations. Great job everyone!
- Check out our latest work where we perform space-time view synthesis of dynamic sequences from monocular video! w/ @zl548 @Jimantha @simon_niklaus. cs.cornell.edu/~zl548/NSFF/ I'm really excited about this work and there are lots of cool things to unpack, some favorites: [1/4]
- Now that we have all these awesome NeRF-like volumetric 3D reconstructions, what are we going to do with them? Well, if we want to edit them, we will need some way to easily select regions. 1/N