Our team at Google DeepMind Foundational Research is hiring full-time Research Scientists and Research Interns! Multimodal, Reasoning, self-improving agents, Video Understanding. Looking for candidates with strong papers at top ML and CV conferences. Email: [email protected]
Alireza Fathi
Senior Staff Research Scientist / Manager @ Google DeepMind
- Our team at Google DeepMind is seeking a Research Scientist with a strong publication record (multiple first-author papers) on multi-modal LLMs in top ML venues like NeurIPS, ICLR, CVPR. Email me at [email protected] @CordeliaSchmid
- ✨ Our team at Google DeepMind is hiring Research Interns (Summer 2025)! Multimodal, text-to-3D, Personalized LLMs, Video Understanding and Generation. Looking for candidates with multiple first-author papers in top ML conferences. Email: [email protected] @CordeliaSchmid
- Our team at Google DeepMind Foundational Research has an opening for a full-time Research Scientist! Areas of Interest are Multimodal, 3D and Spatial Reasoning, Self-improving Agents. Looking for candidates with strong publications at top ML and CV conferences. Email:
- Robotics at Google has released a very high-quality dataset of scanned objects. It could enable interesting research in 3D shape modeling. app.ignitionrobotics.org/GoogleResearch…
- Jitendra Malik's thoughts on Foundation Models, in the Stanford HAI workshop
- We have released TensorFlow 3D! Announcing the release of TensorFlow 3D, a set of training and evaluation pipelines for state-of-the-art 3D semantic segmentation, object detection and instance segmentation, with support for distributed training. Check it out and download the code at goo.gle/3pchcSG
- Augmenting Large Language & Visual models with Retrieval helps the model to answer questions that were not present in the training data. REVEAL is one of the recent works by our team arxiv.org/abs/2212.05221 @acbuller, @ahmetius, @jesu9, @MrZiruiWang, David Ross, @CordeliaSchmid
- Most previous work on 3D object detection uses only one frame of data. In our #eccv2020 paper, we present a 3D sparse LSTM model that achieves more accurate results when applied to a sequence of point clouds. arxiv.org/abs/2007.12392
- Our recent work on object-centric neural rendering. Our new formulation makes it possible to move the objects around in the scene and still be able to render high quality images from different views. We made NeRF compositional! By learning object-centric neural scattering functions (OSFs), we can now compose dynamic scenes from captured images of objects. Website: shellguo.com/osf Joint work with @alirezafathi @jiajunwu_cs Thomas Funkhouser
- I am glad that our #cvpr2020 reviews are very positive, but at the same time I am very worried that the quality of the reviews has significantly degraded compared to a few years ago.
- Congratulations to Yue Wang (research intern), Rui Huang (AI resident), Wanyue Zhang (AI resident) and @_abhijit_kundu_ for getting their papers accepted to #eccv2020.
- Today marks my 7th year at Google! How time flies! Thank you, Google, for giving me the opportunity to work on what I enjoy...






