Yuge Shi (Jimmy) (@YugeTen) / X

Yuge Shi (Jimmy)

694 posts

Yuge Shi (Jimmy)

@YugeTen

石宇歌 · @GoogleDeepMind, Genie 1, 2 🧞‍♀️ , Veo 3 📽️, Omni 🧘‍♀️

London, England

Joined December 2017

Pinned
Yuge Shi (Jimmy)
@YugeTen
Jan 31, 2025
✨New blog post✨: my attempt as a vision researcher at finally understanding RLHF -- a deep dive into PPO & DeepSeek's GRPO! No hot take, I promise.
yugeten.github.io
A vision researcher’s guide to some RL stuff: PPO & GRPO
89K
Yuge Shi (Jimmy)
@YugeTen
Jun 20, 2022
Dear conference organisers: I promise I will choose a better birth place next time, but in the meantime can we please 1) host conferences at more visa-friendly locations or 2) leave more time between paper decision time and conference so we have enough time to apply for visas?
Yuge Shi (Jimmy)
@YugeTen
Mar 5, 2023
I am starting work at @DeepMind as a research scientist tomorrow ☀️☀️ I will be joining the Openendedness team led by Tim Rocktäschel @_rockt. Excited (and equally nervous, admittedly) to start this new chapter of life and meet & work with amazing people!
68K
Yuge Shi (Jimmy)
@YugeTen
Apr 19, 2023
Successfully defended my thesis with Max Welling (@wellingmax) and Andrew Zisserman today! No professors were physically harmed in the process. A huge thank you to both for the insightful discussions!
59K
Yuge Shi (Jimmy)
@YugeTen
Jun 20, 2022
Replying to @YugeTen
Not everyone can get into the US by taking 20 minutes to fill out an online form, some has to dig up their entire employment/travelling history and refresh the visa appointment website desperately for days and days
Yuge Shi (Jimmy)
@YugeTen
Sep 13, 2019
After much procrastination, finally uploaded my first blog post "Gaussian Processes, not quite for dummies"! Have a lookie here: yugeten.github.io/posts/2019/09/… Share if like :)
yugeten.github.io
Gaussian Process, not quite for dummies
Before diving inFor a long time, I recall having this vague impression about Gaussian Processes (GPs) being able to magically define probability distributions over sets of functions, yet I procrast...
Yuge Shi (Jimmy)
@YugeTen
Jul 13, 2021
Paper + Code release! We propose Fish🐟, an effective algorithm for domain generalisation. It learns invariant features by maximising gradient inner product across domains. 📜:arxiv.org/abs/2104.09937 👩‍💻:github.com/YugeTen/fish Work done during internship at FAIR with @syhw. 🧵
Yuge Shi (Jimmy)
@YugeTen
Jul 21, 2024
Pretty pissed off about not getting to ICML for the third time because of visa, but super proud to have 2 orals, 1 spotlight and 1 workshop at the conference! I'm not going to fix open borders with this tweet but I do hope it nudges you to go talk to my amazing collaborators --
53K
Yuge Shi (Jimmy)
@YugeTen
May 16, 2022
Random pixel mask is NOT random word mask in image space: we propose ADIOS, which learns semantically meaningful masks for masked image modelling in a self-supervised manner. Joint work with Adam Kosiorek (@arkosiorek), now accepted to #ICML2022. 📜: arxiv.org/abs/2201.13100
Yuge Shi (Jimmy)
@YugeTen
May 20, 2025
Compared to the much anticipated unveiling of #Veo3 I am more excited to share my usage of “no cap” in generating a rapping video
00:00
44K
Yuge Shi (Jimmy)
@YugeTen
Mar 23, 2022
The school of informatics at the University of Edinburgh and DeepMind are offering an ML PhD scholarship for students who identify as gender/racial/ethnic minorities in 2022/23. See thread for details. (1/n)
Yuge Shi (Jimmy)
@YugeTen
Jun 20, 2020
New blog post: How I learned to stop worrying and write ELBO in a billion ways. I talk about a list of interesting approaches including IWAE and DReG that are important to the evolution of ELBO over the past few years. Have fun reading, RT if like!
yugeten.github.io
How I learned to stop worrying and write ELBO (and its gradients) in a billion ways
Latex equations not rendering? Try using a different browser or this link here.
Yuge Shi (Jimmy)
@YugeTen
Feb 17, 2023
Dare I say...computer vision with human feedback is on its way in? Super happy to have contributed to this important piece of work during my internship at Google Brain!
Alexander Kolesnikov
@__kolesnikov__
Feb 17, 2023
Vision meets RL! We reveal that policy gradient can be used for tuning vision models to optimize complex metrics, such as mAP, PQ or “color diversity”, observing large performance boosts on tasks like object detection, panoptic segmentation, etc. arxiv.org/abs/2302.08242
39K
Yuge Shi (Jimmy)
@YugeTen
Jan 23, 2023
How robust are unsupervised representation learning methods (e.g. SSL) to distirbution shift compared to supervised learning? 𝐒𝐡𝐨𝐫𝐭 𝐚𝐧𝐬𝐰𝐞𝐫: Quite! 𝐋𝐨𝐧𝐠 𝐚𝐧𝐬𝐰𝐞𝐫: Our #ICLR2023 paper arxiv.org/pdf/2206.08871… Joint work with Imant Daunhawer & @AmartyaSanyal.
46K