Yi Tay (@YiTayML) / X

Yi Tay

3,997 posts

Yi Tay

@YiTayML

research scientist @googledeepmind ✨♊, model co-lead/captain of gemini deepthink imo gold medal 🥇, opinions are my own.

mixture-of-locations

Joined October 2016

Pinned
Yi Tay
@YiTayML
Dec 4, 2025
Happy to share that the @GoogleDeepMind Gemini team is starting a new research team in Singapore! This new team will be focused on advanced reasoning, LLM/RL and improving bleeding edge SOTA models such as Gemini, Gemini Deep Think and beyond. 🔥 This team will be led by yours
320K
Yi Tay
@YiTayML
Mar 5, 2024
Long overdue but here's a new blogpost on training LLMs in the wilderness from the ground up 😄🧐 In this blog post, I discuss: 1. Experiences in procuring compute & variance in different compute providers. Our biggest finding/surprise is that variance is super high and it's
918K
Yi Tay
@YiTayML
Nov 18, 2025
Gemini 3! This is our most intelligent model that brings any idea to life. 😻 This is the best model in the world, by a crazy wide margin! Aside from a huge increase across the absolutely everything, look at its coding capabilities and quality of aesthetics and fidelity.
00:00
221K
Yi Tay
@YiTayML
Mar 2, 2023
New open source Flan-UL2 20B checkpoints :) - Truly open source 😎 No forms! 🤭 Apache license 🔥 - Best OS model on MMLU/Big-Bench hard 🤩 - Better than Flan-T5 XXL & competitive to Flan-PaLM 62B. - Size ceiling of Flan family just got higher! Blog:
yitay.net
A New Open Source Flan 20B with UL2 — Yi Tay
Releasing the new open source Flan-UL2 20B model.
452K
Yi Tay
@YiTayML
Jul 21, 2025
Our IMO gold model is not just an "experimental reasoning" model. It is way more general purpose than anyone would have expected. This general deep think model is going to be shipped so stay tuned! 🔥
Melvin Johnson
@melvinjohnsonp
Jul 21, 2025
So happy to see this incredible achievement. Huge congrats to @lmthang, @quocleix, @YiTayML and the IMO team on the result. This was a great collaboration across teams to build a general Gemini DeepThink model that can also get gold at IMO.
354K
Yi Tay
@YiTayML
Nov 25, 2024
Personal / life update: I have returned to @GoogleDeepMind to work on AI & LLM research. It was an exciting 1.5 years at @RekaAILabs and I truly learned a lot from this pretty novel experience. I wrote a short note about my experiences and transition on my personal blog here
yitay.net
Returning to Google DeepMind — Yi Tay
Returning to Google and recounting my experiences as a startup co-founder.
325K
Yi Tay
@YiTayML
Oct 3, 2023
It’s been a short 6 months since I left Google Brain and it has been a uniquely challenging yet interesting experience to build everything from the ground up in an entirely new environment (e.g., the wilderness) Today, we’re excited to announce the first version of the
Reka
@RekaAILabs
Oct 3, 2023
We are excited to announce the 1st version of our multimodal assistant, Yasa-1, a language assistant with visual and auditory sensors that can take actions via code execution 🪄. Yasa-1 can understand text, images, videos, sounds & more! 🚀 Check out more details below👇
00:00
479K
Yi Tay
@YiTayML
Jun 12, 2023
Hot take 🔥: Lots of buzz these days about new foundation open-source models but what if I told you there have been no real advance since 2019's T5 models 😀 Take a look at this table from this new InstructEval paper: arxiv.org/abs/2306.04757. Some thoughts/observations: 1.
503K
Yi Tay
@YiTayML
Jul 25, 2022
"Scaling laws vs Model Architectures" from @GoogleAI. Lessons: - Not all arch scale the same way. - Vanilla Transformer does pretty well 😀 - Touching the attention too much is "dangerous". 😔 - Perf at base may not translate to large+ scale. pdf: arxiv.org/abs/2207.10551
Yi Tay
@YiTayML
Mar 30, 2023
Over the past 3.3 years at Google, I have been blessed with so many wonderful friendships and experiences. I have grown so much. However, it’s time to move on to a new adventure! I wrote a blogpost about my wonderful experience here:
yitay.net
Leaving Google Brain — Yi Tay
Documenting my 3.3 years at Google Research and Brain.
393K
Yi Tay
@YiTayML
Apr 15, 2024
It's been a wild ride. Just 20 of us, burning through thousands of H100s over the past months, we're glad to finally share this with the world! 💪 One of the goals we’ve had when starting Reka was to build cool innovative models at the frontier. Reaching GPT-4/Opus level was a
Reka
@RekaAILabs
Apr 15, 2024
Meet Reka Core, our best and most capable multimodal language model yet. 🔮 It’s been a busy few months training this model and we are glad to finally ship it! 💪 Core has a lot of capabilities, and one of them is understanding video --- let’s see what Core thinks of the 3 body
00:00
217K
Yi Tay
@YiTayML
Jun 27, 2023
We’re coming out of stealth with $58M in funding to build generative models and advance AI research at @RekaAILabs 🔥🚀 Language models and their multimodal counterparts are already ubiquitous and massively impactful everywhere. That said, we are still at the beginning of this
261K
Yi Tay
@YiTayML
Sep 16, 2020
Inspired by the dizzying number of efficient Transformers ("x-formers") models that are coming out lately, we wrote a survey paper to organize all this information. Check it out at arxiv.org/abs/2009.06732. Joint work with @m__dehghani @dara_bahri and @metzlerd. @GoogleAI 😀😃
Yi Tay
@YiTayML
Feb 18, 2022
Excited to share our latest work at @GoogleAI on "Transformer Memory as a Differentiable Search Index"! TL;DR? We parameterize a search system with only a single Transformer model 😎. Everything in the corpus is encoded in the model! 🙌 Paper: arxiv.org/abs/2202.06991