Sneha Kudugunta (@snehaark) / X

Sneha Kudugunta

637 posts

Sneha Kudugunta

@snehaark

tpu go brr @GoogleDeepMind @uwcse

San Francisco Bay Area, CA

sneha-rk.github.io

Joined September 2014

Sneha Kudugunta
@snehaark
Dec 10, 2023
bro have you made your neurips poster yet
95K
Sneha Kudugunta
@snehaark
Sep 12, 2023
Excited to announce MADLAD-400 - a 2.8T token web-domain dataset that covers 419 languages(!). Arxiv: arxiv.org/abs/2309.04662 Github: github.com/google-researc… 1/n
231K
Sneha Kudugunta
@snehaark
Sep 13, 2019
New EMNLP paper “Investigating Multilingual NMT Representation at Scale” w/ @ankurbpn, @orf_bnw, @caswell_isaac, @naveenariva. We study transfer in massively multilingual NMT @GoogleAI from the perspective of representational similarity. Paper: arxiv.org/pdf/1909.02197… 1/n
Sneha Kudugunta
@snehaark
Jan 2, 2024
Tired: Catching imposter syndrome by reading PhD applications from students way smarter than you. Wired: Getting excited about talking them into building cool things with you ✨
35K
Sneha Kudugunta
@snehaark
Jan 14, 2022
We wrote a blogpost about our work on Task-level Mixture-of-Experts (TaskMoE), and why they're a great way to efficiently serve large models (vs more common approaches like training-> compression via distillation).
Google AI
@GoogleAI
Jan 14, 2022
Read all about Task-level Mixture-of-Experts (TaskMoE), a promising step towards efficiently training and deploying large models, with no loss in quality and with significantly reduced inference latency ↓ goo.gle/3I5ulXj
Sneha Kudugunta
@snehaark
Oct 13, 2021
#EMNLP2021 Findings paper “Beyond Distillation: Task-level Mixture-of-Experts for Efficient Inference” w/ @bignamehyp, @ankurbpn, Maxim Krikun, @lepikhin, @lmthang, @orf_bnw about TaskMoE, an inference friendly alternative to token-based MoEs. Link: arxiv.org/abs/2110.03742 1/n
Sneha Kudugunta
@snehaark
Dec 20, 2023
Late tweet, but thank you ENSLP #NeurIPS2023 for the best paper award, and @Devvrit_Khatri for the excellent presentation on behalf of the team @adityakusupati! Excited to push further on conditional computation for tiny fast flexible models 🚀
Aditya Kusupati
@adityakusupati
Oct 16, 2023
Announcing MatFormer - a nested🪆(Matryoshka) Transformer that offers elasticity across deployment constraints. MatFormer is an architecture that lets us use 100s of accurate smaller models that we never actually trained for! arxiv.org/abs/2310.07707 1/9
22K
Sneha Kudugunta
@snehaark
Nov 5, 2019
Our Colab is out! Link: github.com/google-researc… I'll be talking about our paper today (11/5) (w/ @ankurbpn, @iseeaswell, @naveenariva, @orf_bnw) "Investigating Multilingual NMT Representations at Scale" at AWE Hall 2C (17:24) @emnlp2019. #emnlp2019 #NLProc #googleAI
Sneha Kudugunta
@snehaark
Sep 13, 2019
New EMNLP paper “Investigating Multilingual NMT Representation at Scale” w/ @ankurbpn, @orf_bnw, @caswell_isaac, @naveenariva. We study transfer in massively multilingual NMT @GoogleAI from the perspective of representational similarity. Paper: arxiv.org/pdf/1909.02197… 1/n
Sneha Kudugunta
@snehaark
Sep 13, 2019
Replying to @snehaark @ankurbpn and 3 others
Huge thanks to my collaborators at @GoogleAI, without whom this work would not have been possible. This work was done as a part of the Google AI Residency - applications open soon, so definitely check it out! g.co/airesidency 8/8
Sneha Kudugunta
@snehaark
Jan 10, 2023
Which one of y'all did this?
13K
Sneha Kudugunta
@snehaark
Oct 16, 2023
MatFormer is a small but significant step towards true conditional computation models. Why use many neuron when few neuron do trick? 🙃
Aditya Kusupati
@adityakusupati
Oct 16, 2023
Announcing MatFormer - a nested🪆(Matryoshka) Transformer that offers elasticity across deployment constraints. MatFormer is an architecture that lets us use 100s of accurate smaller models that we never actually trained for! arxiv.org/abs/2310.07707 1/9
32K
Sneha Kudugunta
@snehaark
Nov 15, 2023
GPU bewafa hain
8.2K
Sneha Kudugunta
@snehaark
Dec 12, 2023
I'm at #NeurIPS2023 today presenting MADLAD-400 with @BZhangGo and @adityakusupati at 5:15pm in Hall B1/B2 #314! Come by and chat w/ us about creating *massive* datasets, making sure they're not garbage, and multilingual LMs :D
34K
Sneha Kudugunta
@snehaark
Feb 15, 2020
Be kind to yourself. Don't be your own Reviewer 2. ✨ #selfcare