Rishabh Tiwari (@rish2k1) / X

Rishabh Tiwari

126 posts

Rishabh Tiwari

@rish2k1

CS PhD @UCBerkeley | Ex-Deepmind, FAIR | Research area: Efficient LLM reasoning, scaling RL

Berkeley, CA

Joined May 2019

Pinned
Rishabh Tiwari
@rish2k1
May 13
Very excited about this line of research of fast-slow learning, 1) potential to solve a lot of issues with current RL (eg. entropy collapse, sparse rewards) 2) an intuitive way of incorporating rich feedback with RL 3) provides a way to transfer knowledge of text-only based
Kusha Sareen
@KushaSareen
May 13
Can LLMs adapt continually without losing base skills? Fast-Slow Training (FST) pairs "slow" weights with "fast" context. FST vs. RL: • 3x more sample-efficient • Higher performance ceiling • Less KL drift (better plasticity) • Continual learning: succeeds where RL stalls
00:00
28K
Rishabh Tiwari
@rish2k1
Jun 9, 2024
Excited to share that I will be joining @UCBerkeley as a PhD student this Fall. I wish to continue working on the efficiency and interpretability of deep learning models, hoping to deepen our understanding of these systems.
79K
Rishabh Tiwari
@rish2k1
Jul 31, 2023
🚨 Excited to share our #ICML2023 work on the Feature Sieve, by which we automatically identify and suppress irrelevant or spurious features in deep networks, hence mitigating simplicity bias. Paper: openreview.net/pdf?id=DnTVBs6…
GIF
16K
Rishabh Tiwari
@rish2k1
Jan 9, 2024
What a way to end my #WACV2024. Got a chance to chat (turned to short AMA 😅) with amazing and super humble @CSProfKGD. Hope you get a chance to tick off visiting India from your bucket list soon. @AditayTripathi @sourabhgothe
7.2K
Rishabh Tiwari
@rish2k1
Jun 9, 2024
Replying to @rish2k1
PS cross posting my LinkedIn post in the hopes of expanding my X network (first step towards becoming a serious AI researcher😅).
4.9K
Rishabh Tiwari
@rish2k1
Jun 9, 2024
Replying to @rish2k1
This accomplishment would not have been possible without the guidance from my incredible mentors: @doktorshenoy, @jainprateek_, @divy93t, @ManishGuptaMG1, @DjDvij.
5.4K
Rishabh Tiwari
@rish2k1
Jan 5, 2024
A very interesting paper by my good friend @rach_it_ w/ @GoogleAI and @GoogleDeepMind . Shows the strategy to compose different LLMs to unlock new capabilities. Seems like a critical step in the right direction towards practical usability of LLMs.
Rachit Bansal
@rach_it_
Jan 5, 2024
Extending an LLM for new knowledge sources is tedious—fine-tuning is expensive/causes forgetting, LoRA is restrictive. Excited to share our work where we show that an LLM can be efficiently *composed* with specialized (L)LMs to enable new tasks! arxiv.org/abs/2401.02412 🧵(1/8)
2.1K
Rishabh Tiwari
@rish2k1
May 3, 2024
A new benchmark dataset to evaluate LLMs on indic languages!
Harman Singh (in NYC for summer)
@Harman26Singh
May 3, 2024
IndicGenBench is now released on huggingface: huggingface.co/collections/go…
2.6K
Rishabh Tiwari
@rish2k1
Jul 31, 2023
Replying to @rish2k1
(8/8) Joint work with @doktorshenoy (Google Research India). @GoogleAI @GoogleIndia
564
Rishabh Tiwari
@rish2k1
Jul 31, 2023
Replying to @rish2k1
(7) Optimization: We use accuracy on validation set for setting hparams. We require no knowledge of potential spurious attribute dimensions, nor even a curated / balanced validation set. Of course, well-chosen validation data can be exploited to guide the resulting classifier.
654
Rishabh Tiwari
@rish2k1
Jul 31, 2023
Replying to @rish2k1
(5) Hypothesis: Simple features are learned early, lower in the network, and proliferate through the rest of the network. What if we can automatically identify and suppress such features?
296
Rishabh Tiwari
@rish2k1
Jul 31, 2023
Replying to @rish2k1
(2) Problem: Simplicity bias: Models depend on weak, easily computed predictive features even if stronger alternatives exist. Spurious correlations: DNNs latch onto apparent feature-label correlations present in biased training data distributions.
692
Rishabh Tiwari
@rish2k1
Jul 31, 2023
Replying to @rish2k1
(3) Results (quant): SIFER gives around 65.7% accuracy without access to test distribution outperforming other baselines. Using a validation set drawn from test distribution further improves accuracy by 6.3%. SIFER beats SOTA by significant margins across debiasing benchmarks.
414
Rishabh Tiwari
@rish2k1
Jul 31, 2023
Replying to @rish2k1
(6) Workflow: We alternate between identifying simple features (i.e., reading them out from intermediate network layers), and erasing them in lower layers. This simple yet powerful scheme mediates between (“sieves through”) competing features, picking only those that generalize.
GIF
315