Prof. Anima Anandkumar (@AnimaAnandkumar) / X

Prof. Anima Anandkumar

2,632 posts

Prof. Anima Anandkumar

@AnimaAnandkumar

AI+Science, Bren Professor @caltech, Time100, Fmr Sr Director of #AI research @nvidia Fmr Principal Scientist @awscloud

tensorlab.cms.caltech.edu/users/anima/

Joined May 2021

Prof. Anima Anandkumar
@AnimaAnandkumar
Mar 7, 2024
For the first time, we show that the Llama 7B LLM can be trained on a single consumer-grade GPU (RTX 4090) with only 24GB memory. This represents more than 82.5% reduction in memory for storing optimizer states during training. Training LLMs from scratch currently requires huge
AK
@_akhaliq
Mar 7, 2024
GaLore Memory-Efficient LLM Training by Gradient Low-Rank Projection Training Large Language Models (LLMs) presents significant memory challenges, predominantly due to the growing size of weights and optimizer states. Common memory-reduction approaches, such as low-rank
408K
Prof. Anima Anandkumar
@AnimaAnandkumar
Dec 7, 2023
I have decided to leave my position at Nvidia to focus on starting something new, as some of you may already know. I look forward to scaling models with physical and scientific understanding to accelerate progress toward AGI. I will share more soon. Excited to meet everyone at
369K
Prof. Anima Anandkumar
@AnimaAnandkumar
Aug 11, 2024
My great great grandfather was such an inspiration. He cracked the code of a previously unknown script on palm leaves from an ancient temple and translated Arthashastra into English. When I read the Arthashastra as a young girl I was blown away by its richness. It covered not
Quanta Magazine
@QuantaMagazine
Aug 11, 2024
The Arthashastra, written in 300 BCE, is the first known text on economics. In 1905, it was rediscovered by the scholar Rudrapatna Shamasastry. His great-great-granddaughter is Anima Anandkumar, now a machine learning scientist at CalTech and Nvidia. quantamagazine.org/the-ai-researc…
208K
Prof. Anima Anandkumar
@AnimaAnandkumar
Dec 11, 2023
Launching Lean Co-pilot for LLM-human collaboration to write formal mathematical proofs that are 100% accurate. We use LLMs to suggest proof tactics in Lean and also allow humans to intervene and modify in a seamless manner. github.com/lean-dojo/Lean… Automating theorem proving
GitHub - lean-dojo/LeanCopilot: LLMs as Copilots for Theorem Proving in Lean
From github.com
323K
Prof. Anima Anandkumar
@AnimaAnandkumar
Jan 5, 2022
My mom was one of the first female engineers in my community. Initially, my grandpa refused since she would be too qualified and hence, unmarriageable. My mom went on a hunger strike for three days and my grandpa relented. I stand on the shoulder of giants #womeninstem
Prof. Anima Anandkumar
@AnimaAnandkumar
Jun 6, 2022
Our big day came together so beautifully, surrounded by family and friends at the historic @caltech Athenaeum. More pictures to come soon! @bjenik #wedding
Prof. Anima Anandkumar
@AnimaAnandkumar
Sep 3, 2024
Congratulations @jiawzhao on an excellent PhD defense! Jiawei has been a pioneer in hardware-efficient training. When he started his PhD, everyone was focusing on inference efficiency, and training runs were small, Jiawei took the bold step to pursue training efficiency. Slides:
91K
Prof. Anima Anandkumar
@AnimaAnandkumar
Jun 16, 2024
It is important to reconnect to our past and cherish the people and institutions that nurtured us and helped shape who we are today. I got a special opportunity to do so and receive the Distinguished Alumnus Award at @iitmadras My family and @iitmadras mentors got to speak. I
65K
Prof. Anima Anandkumar
@AnimaAnandkumar
Jun 2, 2024
Looking forward to being back at @iitmadras It has been 20 years since I left but memories are still fresh. It is an honor to be back to receive the distinguished alumni award.
48K
Prof. Anima Anandkumar
@AnimaAnandkumar
Aug 21, 2025
It is interesting that the new @deepseek_ai v3.1 is trained using the UE8M0 FP8 scale data format which is logarithmic number system. Our multiplicative weights update (Madam) for training in that format was done several years ago while at @nvidia It yields maximum hardware
73K
Prof. Anima Anandkumar
@AnimaAnandkumar
Sep 16, 2025
Physics-AI that can generalize across different complex 3D geometries is a challenging problem. We propose a principled solution combining Neural Operators with Optimal Transport. Optimal transport provides determines the most efficient transformation between two densities. By
35K
Prof. Anima Anandkumar
@AnimaAnandkumar
Jun 12, 2022
This is one of my favorite #Pics so far from our #wedding (credit: @alicejacobsmd ) I jumped for the kiss a bit too early before @francesarnold could finish her speech and give us the permission :) Thanks so much @francesarnold for officiating our #wedding
Prof. Anima Anandkumar
@AnimaAnandkumar
Oct 11, 2024
Announcing LeanAgent: the first life-long learning agent for formal theorem proving in Lean. LLMs have been integrated with interactive proof assistants like Lean for theorem proving with 100% accuracy. So far, these LLMs are static, cannot learn new knowledge online, and
76K
Prof. Anima Anandkumar
@AnimaAnandkumar
Jan 18, 2023
Thank you @TheOfficialACM for recognizing me as an ACM Fellow! This honor belongs to my team and collaborators. Every day I am energized and inspired to work with them. Also thankful to my mentors and my family for supporting me all this way!
Association for Computing Machinery
@TheOfficialACM
Jan 18, 2023
💐Meet the 2022 ACM Fellows! 57 of the ACM members have been selected for their wide-ranging and fundamental contributions in the #computing field. Please join us in congratulating these new inductees! Learn more about their achievements here: bit.ly/3CXulZz #ACMFellows
110K