Pinned
🚨 New Paper: The Art of Scaling Reinforcement Learning Compute for LLMs 🚨
We burnt a lot of GPU-hours to provide the community with the first open, large-scale systematic study on RL scaling for LLMs.
x.com/Devvrit_Khatri…
Wish to build scaling laws for RL but not sure how to scale? Or what scales? Or would RL even scale predictably?
We introduce: The Art of Scaling Reinforcement Learning Compute for LLMs














