Deep (Learning) Focus
Subscribe
Sign in
Home
Notes
The Author
Archive
About
RL Scaling Laws for LLMs
How scaling laws have evolved from pretraining to reinforcement learning...
READ THE LATEST
Most Popular
View all
Decoder-Only Transformers: The Workhorse of Generative LLMs
Mar 4, 2024
•
Cameron R. Wolfe, Ph.D.
167
15
10
Demystifying Reasoning Models
Feb 18, 2025
•
Cameron R. Wolfe, Ph.D.
281
5
30
Understanding and Using Supervised Fine-Tuning (SFT) for Language Models
Sep 11, 2023
•
Cameron R. Wolfe, Ph.D.
90
5
8
AI Agents from First Principles
Jun 9, 2025
•
Cameron R. Wolfe, Ph.D.
362
25
44
Latest
Top
Discussions
The Anatomy of an LLM Benchmark
Common patterns used to create the most effective LLM evaluation datasets...
Mar 30
•
Cameron R. Wolfe, Ph.D.
98
4
15
Applying Statistics to LLM Evaluations
Most LLM evaluations are conducted without a deep consideration of statistics.
Mar 9
•
Cameron R. Wolfe, Ph.D.
127
8
13
Rubric-Based Rewards for RL
Extending the benefits of large-scale RL training to non-verifiable domains...
Feb 16
118
11
17
Continual Learning with RL for LLMs
Exploring the impressive continual learning capabilities of RL training...
Jan 26
•
Cameron R. Wolfe, Ph.D.
146
15
19
GRPO++: Tricks for Making RL Actually Work
How to go from the vanilla GRPO algorithm to functional RL training at scale...
Jan 5
•
Cameron R. Wolfe, Ph.D.
130
10
18
Olmo 3 and the Open LLM Renaissance
Fully-open artifacts with the potential to make LLM research a reality for anyone...
Dec 15, 2025
•
Cameron R. Wolfe, Ph.D.
82
7
14
Group Relative Policy Optimization (GRPO)
How the algorithm that teaches LLMs to reason actually works...
Nov 24, 2025
•
Cameron R. Wolfe, Ph.D.
116
10
14
See all
Deep (Learning) Focus
I contextualize and explain important topics in AI research.
Subscribe
Recommendations
View all 13
The VC Corner
Ruben Dominguez
Ahead of AI
Sebastian Raschka, PhD
AI for Software Engineers
Logan Thorneloe
Javarevisited Newsletter
javinpaul
The Founders Corner®
Ruben Dominguez
Deep (Learning) Focus
Subscribe
About
Archive
Recommendations
Sitemap
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts