Vivek’s Substack

Multi-Head Attention: One Sentence, Many Perspectives
Breaking down how attention heads specialize and collaborate in transformers
Oct 4, 2025 • Vivek Nayyar
No Peeking! How LLMs Learn With Causal Attention
From masks to dropout: making LLMs learn step by step
Sep 27, 2025 • Vivek Nayyar
Attention, Please! How LLMs Learn Relationships Between Words
How Neural Networks Learn to Connect Words
Jul 13, 2025 • Vivek Nayyar
How Does a LLM Read Your Sentence? Let's Break It Down
This article is meant to give you a simple and intuitive explanation of how large language models (LLMs) like GPT convert a sentence into numbers …
Jul 11, 2025 • Vivek Nayyar
Coming soon
This is Vivek’s Substack.
May 13, 2025 • Vivek Nayyar
My personal Substack

© 2026 Vivek Nayyar