A blog that makes software development and testing interesting and exciting!
Workbook questions and notes on the attention mechanism from Build a LLM from Scratch.
A practical walkthrough of the attention mechanism in Elixir, from simple self-attention to causal and multi-head attention, based on Chapter 3 of Build a LLM from Scratch.
Workbook answers and notes for Chapter 2 of Build a LLM from Scratch: Working with Text Data.
An Elixir/Nx walkthrough of preparing text for LLM training: tokenization, token IDs, BPE, sliding windows, token embeddings, and positional embeddings.
My plan for studying the foundations of Taking Testing Seriously, so the lessons from James Bach and Michael Bolton become habits instead of inspirational quotes.
Re-read my Chapter 1 study posts and Giles Thomas’s companion article to reinforce Sebastian Raschka’s Build LLMs from Scratch takeaways and capture the new insights I missed the first time.
Exploring the three main stages of building an LLM — data preparation, pretraining, and fine-tuning — along with key concepts like transformer architecture, emergent properties, and self-supervised learning.
Exploring GPT architecture through study questions — understanding next-word prediction, self-supervised learning, decoder-only design, autoregressive generation, and how model size impacts capabilities.
Exploring the role of large datasets in LLMs — tokenization, pretraining, and fine-tuning. Study questions from Build LLM from Scratch by Sebastian Raschka.
Diving into the Transformer architecture — encoder vs decoder, self-attention, BERT vs GPT, and zero-shot/few-shot learning. Study questions from Build LLM from Scratch.
Continuing with study questions from Build LLM from Scratch by Sebastian Raschka — covering pretraining, fine-tuning, and the two-stage process of building LLMs.
Exploring LLM applications — from chatbots and virtual assistants to knowledge retrieval and machine translation. Study questions from Build LLM from Scratch.
Answering study questions from Build LLM from Scratch by Sebastian Raschka — testing my understanding of what an LLM is, how it works, and how it relates to generative AI.
TL;DR In the previous post, we gave a high-level overview of the LLM Transformer architecture with examples. Today, we explain what ChatGPT did in “primary school”: the pre-training of an LLM. The q...
TL;DR In the last post, I wrote about custom LLM models and where they outperform general-purpose LLMs. Today we will present the Transformer architecture, the heart of an LLM. The Transformer N...
TL;DR I am reading the book Build LLM from Scratch by Sebastian Raschka. In the previous post, we covered what the primary function of an LLM is. Today’s topic is the next question from Chapter 1: What...
TL;DR I am reading Build a Large Language Model from Scratch and, to deepen my understanding of what I read, I am writing blog posts about the questions that accompany the book. This post covers th...
TL;DR Currently, I am reading two books: Taking Testing Seriously by Michael Bolton and James Bach, and Build a Large Language Model from Scratch by Sebastian Raschka. I learned about Build a Large...