RAG | Towards Data Science

Advanced RAG Retrieval: Cross-Encoders & Reranking

LLM Applications

A deep-dive and practical guide to cross-encoders, advanced techniques, and why your retrieval pipeline deserves…

Ian Ho

April 11, 2026

28 min read

Grounding Your LLM: A Practical Guide to RAG for Enterprise Knowledge Bases

Large Language Models

A clear mental model and a practical foundation you can build on

Priyansh Bhardwaj

April 8, 2026

17 min read

Proxy-Pointer RAG: Achieving Vectorless Accuracy at Vector RAG Scale and Cost

Large Language Model

A new way to build vector RAG—structure-aware and reasoning-capable

Partha Sarkar

April 5, 2026

23 min read

Agentic RAG Failure Modes: Retrieval Thrash, Tool Storms, and Context Bloat (and How to Spot Them Early)

Large Language Models

Why agentic RAG systems fail silently in production and how to detect them before your…

Mostafa Ibrahim

March 20, 2026

8 min read

Beyond Prompt Caching: 5 More Things You Should Cache in RAG Pipelines

Agentic AI

A practical guide to caching layers across the RAG pipeline, from query embeddings to full…

Maria Mouschoutzi

March 19, 2026

13 min read

Introducing Gemini Embeddings 2 Preview

Large Language Models

One embedding model to rule them all

Thomas Reid

March 17, 2026

10 min read

Why Care About Prompt Caching in LLMs?

Large Language Models

Optimizing the cost and latency of your LLM calls with Prompt Caching

Maria Mouschoutzi

March 13, 2026

12 min read

Scaling Vector Search: Comparing Quantization and Matryoshka Embeddings for 80% Cost Reduction

Machine Learning

Navigating the performance cliff: How pairing MRL with int8 and binary quantization balances infrastructure costs…

Oleg Tereshin

March 12, 2026

11 min read

Understanding Context and Contextual Retrieval in RAG

Large Language Models

Why traditional RAG loses context and how contextual retrieval dramatically improves retrieval accuracy

Maria Mouschoutzi

March 7, 2026

10 min read

RAG with Hybrid Search: How Does Keyword Search Work?

Machine Learning

Understanding keyword search, TF-IDF, and BM25

Maria Mouschoutzi

March 4, 2026

10 min read