Towards Data Science

Publish AI, ML & data-science insights to a global community of data professionals.

From Local LLM to Tool-Using Agent
LLM Applications

Using Gemma 4, Ollama, OpenAI Agents SDK, and Tavily MCP to build a lightweight research…

Shuai Guo

Jun 26

8 min read
Water Cooler Small Talk, Ep. 11: Overfitting in RAG evaluation
Large Language Models

Why memorizing for the exam doesn’t mean you understand the subject

Maria Mouschoutzi

Jun 26

10 min read

Latest

Amplify the Expert: A Philosophy for Building Enterprise RAG
Large Language Model

Enterprise Document Intelligence [Vol.1 #M1] – The thesis behind every architectural choice in this series

angela shi

Jun 26

20 min read
How to Ace Data and ML Behavioural Interviews
Data Science

How to smash through data / ML behavioural interviews

Egor Howell

Jun 26

10 min read
Vector RAG Isn’t Enough — I Built a Context Graph Layer for Multi-Agent Memory
Large Language Model

I benchmarked raw chat history, vector-only RAG, and a context graph on the same multi-agent…

Emmimal P Alexander

Jun 25

19 min read
The Hot Path Belongs to GBDTs, Agents Own the Cold Path: A Payment-Fraud Benchmark
Machine Learning

A reproducible benchmark on latency, cost, and reproducibility, and where agents actually earn their keep.

Sandeep Bharadwaj Mannapur

Jun 25

17 min read
Beyond the Straight Line: Choosing Between OLS, Interaction Terms, and Tweedie Regression
Data Science

Whether you should stick to a classic Ordinary Least Squares regression, introduce interaction terms, or…

Gustavo Santos

Jun 25

14 min read
3 Agents. 3 LLMs. 1 Aging GPU: Engineering Parallel Inference on Bare Metal
Agentic AI

Beat the 8GB VRAM limit. Learn how to run three different LLMs on a single…

Anubhab Banerjee

Jun 25

21 min read
An LLM as arbiter in RAG retrieval: picking the right candidate with reasons
Large Language Models

Enterprise Document Intelligence [Vol.1 #7C] – One LLM call ranks the candidates with reasons. The…

angela shi

Jun 25

31 min read
One Month Into Learning Data Engineering in Public: Here’s What I Didn’t Write About
Data Engineering

A reflection on the first month of learning data engineering in public, and what actually…

Ibrahim Salami

Jun 25

8 min read
How to Build a Credit Scoring Grid From a Logistic Regression Model
Machine Learning

Turning model coefficients into a 0–1000 score, with risk classes and stability checks

JUNIOR JUMBONG

Jun 24

7 min read

Editor’s Picks

Your First Task as a Data Engineer in a New Company? Make the ETL Pipeline Testable
Data Engineering

A practical data engineering onboarding workflow for environment setup, automated testing, and AI-assisted development.

Jiayan Yin

Jun 24

9 min read
Why I Stopped Using One Agent and Built a Multi-Agent Pipeline Instead
Agentic AI

A practical walkthrough using text-to-SQL as the example

Priyansh Bhardwaj

Jun 24

13 min read
I Spent an Hour on a Data Preprocessing Task Before Asking Gemini
Data Science

How Gemini solved my Pandas problem in seconds, and why data science fundamentals still matter…

Soner Yıldırım

Jun 23

7 min read
GPU-Resident Top-K for Agentic RAG: I Built a CUDA Kernel So My Retrieval Step Would Stop Bouncing Off the GPU
Agentic AI

The PCIe transfer latency is silently bottlenecking your agentic inference. Here is how building a…

Anubhab Banerjee

Jun 19

31 min read
Structured Outputs with LLMs: JSON Mode, Function Calling, and When to Use Each
Large Language Models

Getting reliable, readable responses out of your LLM, and knowing which tool to reach for

Maria Mouschoutzi

Jun 18

13 min read
Your Churn Threshold Is a Pricing Decision
Data Science

How unit economics should set your classification cutoff, and why they rarely do.

Fabio Oliveira

Jun 17

15 min read
You Probably Don’t Need an Agent Framework
Large Language Models

Most LLM applications need a clear workflow, not an autonomous agent. Here’s how to build…

Shuai Guo

Jun 17

19 min read
Drilling Into AI’s Financial Sustainability
Artificial Intelligence

Budgets for AI tokens can’t be infinite, no matter how much hyperscalers wish they were

Stephanie Kirmer

Jun 16

8 min read
I Built 11 Models to Predict the 2026 World Cup. They Crown Four Different Champions.
Data Science

A single model hands you a single answer and no sense of how much it…

Ari Joury, PhD

Jun 15

11 min read

The Variable Newsletter

Exciting Changes Are Coming to the TDS Author Payment Program
Writing

Authors can now benefit from updated earning tiers and a higher article cap

TDS Editors

Mar 2

2 min read
TDS Newsletter: Vibe Coding Is Great. Until It’s Not.
The Variable

Sorting through the good, bad, and ambiguous aspects of vibe coding

TDS Editors

Feb 5

4 min read

Deep Dives

Finding the right anchors for RAG: keyword, embedding, and TOC signals in parallel
Large Language Models

Enterprise Document Intelligence [Vol.1 #7B] – Retrieval is filtering on structured tables: keywords first, TOC…

angela shi

Jun 24

33 min read
Retrieval Is Filtering, Not Search: A Mental Model for Enterprise RAG
Large Language Models

Enterprise Document Intelligence [Vol.1 #7A] – Stop searching strings. Filter line_df and toc_df. Pick anchors…

angela shi

Jun 23

21 min read
Encoding Categorical Data for Outlier Detection
Data Science

Why one-hot encoding isn’t always the best approach, and alternative encodings

W Brett Kennedy

Jun 22

21 min read
Neural Networks, Explained for Beginners: Start Here If They’ve Confused You
Deep Learning

The intuition behind neural networks and why they need activation functions.

Nikhil Dasari

Jun 22

20 min read
Tool Calling, Explained: How AI Agents Decide What to Do Next
Agentic AI

Understanding how LLMs interact with the world around them, from returning data to taking action

Maria Mouschoutzi

Jun 21

12 min read
Proteins: A Mosaic Pattern to Rule Them All?
Machine Learning

For decades, the existence of the hydrophobic core, a region in the 3D structure of…

Francisco Javier Lobo-Cabrera

Jun 18

12 min read

Your home for data science and AI. The world’s leading publication for data science, data analytics, data engineering, machine learning, and artificial intelligence professionals.

© Insight Media Group, LLC 2026
