Data Neighbor Newsletter
Subscribe
Sign in
Home
Podcast
Chat
Newsletter
Archive
About
Building AI That Actually Works
What Salesforce's Research Leader Taught Us About Enterprise Agents
READ THE LATEST
Most Popular
View all
Redesigning Metrics for AI Products
Dec 9, 2025
•
Shane Butler
11
2
3
Open-Source LLMs vs. ChatGPT: Which One Should You Use?
Mar 16, 2025
•
Hai Guan
,
Shane Butler
, and
Maarten Grootendorst
7
2
Rough Thoughts on What an AI Evaluation Lifecycle Might Look Like
Dec 1, 2025
•
Shane Butler
4
1
AI evals are becoming a product capability, not a model debugging task
Dec 29, 2025
•
Shane Butler
3
1
Latest
Top
Discussions
AI evals are becoming a product capability, not a model debugging task
As AI features scale, the hard part stops being “can we score outputs” and becomes “can the whole team use evaluation to make better product decisions.”
Dec 29, 2025
•
Shane Butler
3
1
AI Evaluation's Missing Layer
A lot of what gets called “AI evaluation” right now looks like debugging.
Dec 24, 2025
•
Shane Butler
1
The Fastest Way to Lose Customers With AI
Is Your Team Ready to Scale AI? A Quick Diagnostic for AI Evaluation
Dec 16, 2025
•
Shane Butler
2
1
Users Lie (But Turkey Sandwiches Don't)
Or the importance of testing feedback informed hypotheses with actual behavioral data
Dec 11, 2025
•
Shane Butler
1
1
1
Redesigning Metrics for AI Products
AI demands a new set of metrics. It requires evolving unreliable deterministic metrics to a systematic analytics process that rigorously ties…
Dec 9, 2025
•
Shane Butler
11
2
3
Rough Thoughts on What an AI Evaluation Lifecycle Might Look Like
I’ve spent the last 10 years in data science and the past year almost exclusively focused on AI evaluation.
Dec 1, 2025
•
Shane Butler
4
1
If you don’t know where to begin with AI evaluation, start here.
Five simple steps anyone can use to get initial signal on quality.
Nov 30, 2025
•
Shane Butler
2
3
1
See all
Data Neighbor Newsletter
We reveal the secrets to a thriving data career, providing insights, emerging trends, and an insider's view of how data and AI transform our profession
Subscribe
Recommendations
The Data Hustle
Sai Kumar Bysani
Tech For Product
Colin Matthews
Exploring Language Models
Maarten Grootendorst
Data Neighbor Newsletter
Subscribe
About
Archive
Recommendations
Sitemap
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts