An eval platform is more than just a test runner. Evals require shared definitions of "good," reliable data pipelines, labelling workflows, versioning, and trust in results across many teams and model changes. Phillip Hetzel explains the design principles behind Braintrust's platform in this session from AI Engineer Europe. https://lnkd.in/e9bTXvsK
About us
Braintrust is the AI observability platform helping teams measure, evaluate, and improve AI in production. By connecting evals and observability in one workflow, teams at Notion, Stripe, Zapier, Vercel, and Ramp ship quality AI products at scale.
- Website
- https://braintrust.dev/
- Industry
- Software Development
- Company size
- 51-200 employees
- Headquarters
- San Francisco
- Type
- Privately Held
- Founded
- 2023
Products
Braintrust
Automated Testing Software
Braintrust is the AI observability platform. By connecting evals and observability in one workflow, Braintrust gives builders the visibility to understand how AI behaves in production and the tools to improve it. Teams at Notion, Stripe, Zapier, Vercel, and Ramp use Braintrust to compare models, test prompts, and catch regressions — turning production data into better AI with every release.
Locations
- San Francisco, US (Primary)
Updates
Evals course module ten: building a multi-turn chat app. Move from single-turn to multi-turn use cases by building a chatbot CLI app with production logging. Use init_logger, wrap_openai, and @traced to capture every conversation as a single trace. More here → https://lnkd.in/dJqiKkAF
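The SDK calls named above can be sketched roughly as follows. This is a minimal sketch, assuming the Braintrust Python SDK and an OpenAI-compatible client with API keys configured; the project name, model name, and the `build_messages` helper are illustrative, not part of any SDK. SDK imports live inside `main()` so the sketch reads (and the helper runs) without the packages installed.

```python
# Sketch: a multi-turn chat CLI whose whole conversation is logged
# as one Braintrust trace, with each model call as a child span.

def build_messages(history, user_input):
    """Pure helper (illustrative): assemble messages for the next call."""
    return [
        {"role": "system", "content": "You are a helpful assistant."},
        *history,
        {"role": "user", "content": user_input},
    ]

def main():
    # SDK imports kept here so the file can be read and the helper
    # tested without braintrust/openai installed.
    from braintrust import init_logger, traced, wrap_openai
    from openai import OpenAI

    init_logger(project="chat-cli")   # send traces to this Braintrust project
    client = wrap_openai(OpenAI())    # log every OpenAI call as a span

    @traced  # one trace for the whole conversation
    def run_conversation():
        history = []
        while True:
            user_input = input("you> ")
            if user_input in {"quit", "exit"}:
                break
            resp = client.chat.completions.create(
                model="gpt-4o-mini",
                messages=build_messages(history, user_input),
            )
            reply = resp.choices[0].message.content
            history += [
                {"role": "user", "content": user_input},
                {"role": "assistant", "content": reply},
            ]
            print(f"bot> {reply}")

    run_conversation()

if __name__ == "__main__":
    main()
```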
For AI PMs, evals are the new PRD. At Product-Led Alliance Summit New York, Ameya Bhatawdekar discussed the new product development loop and how to translate every element of a traditional PRD into its eval equivalent. Watch here → https://lnkd.in/gs2DHeSV
Evals course module nine: how to analyze your eval results. Learn about the four ways to analyze eval data: experiment comparison, Loop queries, the Braintrust MCP server, and manual filtering in the UI. More here → https://lnkd.in/gjFmpPvU
Evals course module eight: how to read a trace. Learn about traces and how to navigate them in the Braintrust UI. Understand span types (root, LLM, scorer, function, task, tool) and use chain-of-thought reasoning to debug scores. More here → https://lnkd.in/gu26fpk9
If you're building AI products but aren't writing evals, this is the place to start. In Evals for engineers, solutions engineer Doug Guthrie will show you how to:
- Instrument an agent with the Braintrust SDK
- Look at traces across model calls, tool use, and outputs
- Build datasets from failure modes and write scoring functions
- Iterate on your prompt and measure quality over time
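For a taste of the scoring-function step, here is a hedged sketch of two hand-written scorers. The names and heuristics (`exact_match`, `contains_all`) are illustrative, not part of the Braintrust SDK; the convention is simply that a scorer returns a value between 0 and 1.

```python
# Sketch: hand-rolled scoring functions for eval results.

def exact_match(output: str, expected: str) -> float:
    """1.0 if the model output equals the expected answer (case/space-insensitive)."""
    return 1.0 if output.strip().lower() == expected.strip().lower() else 0.0

def contains_all(output: str, required_terms: list[str]) -> float:
    """Partial credit: the fraction of required terms present in the output."""
    if not required_terms:
        return 1.0
    hits = sum(term.lower() in output.lower() for term in required_terms)
    return hits / len(required_terms)
```

Scorers like these can then be run over a dataset of failure cases to track quality release over release.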
Braintrust reposted this
PMs aren't writing PRDs anymore. Braintrust's Ameya Bhatawdekar on why the new PM superpower in the age of AI is evals. TL;DR: Find where your AI fails → build better evals → ship with confidence
Earning stakeholder trust means making signals from your eval and observability data legible across your organization. Braintrust does this in three ways:
- Dashboards aggregate metrics across logs and experiments.
- Custom trace views turn complex traces into domain-specific interfaces.
- Loop translates natural-language questions into SQL over your production data.
Read more → https://lnkd.in/gjb-6uGu
Evals course module seven: how to deal with nondeterminism. Learn why the same eval can produce different scores across runs and what to do about it: how temperature affects variance, and how trial_count averages results into a reliable signal. More here → https://lnkd.in/dtZstRi6
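A quick way to see why trial averaging helps is a self-contained simulation. The Gaussian noise model below is purely illustrative (it is not how any eval actually scores); it just shows that averaging each example's score over several trials, as trial_count does, shrinks run-to-run spread.

```python
# Sketch: averaging over trials reduces the variance of noisy eval scores.
import random
import statistics

def noisy_score(rng):
    """Stand-in for a nondeterministic per-run score, clamped to [0, 1]."""
    return min(1.0, max(0.0, rng.gauss(0.7, 0.15)))

rng = random.Random(0)

# 200 single-run scores vs. 200 scores each averaged over 5 trials.
single_runs = [noisy_score(rng) for _ in range(200)]
averaged_runs = [
    statistics.mean(noisy_score(rng) for _ in range(5))  # like trial_count=5
    for _ in range(200)
]

# The averaged scores cluster much more tightly around the true mean.
print("single-run stdev:", statistics.stdev(single_runs))
print("averaged stdev:  ", statistics.stdev(averaged_runs))
```

With independent noise, averaging n trials cuts the standard deviation by roughly a factor of √n, which is why a modest trial_count already yields a much more stable signal.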
Braintrust x Nasdaq. Thank you to Wing Venture Capital, and congratulations to everyone on this year's Enterprise Tech 30.