AI gets better when it understands the human in front of it.

We build the infrastructure that makes that possible.

Open Data Labs is an applied research institute. We develop technology that helps AI understand the humans using it.

ODL is focused on one question: where does AI get the context it needs? Not from bigger models. Not from synthetic data. From the user, with consent.

We work with AI teams at every stage. Whether you’re scoping your first pilot or scaling a model in production, your product needs to know who it’s talking to. We make that possible.

Products

Infrastructure for every layer, from training data to runtime context to agentic commerce.

Product teams

Context Gateway

One integration. Real user context, from the platforms they already use.

AI & ML teams

Primary Source Datasets

Data direct from contributors. Consented, redacted, ready to train.

Commerce teams

Agentic Concierge

Make your site agent-ready. AI agents arrive, find what your customers want, and convert.

From the lab

Case studies, research, and press.

Fashion • Primary Source Datasets

Fortune 500 fashion brand builds real-time taste prediction engine.

A Fortune 500 fashion retailer used primary source datasets to infer style preference before a customer had any purchase history.

3D isometric illustration of layered data infrastructure.

Research · October 30, 2025

Open Problems in AI Data Economics

We introduce data economics as a coherent field and define the open problems that haven’t yet been formalised. Most AI economics research focuses on downstream effects. We argue you can’t understand AI’s economic trajectory without studying how data, compute, and labour interact at the production layer.

Thweet whiskey ice cream bar, Seongsu Seoul, Korea Blockchain Week 2025.

Experiential • Brand Activation • Context Gateway

Thweet creates 1,497 one-of-one guest experiences.

Thweet turned one creative concept into 1,497 personalised experiences during Korea Blockchain Week 2025.

Press · MIT News · April 2025

Vana lets users own a piece of the AI models trained on their data

MIT News on the founding story, the Media Lab origins, and the case for user-owned AI.

Motion-blurred hand holding a phone with a messaging app, conveying real human conversation data.

AI Training Data • Context Gateway

Frontier AI Lab sources real human conversation data with consent, redaction, and provenance.

A frontier AI lab used Context Gateway to source real human conversational data with documented consent, redaction, and provenance.

3D isometric illustration of data processing architecture.

Research · August 1, 2024

Model Influence Functions: Measuring Data Quality

A framework for attributing model outputs to training data contributions. The methodological foundation for pricing and valuing datasets in commercial AI pipelines.

See more case studies

Your AI deserves better data and richer context.

Book a Call