Skip to main content

Key Capabilities

Persistent Storage

All data and computed results are automatically stored and versioned.

Incremental Updates

Data transformations run automatically on new data—no orchestration code needed.

Multimodal-Native

Images, video, audio, and documents integrate seamlessly with structured data.

AI Integration

Built-in support for OpenAI, Anthropic, Gemini, Hugging Face, and dozens more.

Get started

Many documentation pages are interactive notebooks (marked with in the sidebar). Open them in Colab, Kaggle, or locally to follow along.

Core Primitives

Pixeltable provides a small set of primitives that compose into any multimodal AI workflow, including but not limited to:
PrimitiveWhat It Enables
pxt.create_table() with pxt.Image, pxt.Video, pxt.Audio, pxt.DocumentStore any multimodal data natively
pxt.create_view() with iteratorsExtract frames from video, chunk documents, split audio
add_computed_column()Run any AI model or transformation—incrementally
add_embedding_index()Semantic search on any column
@pxt.udf / @pxt.queryExtend with your own Python code
select(), where(), order_by()SQL-like querying with Python syntax
history(), revert()Time travel and version control
pxt.replicate(), pxt.publish()Share and replicate datasets via Pixeltable Cloud
These primitives are use-case agnostic by design. We don’t build vertical solutions—we build the infrastructure that makes vertical solutions trivial to build.

What can you build?

Declarative Pipelines

Replace complex orchestration with simple computed columns. Define transformations once—they run automatically on all data.

Multimodal Workloads

Production RAG with automatic embedding indexing. Find relevant scenes in video. Semantic search across text, images, and audio.

Version Control and Lineage

Automatic versioning on every change. Time travel queries to any point. Full data lineage for reproducibility.

AI Agents & MCP

Build tool-calling agents with persistent memory, MCP server integration, and automatic conversation history.

ML Feature Engineering

Curate, augment, and export data to PyTorch, Parquet, COCO format, LanceDB, and pandas for training and analytics.

Explore by use case

Next steps

Last modified on January 10, 2026