Open Source Community Tool

Ship semantic search in minutes

The complete developer toolkit for Voyage AI embeddings, vector search, and RAG pipelines.

Terminal, browser, or desktop — embed, compare, benchmark, and deploy with MongoDB Atlas Vector Search.

#1 MTEB Ranking

$ curl -fsSL https://vaicli.com/install.sh | sh

Click to copy · Works on macOS, Linux & WSL

Desktop App

Learn About Voyage AI

Why I built this around Voyage AI

After evaluating embedding providers, Voyage AI stood out for quality and cost. Here's what makes their models worth exploring.

#1 MTEB

SOTA Quality

voyage-3-large ranks #1 on MTEB with 67.29 score, outperforming OpenAI, Cohere, and Google.

$0.02/1M

83% Cost Savings

Starting at $0.02 per million tokens vs OpenAI's $0.13 — same quality, fraction of the cost.

Asymmetric

Shared Embedding Space

Embed documents with voyage-3-large, query with voyage-3-lite. Same vector space, 4x faster queries.

4 Domains

Domain Models

Purpose-built models for code, finance, law, and multilingual — outperform general models by 10-20%.

Reranking

Two-Stage Retrieval

Combine fast embedding search with neural reranking for optimal precision and recall.

<100ms

Production Ready

Sub-100ms latency, 99.9% uptime SLA, and enterprise-grade security for production workloads.

vai is a community-built developer tool — designed to help you explore Voyage AI embeddings and MongoDB Atlas Vector Search. Not affiliated with Voyage AI or MongoDB.

Everything you need to work with embeddings

Six powerful tools in one CLI — from generating embeddings to benchmarking models and building RAG pipelines.

Embed

Generate SOTA embeddings with Voyage AI's models — #1 on MTEB at half the cost of OpenAI.

Try it

Compare

Measure semantic similarity between texts with cosine, dot product, and euclidean distance.

Try it

Multimodal

Embed images and text in a shared vector space — search images with text and vice versa.

Try it

Rerank

Neural reranking improves search precision by 15-30%. Two-stage retrieval made easy.

Try it

Benchmark

Compare latency, quality, and cost across models. Make data-driven decisions.

Try it

Explore

22 interactive guides on embeddings, RAG, vector search, and semantic similarity.

Try it

11 specialized models. One shared space.

Embed documents with voyage-4-large, query with voyage-4-lite — same vector space, 83% cost reduction. The new voyage-4 family sets SOTA on RTEB benchmarks.

Starting at $0.02/1M tokens

RTEB #1: 71.41 NDCG@10

vs OpenAI $0.13/1M

Model	Type	Context	RTEB	Price/1M	Best For
voyage-4-large voyage-4	Embedding	32K	71.41	$0.12	Best quality, multilingual
voyage-4 voyage-4	Embedding	32K	70.07	$0.06	Balanced quality & cost
voyage-4-lite voyage-4	Embedding	32K	68.10	$0.02	Lowest cost, high volume
voyage-code-3	Embedding	32K	—	$0.18	Code & technical docs
voyage-finance-2	Embedding	32K	—	$0.12	Financial documents
voyage-law-2	Embedding	16K	—	$0.12	Legal documents
voyage-context-3 soon	Embedding	32K	—	$0.18	Contextualized chunks
voyage-multimodal-3.5	Multimodal	32K	—	$0.12/M + $0.60/B px	Text + images + video
voyage-4-nano soon voyage-4	Embedding	32K	—	Free (open-weight)	Edge / local / self-hosted
rerank-2.5	Reranking	32K	—	$0.05	Best reranking quality
rerank-2.5-lite	Reranking	32K	—	$0.02	Fast reranking

RTEB Benchmark — NDCG@10 (29 datasets)

voyage-4-large

71.41

voyage-4

70.07

Gemini Embed 001

68.66

voyage-4-lite

68.10

Cohere Embed v4

65.75

OpenAI v3 Large

62.57

Source: Voyage AI, January 2026

Pro tip: All voyage-4 models share the same embedding space. Index documents with voyage-4-large for best quality, then query with voyage-4-lite to save 83% on API costs — no re-indexing needed.

* = default dimension; also supports other dimension sizes

See the Shared Space in Action

Embed the same text with three different models and watch the vectors land in the same neighborhood. Cross-model similarity of 0.95+ — proven live, not just claimed.

Embed documents with the best model. Query with the cheapest. 83% cost savings. No re-vectorization. Ever.

Try the Explorer →Star on GitHub ⭐

Powerful from the command line

Simple, intuitive commands that get out of your way.

Install

Configure

Embed

$ npm install -g voyageai-cli

✓ Installed voyageai-cli globally

$ vai config set api-key YOUR_VOYAGE_API_KEY

✓ API key saved securely

$ vai embed "Hello, vector world!"

✓ Model: voyage-4-large | Dimensions: 1024 | Latency: 89ms

Give your AI agent a knowledge base

The vai MCP server exposes 11 tools over the Model Context Protocol. Any compatible agent can search, embed, rerank, and ingest documents from your collections.

Claude Code

Claude Desktop

Cursor

Windsurf

VS Code

Install

Connect

Use

Agent calls vai_query automatically

11 tools, one protocol

Your agent picks the right tool for the task. No prompting required.

vai_query

Full RAG: embed, search, rerank

Retrieval

vai_search

Raw vector similarity search

Retrieval

vai_rerank

Rerank documents by relevance

Retrieval

vai_embed

Generate embedding vectors

Embedding

vai_similarity

Cosine similarity between texts

Embedding

vai_collections

List collections and indexes

Management

vai_models

Browse Voyage AI models

Management

vai_topics

Discover educational topics

Utility

vai_explain

Deep-dive on any topic

Utility

vai_estimate

Cost calculator for operations

Utility

vai_ingest

Chunk, embed, store documents

Ingestion

$ vai mcp install all

✅ Claude Desktop: installed vai

✅ Claude Code: installed vai

✅ Cursor: installed vai

✅ Windsurf: installed vai

$ vai mcp status

✅ installed Claude Desktop ~/Library/.../claude_desktop_config.json

✅ installed Claude Code ~/.claude/settings.json

✅ installed Cursor ~/.cursor/mcp.json

✅ installed Windsurf ~/.codeium/windsurf/mcp_config.json

View MCP Server docs

Industry Walkthroughs

Built for your domain

Go from sample documents to a working knowledge base in under 30 minutes, with walkthroughs tailored to your industry.

voyage-code-3

Developer Documentation

Internal docs, API references, and runbooks: semantic search in minutes

Explore

voyage-law-2

Legal & Compliance

Semantic search across legal documents, powered by a model trained on legal text

Explore

voyage-finance-2

Financial Services

Earnings calls, risk reports, and policy docs, searchable with a model trained on financial text

Explore

voyage-4-large

Healthcare & Clinical

From clinical guidelines to searchable AI, using your own infrastructure

Explore

View all use cases

Desktop App

A full desktop experience

The Vai desktop app wraps the entire CLI experience in a beautiful Electron interface. Embed text, compare documents, run benchmarks, and explore vector search concepts — all with a visual interface.

Signed & notarized

Auto-updates

macOS.DMG Windows.EXE Linux.AppImage

Vai Desktop

Embed

Compare

Benchmark

Explore

Enter text to embed...

[0.0234, -0.0891, 0.1456, 0.0023, ...]

voyage-4 · 1024 dimensions · 138ms

Community-Built. Globally Used.

Exploring Voyage AI embeddings from the terminal, browser, and desktop.

Countries

Telemetry Events

CLI Activity

Web + Desktop Activity

“vai is now used by developers in 0 countries worldwide”

Delayed aggregate telemetry • Data through the previous UTC day

Ship semantic search .css-fohbyn{background:linear-gradient(135deg, #00D4AA 0%, #40E0FF 100%);-webkit-background-clip:text;background-clip:text;-webkit-background-clip:text;-webkit-text-fill-color:transparent;}in minutes

The complete developer toolkit for Voyage AI embeddings, vector search, and RAG pipelines.

Why I built this around Voyage AI

SOTA Quality

83% Cost Savings

Shared Embedding Space

Domain Models

Two-Stage Retrieval

Production Ready

Everything you need to work with embeddings

Embed

Compare

Multimodal

Rerank

Benchmark

Explore

11 specialized models. One .css-uohran{background:linear-gradient(135deg, #00D4AA, #40E0FF);-webkit-background-clip:text;-webkit-text-fill-color:transparent;}shared space.

RTEB Benchmark — NDCG@10 (29 datasets)

See the Shared Space .css-1nozekp{color:#00D4AA;}in Action

Powerful from the command line

Give your AI agent a knowledge base

11 tools, one protocol

Built for your domain

Developer Documentation

Legal & Compliance

Financial Services

Healthcare & Clinical

A full desktop experience

Ship semantic search in minutes

11 specialized models. One shared space.

See the Shared Space in Action