Add configurable embedding provider support by 100yenadmin · Pull Request #134 · garrytan/gbrain

100yenadmin · 2026-04-15T10:20:48Z

Summary

make embedding provider, model, and dimensions configurable via env vars
preserve the current OpenAI defaults while adding Voyage AI support
add minimal docs and focused tests for the new config path

Validation

bun install
bun test test/embed.test.ts

Closes #133

Copilot

Pull request overview

Adds an env-driven configuration layer to the embedding subsystem so deployments can switch embedding provider/model/dimensions (defaulting to the current OpenAI setup) and records the embedding model used when writing chunk updates.

Changes:

Introduce getEmbeddingConfig() and wire embedBatch to support openai (default) and voyage providers.
Persist the configured embedding model onto chunk updates in the gbrain embed command.
Add docs and tests covering the new configuration path.

Reviewed changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 5 comments.

Show a summary per file

File	Description
test/embed.test.ts	Adds unit tests for embedding config defaults/overrides and extends env cleanup.
src/core/embedding.ts	Implements env-driven embedding config and adds Voyage embeddings implementation.
src/commands/embed.ts	Records embedding model on chunk upserts during embedding runs.
README.md	Notes embedding provider/model/dimensions env vars in CLI help.
INSTALL_FOR_AGENTS.md	Documents optional embedding provider overrides and Voyage API key.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

+export function getEmbeddingConfig(): EmbeddingConfig {
+  const providerRaw = (process.env.EMBEDDING_PROVIDER || DEFAULT_PROVIDER).toLowerCase();
+  const model = process.env.EMBEDDING_MODEL || DEFAULT_MODEL;
+  const dimensionsRaw = process.env.EMBEDDING_DIMENSIONS;
+  const dimensions = dimensionsRaw ? parseInt(dimensionsRaw, 10) : DEFAULT_DIMENSIONS;
+
+  if (providerRaw !== 'openai' && providerRaw !== 'voyage') {
+    throw new Error(`Unsupported embedding provider: ${providerRaw}. Expected openai or voyage.`);
+  }
+
+  if (dimensionsRaw && Number.isNaN(dimensions)) {
+    throw new Error(`Invalid EMBEDDING_DIMENSIONS: ${dimensionsRaw}`);
  }
-  return client;
+
+  return {
+    provider: providerRaw,
+    model,
+    dimensions,
+  };


  const updated: ChunkInput[] = chunks.map(c => ({
    chunk_index: c.chunk_index,
    chunk_text: c.chunk_text,
    chunk_source: c.chunk_source,
    embedding: embeddingMap.get(c.chunk_index),
+    model: embeddingConfig.model,
    token_count: c.token_count || Math.ceil(c.chunk_text.length / 4),
  }));


      const updated: ChunkInput[] = chunks.map(c => ({
        chunk_index: c.chunk_index,
        chunk_text: c.chunk_text,
        chunk_source: c.chunk_source,
        embedding: embeddingMap.get(c.chunk_index) ?? undefined,
+        model: embeddingConfig.model,
        token_count: c.token_count || Math.ceil(c.chunk_text.length / 4),
      }));


+import { getEmbeddingConfig } from '../src/core/embedding.ts';
 import type { BrainEngine } from '../src/core/engine.ts';


+  const providerRaw = (process.env.EMBEDDING_PROVIDER || DEFAULT_PROVIDER).toLowerCase();
+  const model = process.env.EMBEDDING_MODEL || DEFAULT_MODEL;
+  const dimensionsRaw = process.env.EMBEDDING_DIMENSIONS;
+  const dimensions = dimensionsRaw ? parseInt(dimensionsRaw, 10) : DEFAULT_DIMENSIONS;
+


100yenadmin · 2026-05-12T19:58:39Z

Closing as superseded by current upstream provider gateway work.

GBrain now has the v0.27+ provider recipe/gateway architecture with OpenAI, Gemini, Ollama, Voyage, LiteLLM/OpenAI-compatible providers, model discovery, and provider tests. Any remaining provider gaps should be filed as narrow issues/PRs against the current recipe architecture rather than this old configurable-provider branch.

[subagent] feat: add configurable embedding provider support

aa7ed28

Copilot AI review requested due to automatic review settings April 15, 2026 10:20

Copilot started reviewing on behalf of 100yenadmin April 15, 2026 10:21 View session

Copilot AI reviewed Apr 15, 2026

View reviewed changes

prasadus92 mentioned this pull request Apr 18, 2026

feat: Gemini embedding support via GEMINI_API_KEY env var (zero-config, free tier) #89

Closed

garrytan mentioned this pull request Apr 20, 2026

feat: v0.28.9 pluggable embedding providers — Vercel AI SDK #257

Merged

9 tasks

garrytan mentioned this pull request May 10, 2026

v0.32.0 feat: 5 new embedding recipes + discoverability pass (closes 17-PR cluster) #810

Merged

8 tasks

100yenadmin closed this May 12, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add configurable embedding provider support#134

Add configurable embedding provider support#134
100yenadmin wants to merge 1 commit into
garrytan:masterfrom
electricsheephq:sub/voyage-embedding-provider

100yenadmin commented Apr 15, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

100yenadmin commented May 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		import { getEmbeddingConfig } from '../src/core/embedding.ts';
		import type { BrainEngine } from '../src/core/engine.ts';

Conversation

100yenadmin commented Apr 15, 2026

Summary

Validation

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

100yenadmin commented May 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants