feat: pluggable embedding provider — E5 adapter for self-hosted embeddings by SZabolotnii · Pull Request #150 · garrytan/gbrain

SZabolotnii · 2026-04-16T12:14:43Z

Summary

Pluggable embedding providers: GBRAIN_EMBEDDING_PROVIDER=e5|openai selects between self-hosted E5 and OpenAI. OpenAI remains default.
E5 HTTP adapter (embedding-e5.ts): talks to self-hosted intfloat/multilingual-e5-small (or compatible) endpoint with auto-dimension detection, retry, batch splitting.
Extracted OpenAI provider (embedding-openai.ts): same behavior, clean separation.
Dynamic vector dimensions: schema creation substitutes vector(384) for E5 or vector(1536) for OpenAI across PGLite and Postgres engines.
12 unit tests: E5 adapter (fetch mock, batching, truncation, dimension detection), OpenAI exports, provider router, schema substitution.
Deployed and verified on VPS with Ayona's E5 container — gbrain init creates vector(384) schema, doctor reports health_score=90.

Env vars

Variable	Default	Description
`GBRAIN_EMBEDDING_PROVIDER`	`openai`	`e5` or `openai`
`GBRAIN_E5_URL`	`http://embeddings-e5:8000/embed`	E5 endpoint URL
`GBRAIN_E5_BATCH_SIZE`	`16`	Texts per E5 request

Test plan

bun test — 858 pass, 0 fail
test/embedding.test.ts — 12 tests covering all providers
VPS deployment verified (E5 384d, doctor healthy)
E2E with real Postgres+pgvector (requires DATABASE_URL)

🤖 Generated with Claude Code

…dings Add support for self-hosted E5 embedding models (intfloat/multilingual-e5-small) as an alternative to OpenAI. Enables zero-cost embeddings using existing infrastructure (e.g., Ayona's E5 container on VPS). - New provider router in embedding.ts (GBRAIN_EMBEDDING_PROVIDER=e5|openai) - E5 HTTP adapter (embedding-e5.ts) with auto-dimension detection, retry, batch - Extracted OpenAI provider into embedding-openai.ts (no behavior change) - Dynamic vector(N) in schema — 384d for E5, 1536d for OpenAI - Schema dimension substitution in db.ts, postgres-engine.ts, pglite-engine.ts Deployed and verified on VPS: gbrain init with E5 creates vector(384) schema, doctor reports health_score=90, all checks OK. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

12 unit tests for E5 adapter (fetch mock, batching, truncation, dimension detection), OpenAI adapter exports, provider router, and schema dimension substitution. CHANGELOG v0.10.2 entry. CLAUDE.md updated with new files. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

garrytan · 2026-06-08T02:57:01Z

Thanks for this contribution — and apologies for the slow triage. We did a full pass over the entire PR backlog. gbrain has moved fast, and the maintainer's larger "cathedral" rewrites have superseded a big share of community PRs: the AI gateway + recipes + user_provided_models system replaced almost all individual provider PRs; #1805 fixed the whole Postgres module-singleton class; #1542 unified the type taxonomy; #1657 the retrieval path; #1802 the doctor; and so on.

We're closing this one in that cleanup — either the fix already landed on master, it duplicates another PR or merged change, or it's outside the current merge bar. Where a closed PR carried a genuinely valuable idea, we've recorded it in docs/designs/COMMUNITY_IDEAS.md so nothing good is lost (a few may graduate into TODOs).

Please don't read the close as a judgment of the work — thank you for contributing. If you believe the underlying issue is still live on the latest master, reopen with a quick note and we'll take another look. 🙏

SZabolotnii and others added 2 commits April 16, 2026 12:18

prasadus92 mentioned this pull request Apr 18, 2026

feat: Gemini embedding support via GEMINI_API_KEY env var (zero-config, free tier) #89

Closed

garrytan mentioned this pull request Apr 20, 2026

feat: v0.28.9 pluggable embedding providers — Vercel AI SDK #257

Merged

9 tasks

garrytan mentioned this pull request May 10, 2026

v0.32.0 feat: 5 new embedding recipes + discoverability pass (closes 17-PR cluster) #810

Merged

8 tasks

garrytan closed this Jun 8, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: pluggable embedding provider — E5 adapter for self-hosted embeddings#150

feat: pluggable embedding provider — E5 adapter for self-hosted embeddings#150
SZabolotnii wants to merge 2 commits into
garrytan:masterfrom
SZabolotnii:feat/e5-embedding-adapter

SZabolotnii commented Apr 16, 2026

Uh oh!

garrytan commented Jun 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

SZabolotnii commented Apr 16, 2026

Summary

Env vars

Test plan

Uh oh!

garrytan commented Jun 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants