fix(oss): auto-detect embedding dimension to fix Qdrant mismatch with non-OpenAI embedders by utkarsh240799 · Pull Request #4297 · mem0ai/mem0

utkarsh240799 · 2026-03-11T07:30:38Z

Problem

Users using non-OpenAI embedding providers (Ollama/nomic-embed-text, Google/Gemini, HuggingFace, etc.) hit Bad Request errors when using Qdrant because the vector store was always created with a hardcoded default dimension of 1536 (OpenAI's dimension), regardless of the actual embedding model's output dimension.

This caused three related issues:

Qdrant collection created with wrong dimensions when using Ollama embedder (nomic-embed-text) #4212 — Ollama + nomic-embed-text (768 dims) → Qdrant collection created with 1536 dims → Bad Request on insert
Bug: Qdrant collection created with wrong vector dimension when using Ollama embeddings #4173 — Same root cause with different embedding providers
Mem0 local qdrant error #4056 — Race condition during Qdrant collection creation + memory_migrations collection using wrong dimension

Solution

1. Probe-based dimension auto-detection

When no explicit dimension is provided, the Memory class now embeds a short probe string ("dimension probe") at initialization time and reads the resulting vector length to determine the correct dimension. This happens before the vector store is created.

Dimension resolution order:

Explicit vectorStore.config.dimension (user override)
embedder.config.embeddingDims (embedder config)
Auto-detected via probe embedding (new fallback)

2. Deferred vector store creation

The vector store is no longer created in the Memory constructor. Instead, it's created in an async _autoInitialize() method after the dimension is known. All public Memory methods (add, search, getAll, etc.) await initialization before proceeding via a lazy init gate (_ensureInitialized()).

3. Atomic Qdrant collection creation

Replaced the check-then-create pattern (TOCTOU race) with create-and-catch-409:

try {
  await this.client.createCollection(name, { vectors: { size, distance: "Cosine" } });
} catch (error) {
  if (error?.status === 409) {
    // Collection already exists — verify dimension matches
  } else {
    throw error;
  }
}

4. Idempotent init guards for all vector stores

Every vector store with async initialization (Qdrant, Redis, Supabase, AzureAISearch, Vectorize) now uses a _initPromise singleton guard. This prevents double-initialization when Memory explicitly calls await vectorStore.initialize() after the constructor already fired it:

private _initPromise?: Promise<void>;

async initialize(): Promise<void> {
  if (!this._initPromise) {
    this._initPromise = this._doInitialize();
  }
  return this._initPromise;
}

Without this guard:

Redis: Double connect() → "Socket already opened", plus dropIndex/createIndex destroys the index the first call just built
Qdrant: Double createCollection → 409 Conflict crash
Vectorize: Double indexes.create() → Cloudflare API error
Supabase/AzureAISearch: Redundant network calls and potential transient failures

5. Google embedder fix

Fixed hardcoded outputDimensionality: 768 in google.ts — now uses this.embeddingDims from config.

Files Changed

File	Change
`src/oss/src/memory/index.ts`	Deferred vector store creation, lazy init gate, probe-based dimension detection
`src/oss/src/config/manager.ts`	Dimension resolution: explicit > embeddingDims > undefined (triggers probe)
`src/oss/src/vector_stores/qdrant.ts`	Atomic create-and-catch-409, idempotent init, transient error resilience
`src/oss/src/vector_stores/redis.ts`	Idempotent init guard
`src/oss/src/vector_stores/supabase.ts`	Idempotent init guard
`src/oss/src/vector_stores/azure_ai_search.ts`	Idempotent init guard
`src/oss/src/vector_stores/vectorize.ts`	Idempotent init guard
`src/oss/src/embeddings/google.ts`	Use `embeddingDims` config instead of hardcoded 768

Testing

Unit Tests (61 tests)

dimension-autodetect.test.ts (22 tests) — ConfigManager dimension resolution, MemoryVectorStore backward compat, Memory auto-initialization, error propagation
config-manager.test.ts (5 tests) — ConfigManager dimension priority chain
vector-stores-compat.test.ts (34 tests) — Backward compatibility for all 7 vector store implementations (MemoryVectorStore, Qdrant, Redis, Supabase, AzureAISearch, Vectorize, Langchain) with mocked clients

End-to-End Tests (21 tests, against real instances)

qdrant-e2e.test.ts (8 tests) — Against real Qdrant v1.13.0:
- Reproduces the exact Bad Request error from Qdrant collection created with wrong dimensions when using Ollama embedder (nomic-embed-text) #4212/Bug: Qdrant collection created with wrong vector dimension when using Ollama embeddings #4173
- Verifies 768-dim CRUD works after fix
- Memory auto-detects dims via probe (full integration)
- Explicit dimension and embeddingDims skip probe (backward compat)
- Concurrent ensureCollection race condition (5 parallel instances)
- getUserId/setUserId after concurrent init
- memory_migrations uses dim=1 independently of main collection
redis-e2e.test.ts (13 tests) — Against real Redis Stack:
- Idempotent init (multiple concurrent calls)
- Full CRUD: insert, search, get, update, delete
- List with filters
- getUserId/setUserId roundtrip
- Correct dimension propagation from config

Total: 82 tests, all passing

Checklist

Fixes Qdrant collection created with wrong dimensions when using Ollama embedder (nomic-embed-text) #4212, Bug: Qdrant collection created with wrong vector dimension when using Ollama embeddings #4173, Mem0 local qdrant error #4056
Backward compatible — explicit dimension config still works, no probe triggered
All existing tests pass
New unit tests for dimension resolution and backward compatibility
End-to-end tests against real Qdrant and Redis Stack instances
No breaking changes to public API
All vector store implementations verified (Qdrant, Redis, Supabase, AzureAISearch, Vectorize, Langchain, MemoryVectorStore)
Race conditions handled (concurrent init, TOCTOU collection creation)
Error messages are helpful (probe failure tells user to set dimension explicitly)

… non-OpenAI embedders When using embedders like Ollama's nomic-embed-text (768 dims), the hardcoded 1536 default caused Qdrant to reject vectors with Bad Request. - Probe embedder at init to auto-detect dimension when not explicitly set - Defer vector store creation until dimension is known - Gate all public Memory methods behind lazy init promise - Fix Qdrant collection creation race condition (TOCTOU → atomic create + catch 409) - Fix Google embedder hardcoded outputDimensionality (768 → config value) - Use absolute path for default historyDbPath (~/.mem0/memory.db) - Propagate init errors to callers with clear message suggesting explicit config Fixes #4212, #4173, #4056 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…ection - Make Qdrant.initialize() idempotent with promise gate to prevent double-init race between constructor and explicit callers - Make ensureCollection resilient to transient 500 errors during dimension verification after 409 - Await vectorStore.initialize() in Memory._autoInitialize() to guarantee collections exist before any public method runs - Add 8 e2e tests against real Qdrant verifying all three issues: dimension mismatch (#4212/#4173), race condition (#4056), and memory_migrations dimension isolation (#4056) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Prevent double-initialization when Memory explicitly calls initialize() after the constructor already fired it. Without this guard, Redis reconnects, Supabase re-verifies, AzureAISearch re-lists indexes, and Vectorize re-creates indexes on every call. Adds 13 Redis Stack e2e tests and 34 backward-compatibility unit tests covering all vector store implementations. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

whysosaket · 2026-03-11T10:58:48Z

Once init fails, the Memory instance is permanently dead... no retry, no recovery. If the embedder was briefly down at startup, user has to recreate everything.

mem0-ts/src/oss/src/vector_stores/qdrant.ts

whysosaket · 2026-03-11T11:01:47Z

E2E tests hard-fail without local Qdrant/Redis and have no skip mechanism.

- Log transient verification errors in ensureCollection instead of silently swallowing them (auth/network failures are now visible) - Auto-retry initialization on failure so a transiently unavailable embedder or vector store at startup doesn't permanently kill the Memory instance - Skip e2e tests gracefully when Qdrant/Redis containers aren't running Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

… non-OpenAI embedders (mem0ai#4297) Co-authored-by: utkarsh240799 <utkarsh240799@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>

utkarsh240799 and others added 3 commits March 11, 2026 01:03

utkarsh240799 requested a review from whysosaket March 11, 2026 07:31

revert: remove unrelated historyDbPath change from defaults.ts

5c28af5

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

whysosaket reviewed Mar 11, 2026

View reviewed changes

mem0-ts/src/oss/src/vector_stores/qdrant.ts Show resolved Hide resolved

whysosaket previously approved these changes Mar 11, 2026

View reviewed changes

utkarsh240799 dismissed whysosaket’s stale review via 630974e March 11, 2026 12:21

utkarsh240799 requested a review from whysosaket March 11, 2026 14:49

whysosaket approved these changes Mar 12, 2026

View reviewed changes

whysosaket merged commit 59c3b05 into main Mar 12, 2026
3 checks passed

whysosaket deleted the fix/qdrant-dimension-mismatch branch March 12, 2026 16:15

This was referenced Mar 12, 2026

Bug: Qdrant collection created with wrong vector dimension when using Ollama embeddings #4173

Closed

Mem0 local qdrant error #4056

Closed

This was referenced Mar 16, 2026

ensureCollection throws 403 with Qdrant scoped JWTs — only catches 409, not 401/403 #4355

Closed

fix(qdrant): handle 401/403 in ensureCollection for scoped JWTs #4356

Merged

utkarsh240799 mentioned this pull request Mar 19, 2026

Qdrant collection created with wrong vector dimensions — mem0 defaults to 1536 (OpenAI's embedding size) #4195

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(oss): auto-detect embedding dimension to fix Qdrant mismatch with non-OpenAI embedders#4297

fix(oss): auto-detect embedding dimension to fix Qdrant mismatch with non-OpenAI embedders#4297
whysosaket merged 5 commits intomainfrom
fix/qdrant-dimension-mismatch

utkarsh240799 commented Mar 11, 2026 •

edited

Loading

Uh oh!

whysosaket commented Mar 11, 2026

Uh oh!

Uh oh!

whysosaket commented Mar 11, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

utkarsh240799 commented Mar 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Problem

Solution

1. Probe-based dimension auto-detection

2. Deferred vector store creation

3. Atomic Qdrant collection creation

4. Idempotent init guards for all vector stores

5. Google embedder fix

Files Changed

Testing

Unit Tests (61 tests)

End-to-End Tests (21 tests, against real instances)

Total: 82 tests, all passing

Checklist

Uh oh!

whysosaket commented Mar 11, 2026

Uh oh!

Uh oh!

whysosaket commented Mar 11, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

utkarsh240799 commented Mar 11, 2026 •

edited

Loading