fix: 50x speedup for FTS5 search with collection filter by possibilities · Pull Request #455 · tobi/qmd

possibilities · 2026-03-23T15:36:03Z

Problem

qmd search and the BM25 leg of qmd query are extremely slow on large collections when using the -c (collection) filter. A simple keyword search on a 16K-document collection takes 20 seconds instead of milliseconds.

Root cause: When searchFTS() combines documents_fts MATCH ? with d.collection = ? in the same WHERE clause, SQLite's query planner abandons the FTS5 index and falls back to a catastrophically slow execution plan. This is a known SQLite behavior — FTS5 virtual tables don't participate well in complex WHERE clauses with conditions on JOINed columns.

The underlying FTS5 index is fast. A direct SELECT ... FROM documents_fts WHERE MATCH ? completes in 8ms. The 20-second query time is entirely due to the query planner choosing the wrong plan.

Fix

Wrap the FTS5 query in a CTE so SQLite executes it first with proper index usage, then filters by collection on the materialized results:

-- Before (17s):
SELECT ... FROM documents_fts f
JOIN documents d ON d.id = f.rowid
WHERE documents_fts MATCH ? AND d.collection = ?

-- After (8ms):
WITH fts_matches AS (
  SELECT rowid, bm25(documents_fts, 10.0, 1.0) as bm25_score
  FROM documents_fts WHERE documents_fts MATCH ?
  ORDER BY bm25_score ASC LIMIT ?
)
SELECT ... FROM fts_matches fm
JOIN documents d ON d.id = fm.rowid
WHERE d.collection = ?

When filtering by collection, the CTE fetches limit * 10 candidates to ensure enough results survive filtering. Without a collection filter, the old plan was already optimal — no CTE overhead is added.

Benchmarks

Tested on a real index (44,682 documents, 3.3 GB, Bun 1.x on Linux with RTX 3070):

Command	Before	After	Speedup
`qmd search "knowctl" -c large-collection -n 3`	19.8s	0.4s	50x
`qmd search "zmq" -c large-collection -n 3`	5.9s	0.4s	15x
`qmd search` (no collection filter)	0.3s	0.3s	no change
`qmd search` (small collection)	0.3s	0.3s	no change

The qmd query hybrid pipeline also benefits since it calls searchFTS for its BM25 leg.

Reproduction

# Create a large collection (10K+ documents)
qmd search "common-term" -c large-collection -n 3  # slow before fix

The issue scales with how many documents match the search term in the FTS index, not with collection size per se. Common terms ("config", "import", "function") in large corpora are most affected.

Test plan

All 201 existing store.test.ts tests pass, including the searchFTS filters by collection name test
Live-tested on production index with 44K documents
Verified results are identical before and after (same documents, same scores)
No regression for queries without collection filter

…llection filter When searchFTS combines FTS5 MATCH with a collection filter (d.collection = ?) in the same WHERE clause, SQLite's query planner abandons the FTS5 index and falls back to a full scan. This turns an 8ms query into a 17+ second query on large collections (16K+ documents). The fix wraps the FTS5 query in a CTE so it runs first with proper index usage, then filters by collection on the materialized results. Benchmarks on a 16,258-document collection: Before: qmd search "knowctl" -c <collection> → 19.8s After: qmd search "knowctl" -c <collection> → 0.4s The CTE fetches limit*10 candidates from the FTS index to ensure enough results survive collection filtering. Without a collection filter, the query plan was already optimal, so no CTE overhead is added in that case.

possibilities · 2026-03-23T15:39:46Z

Dumb agent posted this without my approval. Normally I'd humanify it and think through it a lot more before posting. I tamed my agent for the future, sorry it got sloppy on you. That said what do you think of this. I set up qmd and have been struggling to make it work well on my system. This seems to be a big breakthrough but I'm over my head. Thanks @tobi !

possibilities · 2026-03-24T16:49:31Z

Some other improvements I've found:

perf/vec-search-collection-prefilter — Pre-filter sqlite-vec searches by collection using vec0's native hash_seq IN (json_each) constraint. Currently searchVec() scans all vectors (~600K in our index) even when targeting a specific collection. For small collections (e.g. 92 vectors for a single topic), this eliminates 99.98% of distance computations. Measured: retrieval-vec drops from ~4s to ~1.1s for 4 sequential searches. Threshold at 50K vectors — larger collections fall back to full scan.

feat/query-pipeline-profiling — Opt-in console.time instrumentation for the full query pipeline (gated behind QMD_PROFILE=1). Covers: bm25-probe, expand, retrieval-lex, embed-batch, retrieval-vec, rrf-fusion, chunking, rerank, blend-dedup, total. LLM-level timers separate model-load from inference. Zero overhead when disabled.

possibilities · 2026-03-24T17:59:25Z

fix/pipe-structured-search-snippet — Fix snippet extraction in pipe-query's structured-search handler. The handler used req.query for highlight terms, but structured-search requests send req.searches (not req.query), so snippets were extracted with an empty string. Now computes a display query from the searches array, preferring lex > vec > first search's query term.

Wraps FTS5 query in CTE to prevent query planner from abandoning FTS5 index when combined with collection filter. Adapted with corrected BM25 weights from tobi#462. Cherry-picked from tobi#455 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Merges dev-upstream-fixes (cherry-picked PRs tobi#462, tobi#463, tobi#455, tobi#418, tobi#456, tobi#442, tobi#453) into dev. Resolved mcp/server.ts bind conflict — keep 0.0.0.0 for Docker container accessibility. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Resolve conflict: use CTE approach from tobi#455 with updated BM25 weights (1.5, 4.0, 1.0) from tobi#462. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Resolve conflict: use CTE approach from tobi#455 with updated BM25 weights (1.5, 4.0, 1.0) from tobi#462.

Resolve conflict: use CTE approach from #455 with updated BM25 weights (1.5, 4.0, 1.0) from #462. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Resolve conflict: use CTE approach from tobi#455 with updated BM25 weights (1.5, 4.0, 1.0) from tobi#462. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

chore: update bun.lock after dependency install

bc80e72

tobi merged commit 827ad83 into tobi:main Mar 28, 2026

jaylfc added a commit to jaylfc/qmd that referenced this pull request Apr 5, 2026

Merge origin/main into fix/fts5-collection-filter-performance

4f5e191

Resolve conflict: use CTE approach from tobi#455 with updated BM25 weights (1.5, 4.0, 1.0) from tobi#462.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: 50x speedup for FTS5 search with collection filter#455

fix: 50x speedup for FTS5 search with collection filter#455
tobi merged 2 commits into
tobi:mainfrom
possibilities:fix/fts5-collection-filter-performance

possibilities commented Mar 23, 2026

Uh oh!

possibilities commented Mar 23, 2026

Uh oh!

possibilities commented Mar 24, 2026 •

edited

Loading

Uh oh!

possibilities commented Mar 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

possibilities commented Mar 23, 2026

Problem

Fix

Benchmarks

Reproduction

Test plan

Uh oh!

possibilities commented Mar 23, 2026

Uh oh!

possibilities commented Mar 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

possibilities commented Mar 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

possibilities commented Mar 24, 2026 •

edited

Loading