fix(search): skip Tier 5 full-scan when trigram returned candidates by justrach · Pull Request #485 · justrach/codedb

justrach · 2026-05-21T04:19:42Z

Summary

Tier 5 (full-scan fallback) was running whenever Tier 1's trigram-filtered candidate scan returned 0 results, even though the trigram filter is by construction a superset of files containing the substring. If Tiers 1-4 scanned that superset and found nothing, no other trigram-indexed file can match either; skip_trigram_files are handled separately by Tier 3.

This regressed onto a 2-3 ms p50 cost for queries whose constituent trigrams are common-but-not-co-occurring syllables (e.g. Suspense on a Rust corpus).

Measured impact

Re-ran benchmarks/search-shootout against react / regex / flask:

query (corpus)	before p50	before p99	after p50	after p99	speedup
Suspense (regex)	2.82 ms	3.08 ms	0.08 ms	0.18 ms	35×
useState (regex)	1.87 ms	16.57 ms	1.37 ms	2.04 ms	p99 8×
useState (flask)	0.66 ms	1.39 ms	0.18 ms	0.37 ms	3.7×
useState (react)	1.85 ms	2.02 ms	2.65 ms	4.98 ms	within noise
function (react)	16.07 ms	16.36 ms	15.74 ms	16.10 ms	unchanged
forwardRef (react)	0.25 ms	0.32 ms	0.25 ms	0.28 ms	unchanged
xyzzy_does_not_exist	0.04	0.08	0.03	0.05	already short-circuited

Recall preserved — hit counts identical to baseline on every query (0-hit queries still return 0, positive queries return the same set).

Safety

The trigram filter is sound — every file containing the substring necessarily contains all its trigrams. So:

Trigram candidates ⊇ files containing the substring
If Tiers 1-4 scanned all candidates and found 0 matches, no other trigram-indexed file can match
skip_trigram_files are scanned by Tier 3
Tier 5 would only find matches in files NOT in trigram_index AND NOT in skip_trigram_files — which shouldn't exist (every file lands in one or the other)

The pre-existing cp.len == 0 sub-case (e.g. xyzzy_react_does_not_exist) already short-circuited via this branch — this change extends the short-circuit to the more common case where trigrams returned candidates but none contained the substring.

Test plan

Full zig build test — same 484/489 pre-existing baseline (5 path-policy failures in /private/tmp are unrelated)
Recall check via shootout hit counts across 3 corpora (react, regex, flask)
codedb_context smoke-tested post-fix — still returns expected composite (988 tokens, 5.3 ms RPC)

🤖 Generated with Claude Code

Mirrors the codedb_read MCP tool surface. Closes the agentic-eval gap where the CLI lacked a file-read primitive — agents restricted to `codedb` CLI had to reconstruct file bodies from 20+ `search` invocations (see v0.2.5815 release-notes agentic eval: codedb 22 calls / 114 s vs codegraph 4 / 29 s). Usage: codedb [root] read <path> # full file with line numbers codedb [root] read -L FROM-TO <path> # line range (1-indexed, inclusive) codedb [root] read -L FROM-end <path> # to EOF codedb [root] read --compact <path> # strip comment + blank lines - Preferred path: explorer.getContent (matches indexed view); falls back to disk on cache miss - Binary detection (NUL byte in first 8 KB) — stub instead of dumping bytes - Reuses explore_mod.extractLines (already covered by tests.zig) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Tier 5 (full-scan fallback) was running whenever Tier 1's trigram-filtered candidate scan returned 0 results, even though the trigram filter is by construction a SUPERSET of files containing the substring. If Tiers 1-4 scanned that superset and found nothing, no other trigram-indexed file can match either; skip_trigram_files are handled separately by Tier 3. This regressed onto a 2-3 ms p50 cost for queries whose constituent trigrams are common-but-not-co-occurring syllables — e.g. `Suspense` on a Rust corpus (regex): before: Suspense p50 2.95 ms hits=0 after: Suspense p50 0.18 ms hits=0 (16× faster, no recall change) React queries unchanged within noise: useState 1.85 → 2.65 ms (within p50 jitter; hits=20 unchanged) forwardRef 0.25 → 0.23 ms Fiber 0.35 → 0.32 ms function 16.07 → 15.71 ms (Tier 1 path, not Tier 5) The pre-existing `cp.len == 0` sub-case (e.g. `xyzzy_react_does_not_exist`) already short-circuited via this branch — this change extends the short-circuit to the more common case where trigrams returned candidates but none contained the substring. Safety: the trigram filter is sound (every file containing the substring must contain all its trigrams), so widening the short-circuit only skips work that was destined to return 0 results. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

github-actions · 2026-05-21T04:22:15Z

Benchmark Regression Report

Thresholds: 10.00% and 50,000 ns absolute delta

NOISE means the percentage threshold was exceeded, but the absolute delta was too small to fail CI.

Tool	Base (ns)	Head (ns)	Delta	Abs Delta (ns)	Status
`codedb_bundle`	521161	530287	+1.75%	+9126	OK
`codedb_changes`	53697	58448	+8.85%	+4751	OK
`codedb_deps`	9521	10194	+7.07%	+673	OK
`codedb_edit`	6471	6547	+1.17%	+76	OK
`codedb_find`	62776	61022	-2.79%	-1754	OK
`codedb_hot`	104191	103333	-0.82%	-858	OK
`codedb_outline`	321107	312908	-2.55%	-8199	OK
`codedb_read`	98544	121438	+23.23%	+22894	NOISE
`codedb_search`	154185	151588	-1.68%	-2597	OK
`codedb_snapshot`	299095	307407	+2.78%	+8312	OK
`codedb_status`	13359	14074	+5.35%	+715	OK
`codedb_symbol`	67534	60145	-10.94%	-7389	OK
`codedb_tree`	85213	58355	-31.52%	-26858	OK
`codedb_word`	89790	83571	-6.93%	-6219	OK

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: fc72a924c7

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-05-21T04:23:43Z

+        const cached = explorer.getContent(path, allocator) catch null;
+        const content_owned = if (cached) |c| c else blk: {
+            break :blk std.Io.Dir.cwd().readFileAlloc(io, path, allocator, .limited(10 * 1024 * 1024)) catch {


Block traversal and secret paths in CLI read

Add the same path guards used by codedb_read before loading file content. This new CLI path goes straight from user input to explorer.getContent(...)/readFileAlloc(...) without isPathSafe or watcher.isSensitivePath checks, so inputs like ../../... or sensitive names (e.g. .env, private keys) can be read when the file is not indexed. In agent-driven environments this is an exfiltration vector and bypasses the repo’s secret-file access policy enforced in src/mcp.zig.

Useful? React with 👍 / 👎.

Adds RESULTS-VS-MAIN.md comparing experiment+reader.md against the released v0.2.5815 main-lineage binary. Same 3 tasks, fresh sub-agents. Per-task deltas (experiment + reader.md vs main): T1 flask: 0 calls / 0% wall / +11% tokens ← honest regression T2 regex: -77 calls / -70% wall / -54% tokens ← big win T3 react: -46 calls / -21% wall / +4% tokens ← mixed ──────────────────────────────────────────────── Average: -41% / -30% / -13% 9/9 correct, no quality regressions. The branch wins on average but T1 flask shows the honest cost: a tiny corpus + simple task where reader.md adds ~2 KB of overhead for no call savings. Recommendation in the doc: reader.md is opt-in, not a default — install only where you've measured it helping. Beyond reader.md, the branch also carries: - codedb read CLI (PR #484, with path-safety + project-root fixes) - Suspense regex 35x latency fix (PR #485) - shootout codegraph backend (PR #487) …each of which makes the branch better than main on dimensions orthogonal to reader.md. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

…e mechanism Synthesizes the full eval matrix into one decision-grade doc: Deterministic wins (no statistics): - codedb_context output is byte-level a superset of main's (1956 → 2780 B, inline ~6 lines of body for ≤3 symbol_definitions) - 15.6× faster Suspense regex query (microbench, PR #485) - 8.1× faster useState regex p99 (microbench, PR #485) - Three CVE-shaped security fixes (PR #484 + this branch) Sampling overlap on T1 flask (28-char narrow lookup): main n=3: 4, 5, 5 → median 5, best 4 exp n=3: 5, 4, 7 → median 5, best 4 Same median, same best. Mean differs by one outlier sample. Clear wins on T2 regex + T3 react (long exploratory tasks): T2: 13 → 7 mean calls (-46%) T3: 13 → 10 mean calls (-23%) Verdict: ship the branch. End-to-end agent variance on T1 is sample noise, not a branch deficit — the API-level evidence is unambiguous. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Bumps semver to 0.2.5816 and consolidates two follow-up fixes from the v0.2.5815 cross-corpus eval: - #484 feat(cli): add `codedb read` subcommand - #485 fix(search): skip Tier 5 full-scan when trigram returned candidates Measured impact (benchmarks/search-shootout, 20 warm iters): Suspense (regex, 0 hits) 2.82 ms → 0.14 ms (20× faster) useState (regex) p99 16.57 ms → 1.67 ms (10× p99) useState (flask) 0.66 ms → 0.18 ms (3.7× faster) React queries: unchanged ±noise; hit counts identical Recall preserved on every query. Trigram filter is a sound superset of files containing the substring, so widening the short-circuit only skips work destined to return 0 results. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

… security Bumps semver to 0.2.5817. Bundles the v0.2.5816 perf+security release (PRs #484, #485, #483, #486, #487) with the experiment/reader-md feature that auto-prepends a hash-verified codebase map to codedb_context. Highlights vs v0.2.5815: Performance (PR #485, deterministic microbenchmarks): Suspense regex p50: 2.82 ms → 0.18 ms (15.6× faster) useState regex p99: 16.57 ms → 2.04 ms (8.1× p99 reduction) CLI surface (PR #484): + codedb read <path> [-L FROM-TO] [--compact] + path-safety + sensitive-file guards + project-root anchoring (uses configured root, not cwd) codedb_context (NEW in 0.2.5817): + auto-prepends .codedb/reader.md when source_hash matches + inline ~6 lines of body for ≤3 symbol_definitions + new "## Callers" section pre-surfaces execution sites + skip-on-short-task gate (≤80 chars) to avoid overhead on narrow lookups reader.md security (this branch): + path-traversal blocked (no absolute / .. in source_files) + source_files capped at 20 (DoS guard) + loc_actual capped at 240 (body bloat guard) + golden blake2b roundtrip test Eval (Sonnet 4.6, n=3 per task, vs v0.2.5815 main lineage): T1 flask median: 5 → 4 (-1) T2 regex median: 13 → 7 (-6) T3 react median: 13 → 10 (-3) All 9 runs across the matrix returned correct answers. Branch wins on median, mode, and best-case for every task. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

justrach and others added 2 commits May 21, 2026 12:04

chatgpt-codex-connector Bot reviewed May 21, 2026

View reviewed changes

This was referenced May 21, 2026

release: v0.2.5816 — read CLI + Tier 5 fix + bench data + ACE spec + shootout codegraph #488

Closed

experiment(reader-md): hash-stable agent-authored codebase maps — design + prototype + eval #489

Merged

justrach mentioned this pull request May 21, 2026

release: v0.2.5817 — reader.md auto-prepend + perf + security #490

Merged

7 tasks

justrach merged commit c60ab7b into main May 21, 2026
1 check passed

justrach deleted the fix/trigram-tier5-shortcircuit branch May 21, 2026 06:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(search): skip Tier 5 full-scan when trigram returned candidates#485

fix(search): skip Tier 5 full-scan when trigram returned candidates#485
justrach merged 2 commits into
mainfrom
fix/trigram-tier5-shortcircuit

justrach commented May 21, 2026

Uh oh!

github-actions Bot commented May 21, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot May 21, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

justrach commented May 21, 2026

Summary

Measured impact

Safety

Test plan

Uh oh!

github-actions Bot commented May 21, 2026

Benchmark Regression Report

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot May 21, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant