meta: LCM Performance & Cache Optimization Sprint — All PRs

## Overview

This issue tracks the complete performance and cache optimization sprint for lossless-claw, covering 5 PRs across 4 issues. The work addresses database lock races, unnecessary cache-busting compaction, SQLite tuning, compaction read/write path optimization, and search quality improvements.

## PRs (in merge order)

| PR | Issue | What | Status | CI |
|----|-------|------|--------|-----|
| [#288](https://github.com/Martian-Engineering/lossless-claw/pull/288) | #287 | **DB lock race fix** — eager-first init with deferred fallback on SQLite lock | Ready | ✅ |
| [#289](https://github.com/Martian-Engineering/lossless-claw/pull/289) | #282 | **Cache-aware compaction** — skip guards + budget pressure priority + tuning guide | Ready | ✅ |
| [#294](https://github.com/Martian-Engineering/lossless-claw/pull/294) | #291 | **SQLite PRAGMAs + indexes** — 64MB cache, sync=NORMAL, missing indexes | Ready | ✅ |
| [#295](https://github.com/Martian-Engineering/lossless-claw/pull/295) | #291 | **Core DB optimization** — context cache, bulk ordinals, delta tracking | Ready | Pending fork CI |
| [#296](https://github.com/Martian-Engineering/lossless-claw/pull/296) | #293 | **Search quality** — configurable BM25 sort, phrase matching fix | Ready | Pending fork CI |

## Total Impact Across All PRs

| Metric | Before | After |
|--------|--------|-------|
| **Startup** | "database is locked" loop on macOS reboot | Fixed — eager-first, defer on lock |
| **Unnecessary compactions** | Every turn on high-traffic conversations | Skipped when gain < 5% or under budget ceiling |
| **DB operations per compaction sweep** | 120-160 | ~30-40 (75% reduction) |
| **Write lock per compaction pass** | ~30ms (2×N individual UPDATEs) | ~1.5ms (2 bulk UPDATEs) |
| **SQLite page cache** | 2MB (default) | 64MB (demand-allocated) |
| **Write sync** | FULL (fsync every commit) | NORMAL (~50% faster writes) |
| **Search results** | Recency only | Configurable: recency / relevance / hybrid |
| **Phrase matching** | Broken (\`"error handling"\` → two terms) | Fixed (preserved as phrase) |
| **Expansion model docs** | Not documented | Full guide with per-tier configs + cost analysis |
| **Cache miss cost awareness** | None | Skip guards prevent wasteful cache-busting |

## Architecture Decisions

### Cache-aware compaction (PR #289)
Three-tier decision logic: basic threshold → budget headroom → cache-aware reduction. Budget pressure always overrides cache concerns (prevents starvation). Per-pass chunk estimation. Configurable via \`leafSkipReductionThreshold\` and \`leafBudgetHeadroomFactor\`.

### DB optimization (PRs #294 + #295)
- **Context cache**: per-phase Map with reference counting for concurrent safety. Invalidated after each write.
- **Delta tracking**: \`tokensAfter = tokensBefore - removedTokens + addedTokens\`. Mathematically proven equivalent. leafPass/condensedPass return deltas.
- **Ordinal resequencing**: rank-based correlated subquery handles gaps. 2 bulk UPDATEs instead of 2×N.
- **PRAGMAs**: cache_size=64MB, synchronous=NORMAL, temp_store=MEMORY, optimize on close.

### Search quality (PR #296)
Configurable \`sort\` parameter on \`lcm_grep\`: recency (default) / relevance (FTS5 BM25) / hybrid. Preserves LCM's design intent (DAG = relevance filter, recency = routing default) while enabling relevance for long-history users.

## Documentation Delivered

- **[Compaction Tuning Guide](docs/compaction-tuning.md)** — TLDR + engineer deep-dive + per-tier configs + economics
- **Architecture updates** — cache-aware guards, Mermaid diagrams, prompt cache impact
- **Configuration updates** — all new settings documented with env vars
- **Expansion model cost** — documented the hidden \`lcm_expand_query\` sub-agent token cost
- **Model comparison table** — 10 models with exact per-call costs and context windows
- **References section** — LCM paper, CMV paper, demand-paging paper, cache docs

## Issues

| Issue | Status |
|-------|--------|
| [#282](https://github.com/Martian-Engineering/lossless-claw/issues/282) | Fixed by PR #289 |
| [#287](https://github.com/Martian-Engineering/lossless-claw/issues/287) | Fixed by PR #288 |
| [#291](https://github.com/Martian-Engineering/lossless-claw/issues/291) | Fixed by PRs #294 + #295 (PRAGMA + core optimizations) |
| [#292](https://github.com/Martian-Engineering/lossless-claw/issues/292) | Open — sub-agent team coordination (feature proposal) |
| [#293](https://github.com/Martian-Engineering/lossless-claw/issues/293) | Fixed by PR #296 |

## Recommended Merge Order

1. **#288** first (DB lock fix — no dependencies)
2. **#294** second (PRAGMAs — no code dependencies, just config)
3. **#289** third (cache-aware compaction — the largest PR, 21 commits)
4. **#295** fourth (DB optimizations — touches compaction.ts which #289 also touches)
5. **#296** last (search — independent but smallest scope)

PRs #295 and #289 may have merge conflicts in \`compaction.ts\` since both modify \`evaluateLeafTrigger\`. Resolve by keeping both sets of changes — #289 adds skip guards, #295 adds caching.

## Test Coverage

| PR | New Tests | Total Pass |
|----|-----------|------------|
| #289 | 21 (16 skip logic + 5 config) | 239/239 |
| #294 | 0 (config-only) | 561/561 |
| #295 | 0 (perf-only, existing tests cover correctness) | 169/169 |
| #296 | 0 (small scope, existing search tests cover) | 202/202 |

## Review History

Each PR went through multiple adversarial review rounds:
- **#289**: 5 Copilot reviews + 5 adversarial agents + 3 research agents + quality audit
- **#294**: 1 Copilot review, all comments resolved
- **#295**: 1 Copilot review with 4 comments, all resolved (concurrency, types, delta accuracy, ordinal gaps)
- **#296**: Based on deep LCM architecture analysis + paper research

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

meta: LCM Performance & Cache Optimization Sprint — All PRs #297

Overview

PRs (in merge order)

Total Impact Across All PRs

Architecture Decisions

Cache-aware compaction (PR #289)

DB optimization (PRs #294 + #295)

Search quality (PR #296)

Documentation Delivered

Issues

Recommended Merge Order

Test Coverage

Review History

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

PR	Issue	What	Status	CI
#288	#287	DB lock race fix — eager-first init with deferred fallback on SQLite lock	Ready	✅
#289	#282	Cache-aware compaction — skip guards + budget pressure priority + tuning guide	Ready	✅
#294	#291	SQLite PRAGMAs + indexes — 64MB cache, sync=NORMAL, missing indexes	Ready	✅
#295	#291	Core DB optimization — context cache, bulk ordinals, delta tracking	Ready	Pending fork CI
#296	#293	Search quality — configurable BM25 sort, phrase matching fix	Ready	Pending fork CI

Metric	Before	After
Startup	"database is locked" loop on macOS reboot	Fixed — eager-first, defer on lock
Unnecessary compactions	Every turn on high-traffic conversations	Skipped when gain < 5% or under budget ceiling
DB operations per compaction sweep	120-160	~30-40 (75% reduction)
Write lock per compaction pass	~30ms (2×N individual UPDATEs)	~1.5ms (2 bulk UPDATEs)
SQLite page cache	2MB (default)	64MB (demand-allocated)
Write sync	FULL (fsync every commit)	NORMAL (~50% faster writes)
Search results	Recency only	Configurable: recency / relevance / hybrid
Phrase matching	Broken (`"error handling"` → two terms)	Fixed (preserved as phrase)
Expansion model docs	Not documented	Full guide with per-tier configs + cost analysis
Cache miss cost awareness	None	Skip guards prevent wasteful cache-busting

Issue	Status
#282	Fixed by PR #289
#287	Fixed by PR #288
#291	Fixed by PRs #294 + #295 (PRAGMA + core optimizations)
#292	Open — sub-agent team coordination (feature proposal)
#293	Fixed by PR #296

PR	New Tests	Total Pass
#289	21 (16 skip logic + 5 config)	239/239
#294	0 (config-only)	561/561
#295	0 (perf-only, existing tests cover correctness)	169/169
#296	0 (small scope, existing search tests cover)	202/202

meta: LCM Performance & Cache Optimization Sprint — All PRs #297

Description

Overview

PRs (in merge order)

Total Impact Across All PRs

Architecture Decisions

Cache-aware compaction (PR #289)

DB optimization (PRs #294 + #295)

Search quality (PR #296)

Documentation Delivered

Issues

Recommended Merge Order

Test Coverage

Review History

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions