DuckDB checkpoint optimization and timing fix by erikdarlingdata · Pull Request #159 · erikdarlingdata/PerformanceMonitor

erikdarlingdata · 2026-02-19T19:23:32Z

Summary

- Add checkpoint_threshold=1GB to DuckDB connection string to prevent auto-checkpoint stalls during writes
- Add manual CHECKPOINT after each collection cycle during idle time (15-32ms vs 3,920ms mid-write)
- Change collector execution from parallel Task.WhenAll to sequential per-server to eliminate DuckDB write contention
- Fix using var timing bug across all 16 collector files: appender Dispose (flush + connection close) is now captured inside the DuckDB stopwatch for accurate collection_log timing
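The timing bug in the last item can be reproduced without DuckDB. The following is a minimal sketch, not the project's collector code: `SlowAppender` is a hypothetical stand-in whose `Dispose` sleeps for 200 ms to simulate the flush and connection close. With `using var`, disposal happens when the method scope ends, after the stopwatch has stopped; with an explicit `using` block, disposal happens before `Stop()`.

```csharp
using System;
using System.Diagnostics;
using System.Threading;

// Hypothetical stand-in for a DuckDB appender: the expensive work happens in Dispose.
sealed class SlowAppender : IDisposable
{
    public void AppendRow() { /* rows are only buffered here, so this is cheap */ }
    public void Dispose() => Thread.Sleep(200); // simulated flush + connection close
}

static class TimingDemo
{
    // Buggy pattern: `using var` disposes at the end of the method,
    // so the flush cost is not included in the measured time.
    public static long WithUsingVar()
    {
        var sw = Stopwatch.StartNew();
        using var appender = new SlowAppender();
        appender.AppendRow();
        sw.Stop();
        return sw.ElapsedMilliseconds;
    }

    // Fixed pattern: the explicit block disposes before Stop(),
    // so the flush cost is included in the measured time.
    public static long WithUsingBlock()
    {
        var sw = Stopwatch.StartNew();
        using (var appender = new SlowAppender())
        {
            appender.AppendRow();
        }
        sw.Stop();
        return sw.ElapsedMilliseconds;
    }

    public static void Main()
    {
        Console.WriteLine($"using var:   {WithUsingVar()} ms");
        Console.WriteLine($"using block: {WithUsingBlock()} ms");
    }
}
```

The first figure should come out close to zero, while the second includes the simulated flush, which is the cost that collection_log was previously missing.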

Background

DuckDB's default 16MB WAL auto-checkpoint threshold was triggering 2-3 second stop-the-world pauses mid-write. Combined with database bloat (3.8GB on disk for 35MB of real data), collectors were taking 3-7 seconds each and driving CPU to 50-60%. After these changes plus a manual database compaction, collectors run in 100-400ms and CPU usage is back to normal.
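As a rough sketch of the two checkpoint changes, assuming the DuckDB.NET.Data NuGet package and assuming the option is passed through the connection string as the summary describes: the file name `monitor.duckdb` is a placeholder, and the real collector code is not shown in this PR page.

```csharp
using System;
using System.Diagnostics;
using DuckDB.NET.Data; // assumption: DuckDB.NET.Data NuGet package is referenced

static class CheckpointSketch
{
    public static void Main()
    {
        // Placeholder path; checkpoint_threshold raises the WAL size that triggers
        // an automatic checkpoint so it is unlikely to fire in the middle of a write.
        const string connectionString =
            "Data Source=monitor.duckdb;checkpoint_threshold=1GB";

        using var connection = new DuckDBConnection(connectionString);
        connection.Open();

        // Confirm the setting took effect.
        using (var cmd = connection.CreateCommand())
        {
            cmd.CommandText = "SELECT current_setting('checkpoint_threshold')";
            Console.WriteLine($"checkpoint_threshold = {cmd.ExecuteScalar()}");
        }

        // ... collectors would write their rows here ...

        // Manual checkpoint once the cycle is finished and the database is idle.
        var sw = Stopwatch.StartNew();
        using (var cmd = connection.CreateCommand())
        {
            cmd.CommandText = "CHECKPOINT";
            cmd.ExecuteNonQuery();
        }
        sw.Stop();
        Console.WriteLine($"CHECKPOINT took {sw.ElapsedMilliseconds} ms");
    }
}
```

The idea is to move the checkpoint cost from an unpredictable point during a write to a known idle point after the cycle.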

Test plan

- dotnet build -c Debug: 0 warnings, 0 errors
- Verified checkpoint_threshold reads as 953.6 MiB in DuckDB config (1GB = 10^9 bytes)
- Collectors run 100-400ms per cycle (was 3-7 seconds)
- CHECKPOINT completes in 15-32ms (was 3,920ms)
- Full collection cycle ~4 seconds for 15 collectors (was 50+ seconds)
- CPU usage returned to normal

🤖 Generated with Claude Code

- Add checkpoint_threshold=1GB to connection string to prevent auto-checkpoint stalls
- Add manual CHECKPOINT after each collection cycle during idle time
- Change collector execution from parallel Task.WhenAll to sequential per-server
- Fix using var timing bug across all 16 collector files: change to explicit using blocks so appender Dispose (flush + connection close) is captured inside the DuckDB stopwatch, giving accurate timing in collection_log

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
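The change from Task.WhenAll to sequential execution might look roughly like the sketch below. `CollectAsync`, the server name, and the collector names are hypothetical, and the exact per-server scheduling in the project may differ; the point is only that the sequential version never has two collectors writing at the same time.

```csharp
using System;
using System.Collections.Generic;
using System.Threading.Tasks;

static class SchedulingSketch
{
    // Hypothetical collector: stands in for one collector's write to DuckDB.
    static async Task CollectAsync(string server, string collector)
    {
        await Task.Delay(50); // simulated query + append
        Console.WriteLine($"{server}/{collector} done");
    }

    // Before: every collector starts immediately, so their writes overlap.
    static Task RunParallelAsync(string server, IEnumerable<string> collectors)
    {
        var tasks = new List<Task>();
        foreach (var collector in collectors)
        {
            tasks.Add(CollectAsync(server, collector));
        }
        return Task.WhenAll(tasks);
    }

    // After: each collector is awaited before the next one starts.
    static async Task RunSequentialAsync(string server, IEnumerable<string> collectors)
    {
        foreach (var collector in collectors)
        {
            await CollectAsync(server, collector);
        }
    }

    public static async Task Main()
    {
        var collectors = new[] { "wait_stats", "cpu", "memory" };
        await RunParallelAsync("server1", collectors);
        await RunSequentialAsync("server1", collectors);
    }
}
```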

Three fixes for "DuckDBOpen failed: Cannot open file" errors introduced by PR #159 (checkpoint) and PR #162 (compaction):

1. Timer initialization: DateTime.MinValue → DateTime.UtcNow prevents compaction/archival from firing on the very first collection cycle
2. Inline checkpoint: moved CHECKPOINT to end of RunDueCollectorsAsync using the existing connection pool instead of opening a separate DuckDB instance that conflicts via OS file locks
3. Atomic file swap: replaced two-step File.Move in CompactAsync with File.Replace (single OS operation, no window where the database file is missing) plus retry logic for locked files and WAL cleanup

Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
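The atomic swap in the third fix can be sketched as follows. This is an illustration under assumptions, not the CompactAsync implementation: `ReplaceWithRetry`, the attempt count, the backoff, and the file names are all hypothetical, and WAL cleanup is left out.

```csharp
using System;
using System.IO;
using System.Threading;

static class SwapSketch
{
    // Swap the compacted file over the live one, retrying while the file is locked.
    public static void ReplaceWithRetry(string compactedPath, string livePath, int maxAttempts = 5)
    {
        for (var attempt = 1; ; attempt++)
        {
            try
            {
                // Unlike two File.Move calls, there is no step at which livePath is absent.
                File.Replace(compactedPath, livePath, destinationBackupFileName: null);
                return;
            }
            catch (IOException) when (attempt < maxAttempts)
            {
                Thread.Sleep(100 * attempt); // simple linear backoff before retrying
            }
        }
    }

    public static void Main()
    {
        // Demonstrate on throwaway files in a temporary directory.
        var dir = Path.Combine(Path.GetTempPath(), Guid.NewGuid().ToString("N"));
        Directory.CreateDirectory(dir);
        var live = Path.Combine(dir, "live.db");
        var compacted = Path.Combine(dir, "compacted.db");
        File.WriteAllText(live, "old");
        File.WriteAllText(compacted, "new");

        ReplaceWithRetry(compacted, live);

        Console.WriteLine(File.ReadAllText(live)); // contents that were in compacted.db
        Console.WriteLine(File.Exists(compacted)); // the source file no longer exists
        Directory.Delete(dir, recursive: true);
    }
}
```

File.Replace requires the destination file to exist already, which holds for a live database being compacted. Once the last attempt fails, the exception filter no longer matches and the IOException propagates to the caller.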

erikdarlingdata merged commit 2b54d43 into dev Feb 19, 2026
3 checks passed

This was referenced Feb 19, 2026

Lite: Add scheduled database compaction to prevent DuckDB bloat #161

Closed

Fix DuckDB IO errors from concurrent file access #164

Merged

erikdarlingdata deleted the feature/duckdb-checkpoint-optimization branch February 20, 2026 13:20

erikdarlingdata merged 1 commit into dev from feature/duckdb-checkpoint-optimization
