fix(node): tune autovacuum on db-sync hot tables for stable query plans by scottbuckel · Pull Request #1434 · midnightntwrk/midnight-node

scottbuckel · 2026-04-27T20:29:13Z

Lower autovacuum_analyze_scale_factor from the PostgreSQL default of 0.1 to 0.01 on the cardano-db-sync tables that midnight-node queries (block, tx, tx_out, tx_in, ma_tx_out, datum). With the default 10% growth threshold, autoanalyze rarely, if ever, fires on large append-heavy tables, which leaves the planner on stale statistics and produces extreme worst-case plans for the cnight-observation lookups (queries of more than 400 s were observed on otherwise idle preview/preprod DBs).
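To make the effect of the scale factor concrete, here is a small worked example of the PostgreSQL autoanalyze trigger formula. The row count is a made-up figure chosen for illustration, not a measurement from any db-sync database, and the function name is hypothetical.

```python
# PostgreSQL schedules autoanalyze for a table once the rows inserted,
# updated, or deleted since the last ANALYZE exceed:
#   autovacuum_analyze_threshold + autovacuum_analyze_scale_factor * reltuples
# The base threshold (autovacuum_analyze_threshold) defaults to 50 rows.


def autoanalyze_trigger_rows(reltuples: int, scale_factor: float,
                             base_threshold: int = 50) -> int:
    """Return how many rows must change before autoanalyze is scheduled."""
    return base_threshold + round(scale_factor * reltuples)


# ASSUMPTION: hypothetical table size, used only to illustrate the formula.
ROWS = 100_000_000

default_trigger = autoanalyze_trigger_rows(ROWS, 0.1)   # PostgreSQL default
tuned_trigger = autoanalyze_trigger_rows(ROWS, 0.01)    # value set by this PR

print(f"default (0.1): {default_trigger:,} changed rows before autoanalyze")
print(f"tuned (0.01):  {tuned_trigger:,} changed rows before autoanalyze")
```

On a table of that assumed size, the tuned setting asks for roughly a tenth as many changed rows before statistics are refreshed.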

Applied alongside the existing index creation in create_cnight_observation_indexes; idempotent.
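As a rough illustration of what the per-table tuning could look like in SQL, the sketch below only builds the statement strings. It is not the code from this PR (the node itself is not written in Python); the helper name and constant are hypothetical, and only the table list and the 0.01 value come from the description above.

```python
# Tables named in the PR description.
HOT_TABLES = ("block", "tx", "tx_out", "tx_in", "ma_tx_out", "datum")


def autovacuum_tuning_statements(tables=HOT_TABLES, scale_factor=0.01):
    """Build one ALTER TABLE statement per table (hypothetical helper).

    ALTER TABLE ... SET (storage_parameter = value) overrides the setting for
    that table only. Running it again with the same value leaves the table's
    options unchanged, so re-applying the statements is harmless.
    """
    return [
        f"ALTER TABLE {table} SET (autovacuum_analyze_scale_factor = {scale_factor});"
        for table in tables
    ]


statements = autovacuum_tuning_statements()
for statement in statements:
    print(statement)
```

The statements assume the tables are reachable on the connection's search_path; a schema-qualified name would be needed otherwise.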

Refs: #1298

Overview

🗹 TODO before merging

Ready

📌 Submission Checklist

All commits are signed off (git commit -s) for the DCO
Changes are backward-compatible (or flagged if breaking)
Pull request description explains why the change is needed
Self-reviewed the diff
I have included a change file, or skipped for this reason:
If the changes introduce a new feature, I have bumped the node minor version
Update documentation (if relevant)
Updated AGENTS.md if build commands, architecture, or workflows changed
No new todos introduced

🧪 Testing Evidence

Mainnet sync — before vs after autovacuum tuning

Before (default autovacuum_analyze_scale_factor = 0.1 on cexplorer_mainnet, schema otherwise identical, all indexes from create_cnight_observation_indexes present):

2026-04-27 15:54:56  INFO substrate: ⚙️  Syncing  0.0 bps, target=#569462 (9 peers), best: #27
2026-04-27 15:55:01  INFO substrate: ⚙️  Syncing  0.0 bps, target=#569463 (9 peers), best: #27
2026-04-27 15:55:05  WARN sqlx::query: slow statement ... elapsed=432.22s slow_threshold=1s
2026-04-27 15:55:11  INFO substrate: ⚙️  Syncing  0.0 bps, target=#569464 (9 peers), best: #27

best stayed stuck at #27 for many intervals, and one query took 432 s. pg_stat_user_tables showed that tx_out and ma_tx_out had never been autoanalyzed: last_autoanalyze was NULL and analyze_count=1, from the initial schema load.
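A check along these lines can be sketched as follows. The query uses standard pg_stat_user_tables columns; the helper name and the sample rows are hypothetical stand-ins (only the NULL last_autoanalyze on tx_out and ma_tx_out mirrors the observation above, the block row is invented for contrast).

```python
# Query one might run against the db-sync database to inspect statistics age.
STATS_QUERY = """
SELECT relname, n_live_tup, n_mod_since_analyze,
       last_autoanalyze, analyze_count, autoanalyze_count
FROM pg_stat_user_tables
WHERE relname IN ('block', 'tx', 'tx_out', 'tx_in', 'ma_tx_out', 'datum')
ORDER BY relname;
"""


def never_autoanalyzed(rows):
    """Return the names of tables whose last_autoanalyze is NULL (None)."""
    return [row["relname"] for row in rows if row["last_autoanalyze"] is None]


# ASSUMPTION: illustrative rows shaped like the query result, not real data.
sample_rows = [
    {"relname": "tx_out", "last_autoanalyze": None},
    {"relname": "ma_tx_out", "last_autoanalyze": None},
    {"relname": "block", "last_autoanalyze": "placeholder-timestamp"},
]

stale = never_autoanalyzed(sample_rows)
print(stale)
```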

After (this PR's tuning applied via ALTER TABLE + one-shot vacuumdb --analyze-only to backfill stats):

2026-04-27 16:32:27  INFO substrate: ⚙️  Syncing 38.5 bps, target=#569837 (8 peers), best: #67126
2026-04-27 16:32:32  INFO substrate: ⚙️  Syncing 48.9 bps, target=#569838 (8 peers), best: #67371
2026-04-27 16:34:22  INFO substrate: ⚙️  Syncing 43.9 bps, target=#569856 (9 peers), best: #72402

best advanced from #66933 to #72402 in 2 minutes: ~5500 blocks / 120 s ≈ 46 bps sustained, versus 0.0 before. There was one slow-statement warning across the entire run, and it took 2.6 s rather than 400+ s.
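The sustained rate can be cross-checked from the quoted log lines alone. The sketch below parses the first and last "After" lines shown above and divides the block delta by the elapsed time; the function names are hypothetical, and the start point differs slightly from the #66933 figure in the text, which is not part of the quoted excerpt.

```python
import re
from datetime import datetime

# First and last "After" log lines, copied from the excerpt above.
LOG_LINES = [
    "2026-04-27 16:32:27  INFO substrate: ⚙️  Syncing 38.5 bps, target=#569837 (8 peers), best: #67126",
    "2026-04-27 16:34:22  INFO substrate: ⚙️  Syncing 43.9 bps, target=#569856 (9 peers), best: #72402",
]

# Capture the leading timestamp and the block number after "best: #".
PATTERN = re.compile(r"^(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}).*best: #(\d+)")


def parse_line(line):
    """Return (timestamp, best_block) parsed from one sync log line."""
    match = PATTERN.search(line)
    if match is None:
        raise ValueError(f"unrecognised log line: {line!r}")
    timestamp = datetime.strptime(match.group(1), "%Y-%m-%d %H:%M:%S")
    return timestamp, int(match.group(2))


def average_bps(first_line, last_line):
    """Blocks per second between two log lines."""
    t0, b0 = parse_line(first_line)
    t1, b1 = parse_line(last_line)
    return (b1 - b0) / (t1 - t0).total_seconds()


rate = average_bps(LOG_LINES[0], LOG_LINES[-1])
print(f"{rate:.1f} bps")
```

Over this window the delta is 5276 blocks in 115 s, which agrees with the ≈ 46 bps figure.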

Tracking

Upstream issue: Very slow initial sync (0.2 BPS) #1298
SRE side ticket (mitigation already applied to scotty): shieldedtech/shielded-sre#181

Please describe any additional testing aside from CI:

Additional tests are provided (if possible)

🔱 Fork Strategy

Node Runtime Update
Node Client Update
Other:
N/A

Links

Signed-off-by: Scott Buckel <scott.buckel@shielded.io>

Signed-off-by: Justin Frevert <justinfrevert@gmail.com>

justinfrevert

Nice improvement! Appreciate the additional testing done as well.

…ns (#1434)

* fix(node): tune autovacuum on db-sync hot tables for stable query plans

Lower autovacuum_analyze_scale_factor from the postgres default of 0.1 to 0.01 on the cardano-db-sync tables midnight-node queries (block, tx, tx_out, tx_in, ma_tx_out, datum). The default 10% growth threshold means autoanalyze never fires for big append-heavy tables, leaving the planner on stale statistics and producing extreme worst-case plans (observed >400s queries on otherwise idle preview/preprod DBs) for the cnight-observation lookups.

Applied alongside the existing index creation in create_cnight_observation_indexes, idempotent.

Refs: #1298

Signed-off-by: Scott Buckel <scott.buckel@shielded.io>

* change file

Signed-off-by: Justin Frevert <justinfrevert@gmail.com>

---------

Signed-off-by: Scott Buckel <scott.buckel@shielded.io>
Signed-off-by: Justin Frevert <justinfrevert@gmail.com>
Co-authored-by: Justin Frevert <justinfrevert@gmail.com>

scottbuckel requested a review from a team as a code owner April 27, 2026 20:29

scottbuckel mentioned this pull request Apr 27, 2026

Very slow initial sync (0.2 BPS) #1298

Closed

change file

589111c

Signed-off-by: Justin Frevert <justinfrevert@gmail.com>

justinfrevert approved these changes Apr 28, 2026

justinfrevert added this pull request to the merge queue Apr 28, 2026

Merged via the queue into main with commit df76da9 Apr 28, 2026
37 checks passed

justinfrevert deleted the scott-cnight-observation-autovacuum-tuning branch April 28, 2026 00:51

gilescope added the sync-performance label Apr 28, 2026

justinfrevert merged 2 commits into main from scott-cnight-observation-autovacuum-tuning

scottbuckel commented Apr 27, 2026 • edited by justinfrevert

3 participants
