fix: suppress repeated corrupt kanban backups #33319

Closed
Oceanswave wants to merge 1 commit into NousResearch:main from Oceanswave:fix/kanban-corrupt-db-retry

Conversation

@Oceanswave commented May 27, 2026

Summary

  • make KanbanDbCorruptError a subclass of sqlite3.DatabaseError so the gateway dispatcher's corrupt-board handling catches it and can disable the board
  • cache corrupt DB backups per unchanged file fingerprint to avoid writing repeated .corrupt backups for the same file
  • skip already-disabled corrupt boards in the dispatcher's ready-queue health probes
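
A minimal sketch of the first and third bullets, not the code from this PR: only the names KanbanDbCorruptError and sqlite3.DatabaseError come from the summary above. The constructor arguments, `disabled_boards`, `probe_board`, and the `open_db` callable are hypothetical stand-ins for the dispatcher's real state and helpers.

```python
import sqlite3


class KanbanDbCorruptError(sqlite3.DatabaseError):
    """Corrupt-board error; the constructor signature here is an assumption."""

    def __init__(self, db_path, detail=""):
        super().__init__(f"corrupt kanban DB at {db_path}: {detail}")
        self.db_path = db_path


# Hypothetical dispatcher state: boards already disabled because of corruption.
disabled_boards = set()


def probe_board(board, open_db):
    """Return True if the board opened cleanly, False if it is (or becomes) disabled."""
    if board in disabled_boards:
        # Already disabled: skip the probe so the DB is not reopened on every tick.
        return False
    try:
        open_db(board)
    except sqlite3.DatabaseError:
        # Because KanbanDbCorruptError subclasses sqlite3.DatabaseError,
        # this generic handler catches it and disables the board.
        disabled_boards.add(board)
        return False
    return True
```

With this shape, a handler written against sqlite3.DatabaseError needs no change to cover the custom error, and a disabled board costs one set lookup per tick.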

Tests

  • python -m pytest tests/hermes_cli/test_kanban_db.py tests/hermes_cli/test_kanban_notify.py -q

Operational context

Observed a malformed board DB causing repeated backup/log spam from multiple gateway tick paths. This preserves the first backup for a specific corrupt file fingerprint and retries only after the DB file changes.
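
The backup behavior described above can be sketched roughly as follows. This is an illustration under stated assumptions rather than the PR's implementation: the PR does not say what the fingerprint contains, so size plus modification time is assumed here, and `file_fingerprint`, `backup_corrupt_db_once`, the in-memory `_backed_up` cache, and the backup file name are all hypothetical.

```python
import os
import shutil

# Hypothetical in-process cache: DB path -> fingerprint of the file last backed up.
_backed_up = {}


def file_fingerprint(path):
    """Cheap fingerprint (assumed): file size plus modification time in nanoseconds."""
    st = os.stat(path)
    return (st.st_size, st.st_mtime_ns)


def backup_corrupt_db_once(path):
    """Copy the corrupt DB aside once per fingerprint.

    Returns the backup path, or None when this exact file was already backed up.
    """
    fp = file_fingerprint(path)
    if _backed_up.get(path) == fp:
        return None  # file unchanged since the last backup: nothing new to preserve
    backup = f"{path}.corrupt.{fp[0]}-{fp[1]}.bak"  # hypothetical naming scheme
    shutil.copy2(path, backup)
    _backed_up[path] = fp
    return backup
```

Calling `backup_corrupt_db_once` from every tick path then produces at most one backup until the DB file itself changes. A cache held only in memory would reset on process restart; whether the PR persisted it is not stated.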

Fixes #32593

@Oceanswave (Author)

Closing because the upstream database hardening changes have now landed, so this duplicate Kanban corruption-backup retry PR is no longer needed.

@Oceanswave closed this May 28, 2026

Labels

  • comp/cli: CLI entry point, hermes_cli/, setup wizard
  • comp/gateway: Gateway runner, session dispatch, delivery
  • P3: Low (cosmetic, nice to have)
  • type/bug: Something isn't working

Development

Successfully merging this pull request may close these issues.

Kanban dispatcher writes a new .corrupt.bak on every tick after corruption (7,861 backups / 1.7 GB over 37h)