Hermes Agent — BTRFS + SQLite WAL Fix (v5)

Problem

SQLite in WAL mode experiences corruption on filesystems with Copy-on-Write semantics (BTRFS, ZFS):

sqlite3.OperationalError: disk I/O error
PRAGMA integrity_check: Tree 16 page 16 cell 0: 2nd reference to page 77

These errors cause silent database corruption — corrupted B-tree indexes, wrong index entry counts, and gateway crashes.

Root Cause

WAL mode relies on shared memory (-shm files) and sequential writes
On BTRFS/ZFS, COW operations cause transient blocking during checkpoint races — manifesting as disk i/O error
Under high concurrency (multiple gateway processes), writers compete for database locks — manifesting as database is locked
The default SQLite timeout (1ms) is insufficient for these scenarios
WAL retry logic handles transient errors but cannot fix structural corruption

Solution (v5)

Proactive BTRFS detection + protected fallback:

_is_on_btrfs() — Detects BTRFS via /proc/self/mountinfo
Skip WAL on BTRFS — Force DELETE journal mode from the start
_WAL_TRANSIENT_MARKERS — Distinguishes transient from permanent errors
_try_fallback_delete() — Protected DELETE fallback that doesn't crash
Env var configuration — HERMES_SQLITE_BUSY_TIMEOUT, HERMES_SQLITE_WAL_RETRIES, HERMES_SQLITE_WAL_RETRY_DELAY

Testing

45 concurrent operations (readers + writers), 0 errors, 0.51s
PRAGMA integrity_check passes after fix
Kanban dispatcher runs without errors on BTRFS

Installation

# Option 1: git am (preferred)
git -C ~/.hermes/hermes-agent am btrfs-sqlite-fix.patch

# Option 2: patch directly
patch -p1 -d ~/.hermes/hermes-agent < btrfs-sqlite-fix.patch

Status

Not in upstream (NousResearch/hermes-agent)
Issue: #30846
Patch based on: origin/main (updated regularly)

Files Changed

File	Changes
`hermes_state.py`	`_is_on_btrfs()`, `_WAL_TRANSIENT_MARKERS`, `_try_fallback_delete()`, env vars, proactive BTRFS detection
`hermes_cli/kanban_db.py`	Pass `db_path` to `apply_wal_with_fallback()`
`tools/terminal_tool.py`	`_safe_getcwd()` helper

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
.github/workflows		.github/workflows
README.md		README.md
apply-btrfs-fix.sh		apply-btrfs-fix.sh
btrfs-sqlite-fix.patch		btrfs-sqlite-fix.patch
test_btrfs_fix.py		test_btrfs_fix.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Hermes Agent — BTRFS + SQLite WAL Fix (v5)

Problem

Root Cause

Solution (v5)

Testing

Installation

Status

Files Changed

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Hermes Agent — BTRFS + SQLite WAL Fix (v5)

Problem

Root Cause

Solution (v5)

Testing

Installation

Status

Files Changed

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages