perf(config): mtime-cache load_config() and read_raw_config() by teknium1 · Pull Request #17041 · NousResearch/hermes-agent

teknium1 · 2026-04-28T14:05:53Z

Summary

load_config() per-call cost drops from 13.3 ms → 0.23 ms (57× faster). A single gateway turn hits the config 5-15 times, so turn overhead from config reads drops from 65-200 ms → 1-3 ms.

Both load_config() and read_raw_config() now cache their result keyed on the config file's (mtime_ns, size). On a cache hit they return a deepcopy of the cached value, skipping yaml.safe_load + deep-merge + normalize + env-var expansion entirely. save_config() + migrate_config() write via atomic_yaml_write which produces a fresh inode, so stat() sees a new mtime_ns and the next load repopulates automatically — no explicit invalidation hook needed.
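The caching scheme described above can be sketched roughly as follows. This is a minimal illustration, not the code in hermes_cli/config.py: the names `_CACHE` and `load_cached` are hypothetical, and `json.loads` stands in for the real yaml.safe_load + merge + normalize pipeline so the example stays stdlib-only.

```python
import copy
import json
import os
from pathlib import Path
from typing import Any, Callable, Dict, Tuple

# Hypothetical module-level cache: path string -> ((mtime_ns, size), parsed value).
_CACHE: Dict[str, Tuple[Tuple[int, int], Any]] = {}


def load_cached(path: Path, parse: Callable[[str], Any] = json.loads) -> Any:
    """Return the parsed file, re-parsing only when (mtime_ns, size) changes."""
    st = os.stat(path)
    stamp = (st.st_mtime_ns, st.st_size)
    entry = _CACHE.get(str(path))
    if entry is None or entry[0] != stamp:
        # Cache miss or stale entry: parse once and remember the new stamp.
        entry = (stamp, parse(path.read_text(encoding="utf-8")))
        _CACHE[str(path)] = entry
    # Always hand out a private copy so callers can mutate it freely.
    return copy.deepcopy(entry[1])
```

Including the size in the key is a cheap extra guard for the case where two writes land within the filesystem's timestamp granularity and leave mtime_ns unchanged.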

Changes

hermes_cli/config.py:
- Added _LOAD_CONFIG_CACHE and _RAW_CONFIG_CACHE dicts keyed on str(config_path).
- load_config() and read_raw_config() consult the cache before doing the expensive work; populate it after.
- Both return copy.deepcopy() on every call so the 67+ call sites that mutate the result (cfg["model"]["default"] = ...) can't corrupt the cache.
gateway/run.py:
- Migrated 6 direct yaml.safe_load(config.yaml) sites through _load_gateway_config().
- _load_gateway_config() now delegates to read_raw_config() when _hermes_home agrees with the canonical config path, falling back to a direct read so tests that monkeypatch gateway_run._hermes_home without touching HERMES_HOME keep working.
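The delegate-or-fall-back decision in _load_gateway_config() might look something like the sketch below. The function name, its parameters, and the assumption that the config lives at `<hermes_home>/config.yaml` are illustrative only; the real function reads module state rather than taking arguments.

```python
from pathlib import Path
from typing import Any, Callable, Dict


def load_gateway_config(
    hermes_home: Path,
    canonical_path: Path,
    cached_reader: Callable[[], Dict[str, Any]],
    direct_reader: Callable[[Path], Dict[str, Any]],
) -> Dict[str, Any]:
    """Use the shared cache only when both views of the config path agree."""
    # Assumed layout: the gateway expects config.yaml directly under its home.
    local_path = hermes_home / "config.yaml"
    if local_path == canonical_path:
        # Paths agree: the cached reader would load the same file anyway.
        return cached_reader()
    # Paths disagree (e.g. a test monkeypatched the home): read directly.
    return direct_reader(local_path)
```

Keeping the direct read as the fallback means a test that only patches the gateway's home directory still sees its own file instead of whatever the canonical path points at.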

Validation

| | Before | After |
|---|---|---|
| `load_config()` per-call | 13.3 ms | 0.23 ms |
| `read_raw_config()` per-call | ~7 ms (est.) | 0.13 ms |
| Gateway turn config overhead (10 reads) | ~130 ms | ~2 ms |
| `tests/hermes_cli/test_config*.py` (112 tests) | ✓ | ✓ |
| `tests/gateway/` (87 targeted tests) | ✓ | ✓ |
| Live smoke: `hermes chat` + `/model` switch + tool calls | — | zero errors |
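The per-turn figures follow directly from the per-call timings multiplied by the number of config reads. A short worked check, using only the numbers reported in this PR (the helper name `turn_overhead_ms` is made up for the example):

```python
COLD_MS = 13.3    # load_config() per call before the change
CACHED_MS = 0.23  # load_config() per call on a cache hit


def turn_overhead_ms(per_call_ms: float, reads: int) -> float:
    """Config-read overhead for one gateway turn, in milliseconds."""
    return per_call_ms * reads


speedup = COLD_MS / CACHED_MS  # roughly 57.8, reported as 57x

for reads in (5, 10, 15):
    before = turn_overhead_ms(COLD_MS, reads)
    after = turn_overhead_ms(CACHED_MS, reads)
    print(f"{reads:>2} reads: {before:6.1f} ms -> {after:4.2f} ms")
```

At 5 and 15 reads this gives about 66 ms and 200 ms before the change versus roughly 1 ms and 3.5 ms after, consistent with the ranges quoted in the summary.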

Safety

- Mutation-safe: load_config() returns a fresh deepcopy on every call, so callers mutating then saving don't corrupt the cache.
- Write-through via mtime: save_config() / atomic_yaml_write bump mtime_ns, naturally invalidating the cache for the next reader. No explicit invalidation calls needed.
- Profile-safe: cache is keyed on str(config_path), so HERMES_HOME profile switches don't collide.
- Test fixture-safe: gateway's _load_gateway_config() falls back to a direct read when _hermes_home is monkeypatched to a path that doesn't match get_config_path().
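The write-through property relies on the writer replacing the file rather than editing it in place. The implementation of atomic_yaml_write is not shown in this PR description; the sketch below assumes the common pattern of writing a temp file in the same directory and swapping it in with `os.replace`, and the name `atomic_write_text` is hypothetical.

```python
import contextlib
import os
import tempfile
from pathlib import Path


def atomic_write_text(path: Path, text: str) -> None:
    """Write text to a sibling temp file, then rename it over the target."""
    fd, tmp_name = tempfile.mkstemp(dir=path.parent, prefix=path.name + ".")
    try:
        with os.fdopen(fd, "w", encoding="utf-8") as handle:
            handle.write(text)
            handle.flush()
            os.fsync(handle.fileno())  # make sure the bytes reach disk first
        # Atomic swap: readers see either the old file or the new one.
        os.replace(tmp_name, path)
    except BaseException:
        # Clean up the temp file if anything failed before the swap.
        with contextlib.suppress(FileNotFoundError):
            os.unlink(tmp_name)
        raise
```

Because the target path now points at the temp file's inode, a later `os.stat()` on the path reports that new file's metadata, which is what lets a stat-keyed cache notice the write without an explicit invalidation call.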

Phase 1 item 2 of the optimization sweep.

load_config() and read_raw_config() now cache their result keyed on the config file's (mtime_ns, size). On a cache hit they return a deepcopy of the cached value, skipping yaml.safe_load + deep-merge + normalize + env-var expansion entirely. save_config() + migrate_config() write via atomic_yaml_write, which produces a fresh inode, so stat() sees a new mtime_ns and the next load repopulates automatically; no explicit invalidation hook is needed.

Measured per-call cost:
- load_config() cold: 13.3 ms
- load_config() cached: 0.23 ms (57x faster)
- read_raw_config() cached: 0.13 ms

A single gateway turn hits the config 5-15 times (session context, auxiliary client resolution, memory config, plugin hooks, approval lookups, per-tool settings). That's 65-200 ms/turn of pure YAML re-parsing on main. After this change: 1-3 ms/turn.

Also migrates gateway/run.py's 6 direct yaml.safe_load(config.yaml) call sites through _load_gateway_config, which now shares the read_raw_config cache when _hermes_home agrees with the canonical config path. The direct-read fallback is retained for tests that monkeypatch gateway_run._hermes_home without touching HERMES_HOME.

Safety:
- load_config() returns a deepcopy on every call; the 67+ call sites that mutate the result (cfg["model"]["default"] = ..., etc.) can't corrupt the cache.
- save_config() / atomic_yaml_write bump mtime, naturally invalidating the cache for the next reader.
- Cache is keyed on str(config_path), so HERMES_HOME profile switches don't collide.

Verified:
- 112 config tests pass (test_config, test_config_env_expansion, test_config_env_refs, test_config_drift, test_config_validation, test_aux_config).
- 87 gateway tests pass (test_verbose_command, test_session_info, test_compress_focus, test_runtime_footer, test_resume_command, test_reasoning_command, test_approve_deny_commands, test_run_progress_interrupt).
- Live hermes chat smoke: 2 turns + /model switch + tool calls, zero errors in agent.log.

Co-authored-by: teknium1 <teknium@users.noreply.github.com>


teknium1 merged commit df51ad7 into main Apr 28, 2026
10 of 11 checks passed

teknium1 deleted the hermes/hermes-dab6fbf1 branch April 28, 2026 14:06

alt-glitch added labels on Apr 28, 2026: type/perf (Performance improvement or optimization), P2 (Medium: degraded but workaround exists), comp/cli (CLI entry point, hermes_cli/, setup wizard), comp/gateway (Gateway runner, session dispatch, delivery), area/config (Config system, migrations, profiles)

konsisumer mentioned this pull request Apr 28, 2026

fix(hindsight): fail loudly when local_embedded is missing hindsight-all #7741

Closed

1 task

This was referenced Apr 28, 2026

fix: resolve 7 identified issues [automated] Sldark23/hermes-agent#1

Closed

fix: resolve 7 identified issues [automated] #17090

Open

github-actions Bot mentioned this pull request May 1, 2026

chore: bump NousResearch/hermes-agent version from v2026.4.23 to v2026.4.30 Docker-Hub-sirmark/docker-hermes-agent#4

Merged
