perf(prompt-cache): date-only timestamp + loud gateway-DB roundtrip logging by teknium1 · Pull Request #27675 · NousResearch/hermes-agent

teknium1 · 2026-05-18T00:23:51Z

Summary

System prompt is now byte-stable for the full day, and silent gateway-side prefix-cache misses now log loudly instead of disappearing into DEBUG.

The Conversation started: line was minute-precision (%I:%M %p) — byte-unstable across every rebuild path. Within a CLI session the in-memory cache held, but on the gateway path (fresh AIAgent per turn, prompt restored from session DB), any silent failure in the read or write path forced a full re-prefill on every subsequent turn. Local prefix-caching backends (llama.cpp, vLLM) saw it as KV-cache invalidation; remote prefix-caching providers saw it as an Anthropic-style cache miss.

Changes


C1 Date-only timestamp	`Sunday, May 17, 2026` instead of `Sunday, May 17, 2026 03:42 PM`. System prompt byte-stable for the full day. Credit @iamfoz (#20451).
A Loud DB write logging	`update_system_prompt` failure was `logger.debug`. Now `logger.warning` with the session id and exception.
B Three-way stored-state read	`session_row.get('system_prompt') or None` collapsed missing / NULL / empty into one path. Now distinguished and warned on null/empty when a continuing session hits them.
Refactor	Extracted restore logic into `_restore_or_build_system_prompt()` so the prefix-cache path is testable in isolation.

Validation

	Before	After
System prompt byte-stable within a day	No (minute-precision drifts every 60s)	Yes
Silent DB write failure visible in agent.log	No (DEBUG only)	Yes (WARNING)
NULL stored system_prompt detected	No (rebuilt forever)	Yes (WARNING + rebuild)
Empty stored system_prompt detected	No (rebuilt forever)	Yes (WARNING + rebuild)

E2E proof (live SessionDB, no mocks): fresh AIAgent constructed for turn 2 across a 65-second minute-boundary sleep restored byte-identical bytes from the DB. NULL stored prompt fires the new warning. Date-only timestamp survives the rebuild path.

Tests:

tests/agent/test_system_prompt_restore.py — 10 new tests covering happy path, fresh build, all silent-failure recovery paths, and byte-stability invariant
tests/run_agent/test_run_agent.py::TestBuildSystemPrompt::test_datetime_is_date_only_not_minute_precision — pins date-only invariant
Existing TestSystemPromptStability / TestBuildSystemPrompt / TestInvalidateSystemPrompt — all pass (24/24)

Closes

fix: date-only timestamp to preserve KV-cache across same-day sessions #20451 — date-only timestamp (@iamfoz, co-author credited)
fix(prompt): stabilize system prompt prefix for KV cache reuse #18547 — stabilize system prompt prefix
perf: stabilize system prompt timestamp across compression cycles #8689 — stabilize timestamp across compression
Question: does the minute-precision timestamp in _build_system_prompt invalidate prompt caching for upstream inference servers? #15866 — timestamp invalidates upstream prefix caching
perf: system prompt timestamp changes after compression, invalidating prefix cache on local LLM backends #8687 — system prompt timestamp changes after compression

Notes

The Model and Provider lines are kept in the prompt as-is. They're stable within a session (no /model switch) and provide self-identification context. Date-only timestamp closes the only minute-by-minute volatility source.

Within-session invariant audit: every call site of _build_system_prompt and every assignment to _cached_system_prompt was inventoried. No path mutates the prompt during a healthy session. The cache holds; this PR makes the prompt itself byte-stable across the cache invalidation boundaries (compression event, fresh-agent gateway turn, session resume).

@iamfoz

…ogging The system prompt's 'Conversation started:' line carried minute precision (%I:%M %p), making it byte-unstable across every rebuild path. Within a CLI session the in-memory cache held, but on the gateway path (fresh AIAgent per turn → restore from session DB), any silent failure in the read or write path dropped the cache stem and forced a full re-prefill on every subsequent turn. Local prefix-caching backends (llama.cpp / vLLM) saw this as KV-cache invalidation; remote prefix-caching providers saw it as an Anthropic-style cache miss. Three changes: 1. Date-only timestamp ('Sunday, May 17, 2026' instead of '... 03:42 PM'). System prompt now byte-stable for the full day. The model can still query exact time via tools when it actually needs it. Credit: @iamfoz (PR #20451). 2. Loud logging on session DB write failures. The update_system_prompt call used to log at DEBUG, hiding disk-full / locked-database / schema drift behind a silent fall-through that forced fresh rebuilds on every subsequent turn. Now WARN with the session id and exception so persistent issues show up in agent.log without verbose mode. 3. Three-way stored-state distinction on read. The previous 'session_row.get("system_prompt") or None' collapsed three states into one (missing row / null column / empty string). Now we tell them apart and WARN when a continuing session lands on null/empty (which means the previous turn's write never persisted — every subsequent turn rebuilds and the prefix cache misses every time). The restore block is extracted into _restore_or_build_system_prompt() so the prefix-cache path can be unit-tested in isolation. E2E proof: fresh AIAgent constructed for turn 2 across a minute-boundary sleep restores byte-identical bytes from the session DB. NULL stored prompt fires the new warning. Date-only timestamp survives the rebuild path. All on real SessionDB, no mocks. Tests: - tests/agent/test_system_prompt_restore.py (10 new tests) - tests/run_agent/test_run_agent.py::TestBuildSystemPrompt:: test_datetime_is_date_only_not_minute_precision Closes #20451 (date-only), #18547 (prefix stabilization), #8689 (stabilize timestamp across compression), #15866 (timestamp caching question), #8687 (compression timestamp), #27339 (claim #3: live timestamp in cached system prompt). Co-authored-by: Martyn Forryan <9133432+iamfoz@users.noreply.github.com>

github-actions · 2026-05-18T00:24:33Z

🔎 Lint report: `hermes/hermes-59647556` vs `origin/main`

ruff

Total: 0 on HEAD, 0 on base (➖ 0)

🆕 New issues: none

✅ Fixed issues: none

Unchanged: 0 pre-existing issues carried over.

ty (type checker)

Total: 8750 on HEAD, 8749 on base (🆕 +1)

🆕 New issues (1):

Rule	Count
`unresolved-import`	1

First entries

tests/agent/test_system_prompt_restore.py:21: [unresolved-import] unresolved-import: Cannot resolve imported module `pytest`

✅ Fixed issues: none

Unchanged: 4608 pre-existing issues carried over.

Diagnostics are surfaced as warnings — this check never fails the build.

alt-glitch · 2026-05-18T00:43:14Z

Supersedes #20451 (same date-only timestamp approach by @iamfoz). Also addresses #8687, #15866, and #8689. If merged, those competing PRs (#20451, #8689) and related issues should be closeable.

alt-glitch added type/perf Performance improvement or optimization P2 Medium — degraded but workaround exists comp/agent Core agent loop, run_agent.py, prompt builder labels May 18, 2026

iamfoz mentioned this pull request May 18, 2026

fix: date-only timestamp to preserve KV-cache across same-day sessions #20451

Closed

teknium1 merged commit 4a3f13b into main May 18, 2026
21 checks passed

teknium1 deleted the hermes/hermes-59647556 branch May 18, 2026 06:20

BrewTestBot mentioned this pull request May 28, 2026

hermes-agent 2026.5.28 Homebrew/homebrew-core#285115

Merged

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf(prompt-cache): date-only timestamp + loud gateway-DB roundtrip logging#27675

perf(prompt-cache): date-only timestamp + loud gateway-DB roundtrip logging#27675
teknium1 merged 1 commit into
mainfrom
hermes/hermes-59647556

teknium1 commented May 18, 2026

Uh oh!

github-actions Bot commented May 18, 2026

Uh oh!

alt-glitch commented May 18, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

teknium1 commented May 18, 2026

Summary

Changes

Validation

Closes

Notes

Uh oh!

github-actions Bot commented May 18, 2026

🔎 Lint report: hermes/hermes-59647556 vs origin/main

ruff

ty (type checker)

Uh oh!

alt-glitch commented May 18, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

🔎 Lint report: `hermes/hermes-59647556` vs `origin/main`