fix(tui): responsive /compress with live progress + CLI-parity feedback by OutThisLife · Pull Request #17661 · NousResearch/hermes-agent

OutThisLife · 2026-04-29T23:28:27Z

Summary

Route session.compress through the TUI gateway long-handler pool so manual compaction never blocks the JSON-RPC loop.
Show a single live progress line in the transcript with a braille spinner glyph and live message + token estimate, instead of a silent wait.
Return a structured summary (headline, token_line, optional note) shared with the classic CLI's /compress flow, so the final result reads like the CLI.
Mirror the gateway's status.update(kind: "compressing") event into the transcript so the live line stays visible while the LLM call is in flight.

Background

Profiled the compaction path locally on a long TUI session:

Synthetic 1,000-message, ~345k-token compressor run with the LLM summary stubbed: ~0.002s.
Cold TUI wrapper run: ~0.745s, mostly lazy imports/tool discovery.
Pre-fix dispatch check: session.compress ran inline on the gateway loop, so other RPCs blocked behind it for the full LLM compaction wall time.
Post-fix dispatch check: session.compress returns immediately (~0.1ms), and a concurrent fast.ping also returns in ~0.1ms.

What you see in the TUI now

· /compress
⠋ compressing 242 messages (~77,331 tok)…
···
· ✓ Compressed: 242 → 10 messages
·   Rough transcript estimate: ~77,331 → ~4,257 tokens

Test plan

scripts/run_tests.sh tests/tui_gateway/ -q
scripts/run_tests.sh tests/test_tui_gateway_server.py -k 'session_compress_uses_compress_helper or mirror_slash_side_effects'
cd ui-tui && npm run type-check
cd ui-tui && npm test -- src/__tests__/createGatewayEventHandler.test.ts
Manual: in a long-history TUI session, run /compress; verify the live spinner line shows the message + token estimate while compaction runs, and the multi-line summary lands when it finishes.

Route TUI session compression through the existing long-handler pool so slow compaction does not block other gateway RPCs.

Print a local status line before the compress RPC starts so slow manual compaction does not look like a no-op.

Copilot

Pull request overview

This PR ensures manual transcript compaction (session.compress) doesn’t block the TUI gateway’s main JSON-RPC dispatch loop by routing it through the existing long-handler thread pool, and adds a regression test to prevent future regressions.

Changes:

Add session.compress to the gateway’s _LONG_HANDLERS set so it’s dispatched via the thread pool.
Update the TUI /compress command UX to immediately print a “compressing…” status line and properly surface RPC failures.
Add a protocol-level regression test that simulates a slow session.compress while verifying a fast RPC still returns immediately; update the session-resume fake DB signature to match current gateway usage.

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated no comments.

File	Description
ui-tui/src/app/slash/commands/session.ts	Emits a user-visible “compressing context…” line and adds error handling for `/compress`.
tui_gateway/server.py	Routes `session.compress` through the long-handler pool to avoid blocking the dispatcher loop.
tests/tui_gateway/test_protocol.py	Adds a regression test for non-blocking `session.compress` and updates fake DB method signature for `include_ancestors`.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Show pre-compaction message count and rough token estimate immediately, emit a status update so the bottom bar reflects ongoing compaction, and report a multi-line summary (headline + token delta + optional note) using the shared summarize_manual_compression helper.

Mirror compression progress status into the transcript so users see the backend message count and token estimate while /compress is still running.

Drop the redundant local "compressing context..." placeholder and prefix the live backend status line with a braille spinner glyph so /compress reads as a single in-progress row.

Copilot

Pull request overview

Copilot reviewed 6 out of 6 changed files in this pull request and generated 3 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Reuse the precomputed token estimate inside _compress_session_history so the gateway does not redo the O(n) work while holding history_lock, keep the status bar pinned during long manual compactions instead of auto-restoring after 4s, and drop the redundant noop bullet that doubled with the system role glyph.

Copilot

Pull request overview

Copilot reviewed 7 out of 7 changed files in this pull request and generated 2 comments.

Comments suppressed due to low confidence (1)

tui_gateway/server.py:1024

In _compress_session_history(), the second argument passed to agent._compress_context() is getattr(agent, "_cached_system_prompt", "") or "". AIAgent._compress_context() rebuilds the system prompt via _build_system_prompt(system_message), which appends system_message when it is not None; passing the already-built cached prompt here will nest/duplicate the entire system prompt and can cause the prompt to grow on each /compress. To match the CLI’s manual /compress behavior (and avoid #15281-style duplication), pass None (or the original user-provided system message, if available) instead of _cached_system_prompt.

    compressed, _ = agent._compress_context(
        history,
        getattr(agent, "_cached_system_prompt", "") or "",
        approx_tokens=approx_tokens,
        focus_topic=focus_topic or None,
    )

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Move the snapshot/commit pattern into _compress_session_history so the lock is held only across the in-memory bookkeeping, not during agent._compress_context. Also emit a final neutral status update from session.compress so the pinned compressing indicator clears even on errors.

Copilot

Pull request overview

Copilot reviewed 7 out of 7 changed files in this pull request and generated 3 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Pass system_message=None so AIAgent._compress_context rebuilds the system prompt without nesting the cached identity block. Reuse the handler's pre-snapshotted history inside _compress_session_history to avoid a second O(n) copy under the lock. After compaction, when AIAgent._compress_context rotates session_id, sync the gateway session_key, migrate approval notify + yolo state, restart the slash worker, and clear the stale pending title. Mirrors HermesCLI._manual_compress.

Copilot

Pull request overview

Copilot reviewed 7 out of 7 changed files in this pull request and generated 3 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Stop pre-locking history before _compress_session_history in slash command mirroring, keep session-key sync parity with manual compression, and add a regression test that asserts /compress is invoked without holding history_lock.

Copilot

Pull request overview

Copilot reviewed 7 out of 7 changed files in this pull request and generated no new comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

…ck (NousResearch#17661) * fix(tui): offload manual compaction RPC Route TUI session compression through the existing long-handler pool so slow compaction does not block other gateway RPCs. * fix(tui): show compaction progress immediately Print a local status line before the compress RPC starts so slow manual compaction does not look like a no-op. * feat(tui): rich /compress feedback parity with CLI Show pre-compaction message count and rough token estimate immediately, emit a status update so the bottom bar reflects ongoing compaction, and report a multi-line summary (headline + token delta + optional note) using the shared summarize_manual_compression helper. * fix(tui): show live compaction estimate in transcript Mirror compression progress status into the transcript so users see the backend message count and token estimate while /compress is still running. * fix(tui): single live compaction line with spinner glyph Drop the redundant local "compressing context..." placeholder and prefix the live backend status line with a braille spinner glyph so /compress reads as a single in-progress row. * fix(tui): address review nits on /compress feedback Reuse the precomputed token estimate inside _compress_session_history so the gateway does not redo the O(n) work while holding history_lock, keep the status bar pinned during long manual compactions instead of auto-restoring after 4s, and drop the redundant noop bullet that doubled with the system role glyph. * fix(tui): release history_lock during compaction LLM call Move the snapshot/commit pattern into _compress_session_history so the lock is held only across the in-memory bookkeeping, not during agent._compress_context. Also emit a final neutral status update from session.compress so the pinned compressing indicator clears even on errors. * fix(tui): rebuild prompt cleanly + sync session_key after compress Pass system_message=None so AIAgent._compress_context rebuilds the system prompt without nesting the cached identity block. Reuse the handler's pre-snapshotted history inside _compress_session_history to avoid a second O(n) copy under the lock. After compaction, when AIAgent._compress_context rotates session_id, sync the gateway session_key, migrate approval notify + yolo state, restart the slash worker, and clear the stale pending title. Mirrors HermesCLI._manual_compress. * Avoid /compress lock re-entry in slash side effects. Stop pre-locking history before _compress_session_history in slash command mirroring, keep session-key sync parity with manual compression, and add a regression test that asserts /compress is invoked without holding history_lock.

fix(tui): offload manual compaction RPC

83c43f4

Route TUI session compression through the existing long-handler pool so slow compaction does not block other gateway RPCs.

alt-glitch added type/bug Something isn't working P2 Medium — degraded but workaround exists comp/tui Terminal UI (ui-tui/ + tui_gateway/) labels Apr 29, 2026

fix(tui): show compaction progress immediately

4e920e6

Print a local status line before the compress RPC starts so slow manual compaction does not look like a no-op.

OutThisLife requested a review from Copilot April 29, 2026 23:36

Copilot started reviewing on behalf of OutThisLife April 29, 2026 23:36 View session

Copilot AI reviewed Apr 29, 2026

View reviewed changes

OutThisLife added 3 commits April 29, 2026 18:43

fix(tui): show live compaction estimate in transcript

771f43a

Mirror compression progress status into the transcript so users see the backend message count and token estimate while /compress is still running.

fix(tui): single live compaction line with spinner glyph

fffb510

Drop the redundant local "compressing context..." placeholder and prefix the live backend status line with a braille spinner glyph so /compress reads as a single in-progress row.

OutThisLife changed the title ~~fix(tui): keep manual compaction from blocking RPCs~~ fix(tui): responsive /compress with live progress + CLI-parity feedback Apr 30, 2026

OutThisLife requested a review from Copilot April 30, 2026 00:07

Copilot started reviewing on behalf of OutThisLife April 30, 2026 00:07 View session