Overview
Add persistent session-usage visibility to the Hermes CLI so users can always see how many tokens have been used in the current session and how full the active model context window is.
This is one of the biggest UX gaps in the current CLI. Hermes already has access to token usage data from API responses and already knows model context limits via model_metadata.py, but that information is not surfaced in a way that helps users manage long-running sessions.
OpenCode and similar coding CLIs do a good job of showing cumulative usage, while Codex-style interfaces make context-window fullness legible at a glance. Hermes should expose both.
Problem
Today, Hermes users cannot easily answer basic session-management questions while working:
How close am I to filling the current context window?
Is a long conversation likely to compact soon or overflow unexpectedly?
That leads to avoidable surprises:
context pressure appears "suddenly"
users cannot tell whether a task is getting expensive
long sessions feel opaque compared with modern coding CLIs
There is already a broader open issue around a full CLI status bar and token/cost tracking (#683), but this narrower issue is specifically about surfacing session token totals plus context-window percentage in a simple, always-visible UX.
Proposed Design
Core behavior
Expose two pieces of session state in the CLI:
cumulative tokens used in the current session
current context-window utilization as a percentage of the active model's max context
Suggested display shape:
prompt/status line widget above the input area, or
another always-visible compact status element in the CLI layout
Example:
claude-sonnet │ 18.4k tokens used │ 41% context
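As a rough illustration of how the example line above could be produced, here is a minimal sketch. The function names `format_tokens` and `format_status` are hypothetical and not taken from the Hermes codebase; the sketch assumes the caller already knows the session total, the tokens currently in context, and the model's max context.

```python
def format_tokens(n: int) -> str:
    """Render a token count compactly, e.g. 18400 -> '18.4k'."""
    if n < 1000:
        return str(n)
    if n < 1_000_000:
        return f"{n / 1000:.1f}k"
    return f"{n / 1_000_000:.1f}M"


def format_status(model: str, session_tokens: int,
                  context_tokens: int, max_context: int) -> str:
    """Build the one-line status string (hypothetical helper)."""
    if max_context > 0:
        percent = round(100 * context_tokens / max_context)
        context_part = f"{percent}% context"
    else:
        # Unknown limit: show the raw total and skip the percentage.
        context_part = "context n/a"
    return f"{model} │ {format_tokens(session_tokens)} tokens used │ {context_part}"


print(format_status("claude-sonnet", 18_400, 82_000, 200_000))
```

The 200,000-token limit in the call is an illustrative value only; the real limit would come from Hermes's model metadata.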
Data sources
token usage from model/API response usage fields
max context from agent/model_metadata.py
session accumulator stored in CLI/session state
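One way the three data sources above might fit together is sketched below. Everything here is an assumption for illustration: the `SessionUsage` class, the `prompt_tokens` / `completion_tokens` key names, and the `ASSUMED_CONTEXT_LIMITS` table (a stand-in for whatever agent/model_metadata.py actually exposes) are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical stand-in for the limits kept in agent/model_metadata.py.
# The value is illustrative, not an authoritative model limit.
ASSUMED_CONTEXT_LIMITS = {"claude-sonnet": 200_000}


@dataclass
class SessionUsage:
    model: str
    total_tokens: int = 0    # cumulative across every turn in the session
    context_tokens: int = 0  # approximate size of the current context

    def record(self, usage: dict) -> None:
        """Fold one API response's usage fields into the session state."""
        prompt = int(usage.get("prompt_tokens", 0))
        completion = int(usage.get("completion_tokens", 0))
        self.total_tokens += prompt + completion
        # The latest prompt plus its completion approximates what the
        # next request will carry in its context window.
        self.context_tokens = prompt + completion

    def context_percent(self) -> Optional[float]:
        """Return utilization in percent, or None if the limit is unknown."""
        limit = ASSUMED_CONTEXT_LIMITS.get(self.model)
        if not limit:
            return None
        return 100 * self.context_tokens / limit


session = SessionUsage("claude-sonnet")
session.record({"prompt_tokens": 1000, "completion_tokens": 200})
session.record({"prompt_tokens": 1500, "completion_tokens": 500})
print(session.total_tokens, session.context_percent())
```

Note that this sketch sums prompt tokens on every turn, so re-sent context is counted each time it is sent. Whether the session total should instead count only new tokens is a design choice the issue leaves open.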
UX notes
keep it lightweight and always visible
show raw totals and percentage, not just a bar
degrade gracefully in narrow terminals
avoid requiring a separate slash command for the primary signal
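The "degrade gracefully in narrow terminals" note could be handled by trying progressively shorter variants of the status line, as in this minimal sketch. The helper name `fit_status` and the specific shortened forms are assumptions, not part of the proposal; every variant keeps both the raw total and the percentage.

```python
import shutil
from typing import Optional


def fit_status(model: str, tokens: str, percent: int,
               width: Optional[int] = None) -> str:
    """Pick the longest status variant that fits the terminal width."""
    if width is None:
        # Fall back to 80 columns when the size cannot be detected.
        width = shutil.get_terminal_size(fallback=(80, 24)).columns
    candidates = [
        f"{model} │ {tokens} tokens used │ {percent}% context",
        f"{tokens} tokens │ {percent}% ctx",
        f"{tokens} │ {percent}%",
    ]
    for candidate in candidates:
        if len(candidate) <= width:
            return candidate
    # Nothing fits: truncate the shortest variant rather than wrap.
    return candidates[-1][:width]


print(fit_status("claude-sonnet", "18.4k", 41, width=30))
```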
Possible extension points
/usage readout
Initial Scope
MVP:
Possible follow-up work:
Open Questions
References
usage fields
agent/model_metadata.py
cli.py