Bug Description
Summary:
When users save a custom provider using the /model command without specifying a context length, the system neither auto-detects the context length nor displays the value that was actually applied. Users are left uncertain whether their custom provider has a short or long context window, which is critical for prompt caching behavior.
Impact:
- Users cannot easily verify whether their custom provider has an adequate context window
- Prompt caching may be less effective than expected
- No visual feedback distinguishes between auto-detected and default context lengths
Steps to Reproduce
Run hermes model
Select a custom provider
Do not enter a context length
Expected Behavior
After saving a custom provider without a context length, the system should:
- Attempt to auto-detect the context length based on the model name
- Display the detected context length to the user
- Fall back to a sensible default (128K tokens) if detection fails
- Indicate visually when the default was used vs. when auto-detection succeeded
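
The expected behavior above could be sketched roughly as follows. This is a minimal illustration, not Hermes code: the names `resolve_context_length`, `format_feedback`, `ContextLengthResult`, and the `KNOWN_CONTEXT_LENGTHS` table with its placeholder model names are all hypothetical, and "128K" is assumed here to mean 128,000 tokens.

```python
from dataclasses import dataclass

# Assumption: "128K tokens" from this report is taken as 128,000.
DEFAULT_CONTEXT_LENGTH = 128_000

# Hypothetical lookup table: model-name substring -> context length in tokens.
# The entries are placeholders, not real models.
KNOWN_CONTEXT_LENGTHS = {
    "example-long": 200_000,
    "example-short": 8_192,
}


@dataclass(frozen=True)
class ContextLengthResult:
    tokens: int
    source: str  # "user", "detected", or "default"


def resolve_context_length(model_name, user_value=None):
    """Pick a context length and record where it came from."""
    if user_value is not None:
        return ContextLengthResult(user_value, "user")
    name = model_name.lower()
    # Try to detect the context length from the model name.
    for pattern, tokens in KNOWN_CONTEXT_LENGTHS.items():
        if pattern in name:
            return ContextLengthResult(tokens, "detected")
    # Detection failed: fall back to the default.
    return ContextLengthResult(DEFAULT_CONTEXT_LENGTH, "default")


def format_feedback(result):
    """Build the message shown to the user after saving the provider."""
    labels = {
        "user": "user-specified",
        "detected": "auto-detected",
        "default": "default, detection failed",
    }
    return f"Context length: {result.tokens:,} tokens ({labels[result.source]})"


print(format_feedback(resolve_context_length("my-example-long-v1")))
print(format_feedback(resolve_context_length("unknown-model")))
```

Keeping the source of the value alongside the value itself is what lets the UI distinguish an auto-detected length from the fallback default.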
Actual Behavior
Users who save a custom provider without specifying a context length receive no feedback on which context length is in use. This can lead to unexpected prompt caching behavior and higher token costs.
Affected Component
CLI (interactive chat)
Messaging Platform (if gateway-related)
No response
Operating System
Debian 13.4
Python Version
3.13.3
Hermes Version
0.4
Relevant Logs / Traceback
Root Cause Analysis (optional)
No response
Proposed Fix (optional)
No response
Are you willing to submit a PR for this?