-
Notifications
You must be signed in to change notification settings - Fork 33.5k
gpt-5.4 shows 32k context in Hermes instead of 1,050,000 #5173
Copy link
Copy link
Closed as not planned
Labels
P2Medium — degraded but workaround existsMedium — degraded but workaround existscomp/agentCore agent loop, run_agent.py, prompt builderCore agent loop, run_agent.py, prompt builderprovider/openaiOpenAI / Codex Responses APIOpenAI / Codex Responses APIsweeper:implemented-on-mainSweeper: behavior already present on current mainSweeper: behavior already present on current maintype/bugSomething isn't workingSomething isn't working
Metadata
Metadata
Assignees
Labels
P2Medium — degraded but workaround existsMedium — degraded but workaround existscomp/agentCore agent loop, run_agent.py, prompt builderCore agent loop, run_agent.py, prompt builderprovider/openaiOpenAI / Codex Responses APIOpenAI / Codex Responses APIsweeper:implemented-on-mainSweeper: behavior already present on current mainSweeper: behavior already present on current maintype/bugSomething isn't workingSomething isn't working
Type
Fields
Give feedbackNo fields configured for issues without a type.
Bug
Hermes can show
gpt-5.4with a32kcontext window in Codex-backed setups.Repro
gpt-5.4on the Codex endpoint32k contextExpected
gpt-5.4should resolve to1,050,000context tokens.Actual
Hermes can persist a bad cache entry like:
gpt-5.4@https://chatgpt.com/backend-api/codex: 32000Likely cause
A small output-token-related value is being persisted or reused as the model context window.
Reference
OpenAI currently lists
gpt-5.4at1,050,000context and128,000max output tokens:https://developers.openai.com/api/docs/models/gpt-5.4
Fix
I have a patch prepared that:
openai-codexto provider-aware OpenAI metadata