fix: properly pass model.max_tokens config to AIAgent in gateway by chengoak · Pull Request #19991 · NousResearch/hermes-agent

chengoak · 2026-05-05T02:12:42Z

What does this PR do?

Fixes an issue where `model.max_tokens` from config.yaml was not passed to AIAgent when running via the gateway (Feishu, QQBot, etc.), so model responses were truncated by conservative default output limits.

Changes:

- Read `max_tokens` from config
- Include `max_tokens` in the runtime dict passed to AIAgent
- Include `max_tokens` in fallback provider resolution
- Add a `max_tokens` parameter with config priority: CLI args > config file > model default
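
The priority order in the last item can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: the function name `resolve_max_tokens` and the plain-dict config shape are assumptions made for the example.

```python
from typing import Any, Mapping, Optional


def resolve_max_tokens(
    cli_value: Optional[int],
    config: Mapping[str, Any],
) -> Optional[int]:
    """Hypothetical helper: pick the output-token cap by priority.

    Returns None when nothing is configured, so the caller can leave
    the model default in place.
    """
    # 1. An explicit CLI argument wins.
    if cli_value is not None:
        return cli_value

    # 2. Otherwise use model.max_tokens from the config file, if present.
    model_cfg = config.get("model")
    if isinstance(model_cfg, Mapping):
        configured = model_cfg.get("max_tokens")
        if configured is not None:
            return int(configured)

    # 3. Nothing set: return None so the model default applies.
    return None
```

Returning `None` rather than a hard-coded number keeps the unset case identical to the behavior before this change.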

Why is this needed?

For custom providers like ByteDance Ark, the model's default output token limit is quite conservative. When `max_tokens` is configured but not passed through, users see warnings in platforms like Feishu.

Testing

- Verified that the config path correctly reads `model.max_tokens`
- All changes are backward compatible (None is passed when config is not set)
- Gateway routes correctly unpack the runtime dict including max_tokens
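
The backward-compatibility point can be illustrated with a stand-in class. `StubAgent` and `build_agent` are hypothetical names used only for this sketch; the real AIAgent constructor takes more arguments and is not reproduced here.

```python
from typing import Any, Dict, Optional


class StubAgent:
    """Stand-in for AIAgent; only the fields needed for the example."""

    def __init__(self, model: str, max_tokens: Optional[int] = None) -> None:
        self.model = model
        # None means "no override": the model default stays in effect.
        self.max_tokens = max_tokens


def build_agent(runtime: Dict[str, Any]) -> StubAgent:
    # Unpack the runtime dict as keyword arguments, as a gateway route might.
    return StubAgent(**runtime)


# A runtime dict without max_tokens still works (old behavior preserved).
legacy = build_agent({"model": "example-model"})
# A runtime dict with max_tokens carries the configured limit through.
configured = build_agent({"model": "example-model", "max_tokens": 8192})
```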

- Add max_tokens to _resolve_runtime_agent_kwargs() return value
- Add max_tokens to _resolve_turn_agent_config() runtime dict
- Add max_tokens to _try_resolve_fallback_provider() return value
- Add max_tokens parameter to HermesCLI.__init__ with config support

This ensures that model.max_tokens from config.yaml is properly passed to AIAgent, preventing response truncation for custom providers like ark/bytedance models that have conservative default output limits.

- Add max_tokens to _KNOWN_KEYS in config.py
- Pass max_tokens through _normalize_custom_provider_entry
- Read max_tokens from runtime provider config in gateway
- Fallback to model.max_tokens if provider-specific config not present
- Propagate max_tokens through credential pool resolution path
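
The provider-specific fallback described in this commit might look something like the sketch below. The function name `provider_max_tokens` and the dict shapes are assumptions for illustration; the actual lookup in the gateway may differ.

```python
from typing import Any, Mapping, Optional


def provider_max_tokens(
    provider_entry: Mapping[str, Any],
    global_config: Mapping[str, Any],
) -> Optional[int]:
    """Hypothetical helper: prefer the provider entry, then model.max_tokens."""
    # The provider-specific setting takes precedence when present.
    value = provider_entry.get("max_tokens")
    if value is not None:
        return value

    # Otherwise fall back to the global model.max_tokens setting.
    model_cfg = global_config.get("model")
    if isinstance(model_cfg, Mapping):
        return model_cfg.get("max_tokens")

    # Neither is set: None leaves the model default in place.
    return None
```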

chengoak · 2026-05-05T02:48:43Z

Fixes #20004 - This PR ensures max_tokens from custom_providers is properly passed to AIAgent, with fallback to global model.max_tokens config.

chengoak added 2 commits May 5, 2026 10:12

alt-glitch added the type/bug, P2, comp/gateway, comp/cli, and area/config labels May 5, 2026

chengoak mentioned this pull request May 5, 2026

max_tokens config from custom_providers is not passed to AIAgent #20004

Open

Sanjays2402 mentioned this pull request May 5, 2026

fix(gateway): honor max_tokens from custom_providers / providers entries (#20004) #20149

Open

chengoak wants to merge 2 commits into NousResearch:main from chengoak:fix/max_tokens-gateway-config