fix: read max_tokens from custom_providers per-model config by zccyman · Pull Request #28988 · NousResearch/hermes-agent

zccyman · 2026-05-20T00:00:02Z

Summary

Fixes #28046 — max_tokens configured under custom_providers[].models.<model>.max_tokens was silently ignored, always defaulting to 4096.

The codebase already had get_custom_provider_context_length() for reading per-model context_length from custom_providers, but no equivalent for max_tokens.

Changes

Extract get_custom_provider_model_field() — generic lookup helper that searches custom_providers entries for any per-model field. Replaces the inline logic in get_custom_provider_context_length().
Add get_custom_provider_max_tokens() — thin wrapper over the generic helper, symmetric to get_custom_provider_context_length().
Read max_tokens in agent_init.py — after the existing context_length custom_providers lookup, adds a symmetric block that calls get_custom_provider_max_tokens() when agent.max_tokens is None.
10 regression tests — full coverage for the new lookup: matching, trailing-slash insensitivity, zero/negative rejection, string coercion, coexistence with context_length, first-match-wins, None inputs.

Backward Compatibility

get_custom_provider_context_length() signature unchanged — custom_providers remains a positional arg (3 existing tests pass without modification)
New code only runs when agent.max_tokens is None (no override from top-level config or constructor) — pure additive fallback
22/22 tests pass (12 existing context_length + 10 new max_tokens)

Root Cause Pattern

This is the same pattern as #28961 (pre_tool_call missing session_id) and #28984 (Typed Plugin Hook Protocol FR): configuration/state flows through call chains without schema enforcement, so adding a new config field requires manually updating every bridge — and omissions are silent.

Previously, only context_length was read from custom_providers per-model config. max_tokens was silently ignored, always falling back to 4096. Changes: - Extract get_custom_provider_model_field() as generic lookup helper - Add get_custom_provider_max_tokens() symmetric to context_length - Read custom_providers max_tokens in agent_init.py after context_length - Add 10 regression tests for max_tokens lookup Closes NousResearch#28046

alt-glitch · 2026-05-20T00:08:03Z

Duplicate of #28142 — same fix (read max_tokens from custom_providers per-model config in agent_init.py). Also see competing PRs #28154 (already marked dupe of #28142) and #28786.

zccyman · 2026-05-20T00:18:26Z

Superseded by PR #28995 which includes this fix plus the Config-Runtime Contract Registry (Phase 1).

This was referenced May 20, 2026

[Feature]: Typed Config-Runtime Contract — eliminate silent config/state/hook binding gaps #28984

Open

feat: Config-Runtime Contract Registry (Phase 1) + fix #28046 + fix #28863 #28995

Open

zccyman closed this May 20, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: read max_tokens from custom_providers per-model config#28988

fix: read max_tokens from custom_providers per-model config#28988
zccyman wants to merge 1 commit into
NousResearch:mainfrom
atyou2happy:fix/custom-provider-max-tokens

zccyman commented May 20, 2026

Uh oh!

alt-glitch commented May 20, 2026

Uh oh!

zccyman commented May 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

zccyman commented May 20, 2026

Summary

Changes

Backward Compatibility

Root Cause Pattern

Uh oh!

alt-glitch commented May 20, 2026

Uh oh!

zccyman commented May 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants