fix(models): consolidated validation + picker — anthropic_messages, native Anthropic, Gemini prefix, OpenAI catalog by teknium1 · Pull Request #15136 · NousResearch/hermes-agent

teknium1 · 2026-04-24T12:48:08Z

Summary

Consolidated salvage of 4 model validation / /model picker PRs — anthropic_messages mode routing, native Anthropic validator, Gemini models/ prefix, and OpenAI live /model counts. Attribution preserved via rebase-merge.

Changes

hermes_cli/models.py + hermes_cli/main.py + hermes_cli/model_switch.py (anthropic_messages + Cloudflare) — probe_api_models() now takes api_mode and sends x-api-key + anthropic-version headers when api_mode == "anthropic_messages" instead of Authorization: Bearer. Always sets User-Agent: hermes-cli/<version> (Cloudflare 1010 bypass). Custom endpoints where the /models probe is unreachable no longer hard-reject — the model is persisted with a warning so proxy endpoints that don't implement /v1/models still work. api_mode is threaded through the pipeline (validate_requested_model → probe_api_models → fetch_api_models). From @Wangshengyang2004 (fix(cli): model validation fails for anthropic_messages and Cloudflare-protected endpoints #12950). Subsumes @cedric-common's fix(models): use x-api-key for Anthropic API probing #13189 (URL-based) with mode-based detection.
hermes_cli/models.py (native Anthropic validator) — validate_requested_model gained a dedicated normalized == "anthropic" branch that calls _fetch_anthropic_models() (proper x-api-key + anthropic-version headers; also handles Claude Code OAuth tokens). Fuzzy auto-correction on close matches, suggestions on unknown IDs, accept with warning on snapshot / early-access IDs that Anthropic gates outside /v1/models. From @H-Ali13381 (fix(models): use Anthropic-native headers for model validation #12618).
hermes_cli/models.py (Gemini models/ prefix) — Gemini's OpenAI-compat /v1beta/openai/models endpoint returns IDs prefixed with models/ (native Gemini-API convention). The set-membership check dropped every curated Gemini ID. Strip the prefix before comparison when normalized == "gemini". Fixes Gateway /model picker fails for Gemini and Anthropic providers (validate_requested_model rejects curated models) #12532. From @briandevans (fix(models): accept Gemini + Anthropic in gateway /model picker (#12532) #12585 — Gemini piece only; the Anthropic piece of that PR was subsumed by fix(models): use Anthropic-native headers for model validation #12618 above, and the full branch carried unrelated catalog-version regressions).
hermes_cli/model_switch.py + hermes_cli/models.py (OpenAI picker counts) — _PROVIDER_MODELS["openai"] gained a curated static catalog. provider_model_ids("openai") probes /v1/models live when OPENAI_API_KEY is set, falling back to the catalog. list_authenticated_providers() uses the catalog when a provider row's models dict is empty and the base_url is api.openai.com, so OpenAI / OpenAI Direct rows no longer show 0 models. Fixes bug(model-picker): OpenAI and OpenAI Direct show 0 models on latest main #14651. From @XieNBi (fix(cli): non-zero /model counts for native OpenAI and direct API rows #14753).
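The live-probe-then-catalog fallback described for provider_model_ids("openai") can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: the function name openai_model_ids, the injected probe callable, and the placeholder catalog entries are all hypothetical, and the real catalog lives in _PROVIDER_MODELS["openai"].

```python
import os
from typing import Callable, List, Mapping, Optional

# Hypothetical placeholder entries; the real curated catalog is
# _PROVIDER_MODELS["openai"] in hermes_cli/models.py.
CURATED_OPENAI_MODELS = ["example-model-a", "example-model-b"]


def openai_model_ids(
    probe: Callable[[str], Optional[List[str]]],
    env: Optional[Mapping[str, str]] = None,
) -> List[str]:
    """Return live IDs when a key is set and the probe succeeds, else the catalog.

    `probe` stands in for the live /v1/models call; it receives the API key
    and returns a list of model IDs, or None on failure.
    """
    env = os.environ if env is None else env
    api_key = env.get("OPENAI_API_KEY", "").strip()
    if api_key:
        try:
            live = probe(api_key)
        except Exception:
            # A network or auth failure should not leave the picker empty.
            live = None
        if live:
            return list(live)
    # No key, failed probe, or empty response: fall back to the static catalog.
    return list(CURATED_OPENAI_MODELS)
```

Injecting the probe keeps the sketch testable without network access; the picker row count would then be the length of whatever list comes back, so it is never 0 while the catalog is non-empty.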

Credit

Validation

scripts/run_tests.sh tests/hermes_cli/test_models.py tests/hermes_cli/test_model_validation.py \
  tests/hermes_cli/test_model_switch_custom_providers.py \
  tests/hermes_cli/test_user_providers_model_switch.py \
  tests/hermes_cli/test_copilot_auth.py

181/181 passing. hermes_cli/models.py, hermes_cli/model_switch.py, hermes_cli/main.py all compile.

One stale test updated — test_custom_endpoint_warns_with_probed_url_and_v1_hint expected persist=False on probe failure; after #12950 it's persist=True (the intentional behavior change, with a comment pointing at #12950).

Conflict resolutions

fix(cli): model validation fails for anthropic_messages and Cloudflare-protected endpoints #12950 — combined with main's _HERMES_USER_AGENT constant.
fix(models): use Anthropic-native headers for model validation #12618 — slotted the native anthropic branch AFTER the MiniMax branch but BEFORE the api_mode=="anthropic_messages" branch; both coexist (different provider vs. different transport).
fix(models): accept Gemini + Anthropic in gateway /model picker (#12532) #12585 — cherry-pick aborted due to stale catalog regressions; Gemini-prefix piece applied directly with authorship preserved via --author=briandevans.

Not included

fix(models): use x-api-key for Anthropic API probing #13189 @cedric-common — URL-based x-api-key trigger subsumed by fix(cli): model validation fails for anthropic_messages and Cloudflare-protected endpoints #12950's mode-based detection. Will close with credit.
fix(cli): validate user-defined providers consistently #14857 @LeonSGP43 — already merged via fix(cli): validate user-defined providers consistently (salvage #14857) #15083 (before this salvage window).

…are-protected endpoints

- probe_api_models: add api_mode param; use x-api-key + anthropic-version headers for anthropic_messages mode (Anthropic's native Models API auth)
- probe_api_models: add User-Agent header to avoid Cloudflare 403 blocks on third-party OpenAI-compatible endpoints
- validate_requested_model: pass api_mode through from switch_model
- validate_requested_model: for anthropic_messages mode, attempt probe with correct auth; if probe fails (many proxies don't implement /v1/models), accept the model with an informational warning instead of rejecting
- fetch_api_models: propagate api_mode to probe_api_models
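The mode-based header selection in this commit can be sketched as below. This is an illustration under stated assumptions rather than the actual probe_api_models() body: build_probe_headers is a hypothetical helper name, the anthropic-version value is assumed (the commit does not state which version string it sends), and the User-Agent version is a placeholder.

```python
from typing import Dict

# Assumed value for illustration; the commit does not say which version it sends.
ANTHROPIC_VERSION = "2023-06-01"
# Placeholder; the PR describes the real header as hermes-cli/<version>.
HERMES_USER_AGENT = "hermes-cli/0.0.0"


def build_probe_headers(api_key: str, api_mode: str = "") -> Dict[str, str]:
    """Choose auth headers for a /models probe based on api_mode."""
    # Always send a User-Agent so Cloudflare-fronted endpoints do not block the probe.
    headers = {"User-Agent": HERMES_USER_AGENT}
    if api_mode == "anthropic_messages":
        # Anthropic-style auth: x-api-key plus a version header, no Bearer token.
        headers["x-api-key"] = api_key
        headers["anthropic-version"] = ANTHROPIC_VERSION
    elif api_key:
        # Default OpenAI-compatible auth.
        headers["Authorization"] = f"Bearer {api_key}"
    return headers
```

Keying the choice on api_mode rather than on the endpoint URL means a proxy on any hostname gets the right headers as long as it is configured with anthropic_messages mode.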

The generic /v1/models probe in validate_requested_model() sent a plain 'Authorization: Bearer <key>' header, which works for OpenAI-compatible endpoints but results in a 401 Unauthorized from Anthropic's API. Anthropic requires x-api-key + anthropic-version headers (or Bearer for OAuth tokens from Claude Code).

Add a provider-specific branch for normalized == 'anthropic' that calls the existing _fetch_anthropic_models() helper, which already handles both regular API keys and Claude Code OAuth tokens correctly. This mirrors the pattern already used for openai-codex, copilot, and bedrock.

The branch also includes:

- fuzzy auto-correct (cutoff 0.9) for near-exact model ID typos
- fuzzy suggestions (cutoff 0.5) when the model is not listed
- graceful fall-through when the token cannot be resolved or the network is unreachable (accepts with a warning rather than hard-fail)
- a note that newer/preview/snapshot model IDs can be gate-listed and may still work even if not returned by /v1/models

Fixes Anthropic provider users seeing 'service unreachable' errors when running /model <claude-model> because every probe 401'd.
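The two fuzzy cutoffs mentioned above could be combined along these lines. This is a sketch only: it assumes the matching uses difflib.get_close_matches from the standard library, which the commit message does not confirm, and check_model_id, the result dict shape, and the model IDs in the usage note are all hypothetical.

```python
from difflib import get_close_matches
from typing import Dict, List


def check_model_id(requested: str, available: List[str]) -> Dict[str, object]:
    """Classify a requested model ID against the IDs a provider returned."""
    if requested in available:
        return {"model": requested, "corrected": False, "suggestions": [], "warning": None}

    # Near-exact typo: silently switch to the closest listed ID.
    auto = get_close_matches(requested, available, n=1, cutoff=0.9)
    if auto:
        return {
            "model": auto[0],
            "corrected": True,
            "suggestions": [],
            "warning": f"Auto-corrected '{requested}' to '{auto[0]}'",
        }

    # Not listed: keep the ID (it may be a gated snapshot) but warn and suggest.
    suggestions = get_close_matches(requested, available, n=3, cutoff=0.5)
    return {
        "model": requested,
        "corrected": False,
        "suggestions": suggestions,
        "warning": f"'{requested}' was not returned by /v1/models; it may still work",
    }
```

With a hypothetical listing of ["claude-example-large", "claude-example-small"], the request "claude-example-larg" clears the 0.9 cutoff and is corrected to the first entry, while an unrelated string such as "zzz" is kept as-is with a warning and no suggestions.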

@briandevans

Salvage of the Gemini-specific piece from PR #12585 by @briandevans. Gemini's OpenAI-compat /v1beta/openai/models endpoint returns IDs prefixed with 'models/' (native Gemini-API convention), so set-membership against curated bare IDs drops every model. Strip the prefix before comparison. The Anthropic static-catalog piece of #12585 was subsumed by #12618's _fetch_anthropic_models() branch landing earlier in the same salvage PR. Full branch cherry-pick was skipped because it also carried unrelated catalog-version regressions.
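The prefix handling amounts to normalizing the probed IDs before the set-membership check. A minimal sketch, assuming a hypothetical helper name and made-up model IDs (the actual change is applied inline when normalized == "gemini"):

```python
from typing import Iterable, Set

GEMINI_ID_PREFIX = "models/"


def strip_gemini_prefix(ids: Iterable[str]) -> Set[str]:
    """Drop the leading 'models/' from each probed ID, leaving bare IDs untouched."""
    return {
        model_id[len(GEMINI_ID_PREFIX):] if model_id.startswith(GEMINI_ID_PREFIX) else model_id
        for model_id in ids
    }


# Hypothetical IDs for illustration only.
probed = ["models/gemini-example-pro", "models/gemini-example-flash"]
curated = ["gemini-example-pro", "gemini-example-flash"]

# Without normalization no curated ID matches; with it, all of them do.
kept_before = [m for m in curated if m in set(probed)]
kept_after = [m for m in curated if m in strip_gemini_prefix(probed)]
```

Only a leading prefix is removed, so an ID that merely contains "models/" elsewhere is left unchanged.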

Wangshengyang2004 and others added 5 commits April 24, 2026 05:44

fix(cli): non-zero /model counts for native OpenAI and direct API rows

69d1345

chore(release): map Group H contributors in AUTHOR_MAP

b85261f

teknium1 merged commit 9d1b277 into main Apr 24, 2026
10 of 11 checks passed

teknium1 deleted the hermes/hermes-172af8ae branch April 24, 2026 12:48

alt-glitch added the type/bug, P1, and comp/cli labels Apr 24, 2026

perlowja mentioned this pull request Apr 24, 2026

docs(providers): Together/Groq/Perplexity cookbook via custom_providers #15214

Closed

19 tasks
