fix(model_metadata): add xAI Grok context length fallbacks by teknium1 · Pull Request #7093 · NousResearch/hermes-agent

teknium1 · 2026-04-10T10:04:14Z

Cherry-picked from #7039 by @Julientalbot onto current main.

Summary

xAI's /v1/models endpoint does not return context_length metadata. Users pointing at https://api.x.ai/v1 via a custom provider fall through to the 128k probe-down default, losing up to 93% of the usable window (e.g. 128k instead of 2M for grok-4.20).

Adds DEFAULT_CONTEXT_LENGTHS entries for the Grok family — same pattern as Claude, Gemma, MiniMax, Kimi, and GLM.

Test plan

pytest tests/agent/test_model_metadata.py — 77 passed

xAI /v1/models does not return context_length metadata, so Hermes probes down to the 128k default whenever a user configures a custom provider pointing at https://api.x.ai/v1. This forces every xAI user to manually override model.context_length in config.yaml (2M for Grok 4.20 / 4.1-fast / 4-fast) or lose most of the usable context window. Add DEFAULT_CONTEXT_LENGTHS entries for the Grok family so the fallback lookup returns the correct value via substring matching. Values sourced from models.dev (2026-04) and cross-checked against the xAI /v1/models listing: - grok-4.20-* 2,000,000 (reasoning, non-reasoning, multi-agent) - grok-4-1-fast-* 2,000,000 - grok-4-fast-* 2,000,000 - grok-4 / grok-4-0709 256,000 - grok-code-fast-1 256,000 - grok-3* 131,072 - grok-2 / latest 131,072 - grok-2-vision* 8,192 - grok (catch-all) 131,072 Keys are ordered longest-first so that specific variants match before the catch-all, consistent with the existing Claude/Gemma/MiniMax entries. Add TestDefaultContextLengths.test_grok_models_context_lengths and test_grok_substring_matching to pin the values and verify the full lookup path. All 77 tests in test_model_metadata.py pass.

teknium1 merged commit b577697 into main Apr 10, 2026
3 of 4 checks passed

teknium1 mentioned this pull request Apr 10, 2026

fix(model_metadata): add xAI Grok context length fallbacks #7039

Closed

This was referenced Apr 10, 2026

feat(providers): add native xAI provider #7050

Closed

feat(prompt_builder): add GROK_EXECUTION_GUIDANCE to suppress narration without tool calls #7138

Closed

DevvGwardo mentioned this pull request May 5, 2026

feat(xai): add grok-4.3 to static fallback and context-length map #20398

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(model_metadata): add xAI Grok context length fallbacks#7093

fix(model_metadata): add xAI Grok context length fallbacks#7093
teknium1 merged 1 commit into
mainfrom
hermes/hermes-5bbf4839

teknium1 commented Apr 10, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

teknium1 commented Apr 10, 2026

Summary

Test plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants