feat: add grok to TOOL_USE_ENFORCEMENT_MODELS for direct xAI usage by teknium1 · Pull Request #5595 · NousResearch/hermes-agent

teknium1 · 2026-04-06T18:21:08Z

Summary

Adds "grok" to the TOOL_USE_ENFORCEMENT_MODELS tuple so Grok models receive tool-use enforcement guidance in the system prompt.

Closes #5531

What changed

agent/prompt_builder.py: Added "grok" to TOOL_USE_ENFORCEMENT_MODELS
tests/agent/test_prompt_builder.py: Added assertion test for grok inclusion

Why

Grok models (x-ai/grok-4.20-beta, grok-code-fast-1) accessed via OpenRouter or direct xAI API were not getting the tool-use enforcement guidance that steers models to actually call tools instead of describing intended actions. The substring match on "grok" covers both routing paths.
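The substring match described above can be sketched roughly as follows. This is a minimal illustration, not the code in agent/prompt_builder.py: the helper name `needs_tool_use_enforcement`, the lowercasing step, and the one-element tuple are assumptions made for the example (the real tuple contains other entries that this PR does not list).

```python
# Assumed contents: only "grok" is confirmed by this PR; other entries are omitted.
TOOL_USE_ENFORCEMENT_MODELS = ("grok",)


def needs_tool_use_enforcement(model_name: str) -> bool:
    """Hypothetical helper: True if any enforcement key is a substring of the model name."""
    lowered = model_name.lower()  # lowercasing is an assumption, not confirmed by the PR
    return any(key in lowered for key in TOOL_USE_ENFORCEMENT_MODELS)


if __name__ == "__main__":
    # One OpenRouter-style ID, one direct xAI-style ID, one hypothetical non-matching ID.
    for name in ("x-ai/grok-4.20-beta", "grok-code-fast-1", "some-other-model"):
        print(name, needs_tool_use_enforcement(name))
```

Because the check is a plain substring test, the `x-ai/` prefix used by OpenRouter makes no difference, which is why a single "grok" entry is enough for both routing paths.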

Test plan

python -m pytest tests/agent/test_prompt_builder.py -n0 -q — 119 passed

Grok models (x-ai/grok-4.20-beta, grok-code-fast-1) now receive tool-use enforcement guidance, steering them to actually call tools instead of describing intended actions. Matches both OpenRouter (x-ai/grok-*) and direct xAI API usage.

github-actions · 2026-04-06T18:21:33Z

⚠️ Supply Chain Risk Detected

This PR contains patterns commonly associated with supply chain attacks. This does not mean the PR is malicious — but these patterns require careful human review before merging.

⚠️ WARNING: Install hook files modified

These files can execute code during package installation or interpreter startup.

Files:

hermes_cli/setup.py

Automated scan triggered by supply-chain-audit. If this is a false positive, a maintainer can approve after manual review.

Grok reasoning models have a failure mode where they describe planned actions in text ("I will check X", "Je vais lancer Y") without actually calling the corresponding tools. The existing TOOL_USE_ENFORCEMENT_GUIDANCE mitigates the "action reflex" trait (NousResearch#5595) but doesn't address the narration-vs-execution split that is specific to reasoning architectures.

Add GROK_EXECUTION_GUIDANCE, a targeted system prompt block injected alongside TOOL_USE_ENFORCEMENT_GUIDANCE when the model name contains "grok". Three XML-tagged sections:

- <no_intent_phrases>: explicit list of forbidden phrases in English and French ("I will...", "Let me...", "Je vais...", etc.) with the rule: if you need to act, call the tool now; do not narrate the intent.
- <execute_first>: mandate that the first response to any work-implying request contain a tool call, not a plan. Chain multiple tool calls in the same turn without intermediate prose.
- <no_analysis_hallucination>: forbid structured analyses, diagnosis lists, or recommendations produced from pure reasoning without tool calls to verify the claims.

Injected in run_agent.py next to the existing provider-specific guidance blocks (OPENAI_MODEL_EXECUTION_GUIDANCE, GOOGLE_MODEL_OPERATIONAL_GUIDANCE).

Tests (6 new in TestGrokExecutionGuidance):

- Verifies XML tag structure
- Asserts intent-phrase examples are present in both English and French
- Asserts the execute-first mandate is documented
- Asserts the no-analysis-hallucination rule is present
- Size and type checks

124 passed, 1 skipped in tests/agent/test_prompt_builder.py (no regression).

NOT YET PUSHED as a PR. To be dogfooded on the author's production instance on xAI before upstream submission, given the precedent of 'behavioral' patches being classified as prostheses in prior work.
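As a rough sketch of the conditional injection described in this commit message: the constant names and the three tag names come from the text above, but the guidance wording, the placeholder enforcement string, and the helpers `build_guidance_blocks` and `section_tags` are assumptions invented for illustration. The actual wiring in run_agent.py may look quite different.

```python
import re

# Placeholder text: the real guidance strings are not shown in this thread.
TOOL_USE_ENFORCEMENT_GUIDANCE = "(placeholder for the existing tool-use enforcement text)"

GROK_EXECUTION_GUIDANCE = """\
<no_intent_phrases>
Do not write "I will...", "Let me...", or "Je vais...". If you need to act, call the tool now.
</no_intent_phrases>
<execute_first>
The first response to a work-implying request must contain a tool call, not a plan.
</execute_first>
<no_analysis_hallucination>
Do not produce analyses or recommendations without tool calls that verify the claims.
</no_analysis_hallucination>"""


def build_guidance_blocks(model_name: str) -> list[str]:
    """Hypothetical helper: collect the guidance blocks that apply to a model name."""
    blocks: list[str] = []
    if "grok" in model_name.lower():
        blocks.append(TOOL_USE_ENFORCEMENT_GUIDANCE)
        blocks.append(GROK_EXECUTION_GUIDANCE)
    return blocks


def section_tags(text: str) -> list[str]:
    """Return the names of opening XML-style tags, in order of appearance."""
    return re.findall(r"<(\w+)>", text)


if __name__ == "__main__":
    print(len(build_guidance_blocks("grok-code-fast-1")))
    print(section_tags(GROK_EXECUTION_GUIDANCE))
```

A structure check like `section_tags` is roughly the kind of assertion the "Verifies XML tag structure" test could make, though the real tests in TestGrokExecutionGuidance are not shown here.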

teknium1 merged commit 582dbbb into main Apr 6, 2026
3 of 4 checks passed

Julientalbot mentioned this pull request Apr 10, 2026

feat(prompt_builder): add GROK_EXECUTION_GUIDANCE to suppress narration without tool calls #7138

Closed

alt-glitch mentioned this pull request Apr 24, 2026

Add "grok" to TOOL_USE_ENFORCEMENT_MODELS for direct xAI usage #5530

Closed

briandevans mentioned this pull request May 18, 2026

fix(agent): add qwen and deepseek to TOOL_USE_ENFORCEMENT_MODELS #28195

Closed

19 tasks

FCAR2025 mentioned this pull request May 19, 2026

fix(grok,glm): OPENAI guidance + tool_choice=required on stall retry #28325

Open

5 tasks

teknium1 merged 1 commit into main from hermes/hermes-ba679ba8
