fix(transport): apply default temperature and parallel_tool_calls for custom OpenAI-compat endpoints (#18470) by XwanwanX · Pull Request #18492 · NousResearch/hermes-agent

XwanwanX · 2026-05-01T17:53:50Z

Summary

Hermes routed provider=custom through chat_completions without sending temperature or parallel_tool_calls. Many local OpenAI-compatible stacks (e.g. llama.cpp / vLLM) then keep server defaults (often temperature=1.0) and only batch tool rounds when parallel_tool_calls is explicit.

Fixes #18470.

Changes

ChatCompletionsTransport.build_kwargs
When is_custom_provider is true:
- If no fixed/overridden temperature applies, send temperature: 0.2 unless the user opts out (omit_temperature in compat options).
- When tools are present, send parallel_tool_calls: true by default unless overridden.
custom_providers (normalized in hermes_cli/config.py, surfaced via resolve_runtime_provider):
- Optional: temperature (number), parallel_tool_calls (bool), omit_temperature: true.
- Passed through CLI, gateway runtime dict, and AIAgent(custom_openai_request_options=...).
Delegate: subagents inherit parent compat options when effective_provider == "custom".

Config example

custom_providers:
  - name: Local
    base_url: http://127.0.0.1:8080/v1
    model: your-model-alias
    temperature: 0.25
    parallel_tool_calls: true
    # omit_temperature: true   # uncomment to omit `temperature` and use server defaults

…ndpoints - Default chat_completions sampling temperature for provider=custom stacks (Issue NousResearch#18470) - Send parallel_tool_calls when tools are present; optional YAML overrides via custom_providers - Thread options through CLI, gateway runtime, and delegate subagents

alt-glitch · 2026-05-01T18:08:37Z

Likely duplicate of #18483 — same root cause: chat_completions transport omits temperature and parallel_tool_calls for custom providers. This PR has broader scope (config/gateway/delegate) vs #18483's narrower transport-only fix. See also #18489.

alt-glitch · 2026-05-01T18:09:17Z

Likely duplicate of #18483

alt-glitch mentioned this pull request May 1, 2026

fix(transports): set temperature and parallel_tool_calls for custom chat_completions #18489

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(transport): apply default temperature and parallel_tool_calls for custom OpenAI-compat endpoints (#18470)#18492

fix(transport): apply default temperature and parallel_tool_calls for custom OpenAI-compat endpoints (#18470)#18492
XwanwanX wants to merge 1 commit into
NousResearch:mainfrom
XwanwanX:fix/custom-openai-chat-completions-defaults

XwanwanX commented May 1, 2026

Uh oh!

alt-glitch commented May 1, 2026

Uh oh!

alt-glitch commented May 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

XwanwanX commented May 1, 2026

Summary

Changes

Config example

Uh oh!

alt-glitch commented May 1, 2026

Uh oh!

alt-glitch commented May 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants