fix(cli): allow custom/local endpoints without API key#2556

Merged

teknium1 merged 1 commit into

mainfrom

hermes/hermes-fdcb4c4a

Mar 22, 2026

teknium1 commented Mar 22, 2026

Contributor

Problem

Users running local LLM servers (llama.cpp, ollama, vLLM) get:

Provider resolver returned an empty API key.

This broke after the runtime provider resolver was tightened to enforce non-empty API keys. Local servers don't need auth.

Fix

In _ensure_runtime_credentials(): when a custom base_url IS configured but no API key was found, use a placeholder ("no-key-required") instead of failing. Local servers ignore the Authorization header.

Only applies when the base URL is NOT an OpenRouter URL (those always need real keys).

Reported by @ThatWolfieGuy — llama.cpp with Qwen 3.5 stopped working after updating to v0.4.0.


          fix(cli): allow custom/local endpoints without API key

1b5fb36

Local LLM servers (llama.cpp, ollama, vLLM, etc.) typically don't
require authentication. When a custom base_url is configured but no
API key is found, use a placeholder instead of failing with
'Provider resolver returned an empty API key.'

The OpenAI SDK accepts any string as api_key, and local servers
simply ignore the Authorization header.

Fixes issue reported by @ThatWolfieGuy — llama.cpp stopped working
after updating because the new runtime provider resolver enforces
non-empty API keys even for keyless local endpoints.

teknium1 merged commit 5ddb6a1 into main

1 check passed

promptengines commented Mar 23, 2026

lol just what i was looking for

teknium1 added a commit that referenced this pull request


          fix: auxiliary client uses placeholder key for local servers without …

8b3404a

…auth

Local inference servers (Ollama, llama.cpp, vLLM, LM Studio) don't
require API keys, but the auxiliary client's _resolve_custom_runtime()
rejected endpoints with empty keys — causing the auto-detection chain
to skip the user's local server entirely.  This broke compression,
summarization, and memory flush for users running local models without
an OpenRouter/cloud API key.

The main CLI already had this fix (PR #2556, 'no-key-required'
placeholder), but the auxiliary client's resolution path was missed.

Two fixes:
- _resolve_custom_runtime(): use 'no-key-required' placeholder instead
  of returning None when base_url is present but key is empty
- resolve_provider_client() custom branch: same placeholder fallback
  for explicit_base_url without explicit_api_key

Updates 2 tests that expected the old (broken) behavior.

teknium1 mentioned this pull request

fix: auxiliary client uses placeholder key for local servers without auth #3842

Merged

teknium1 added a commit that referenced this pull request


          fix: auxiliary client uses placeholder key for local servers without …

3cc5053

…auth (#3842)

Local inference servers (Ollama, llama.cpp, vLLM, LM Studio) don't
require API keys, but the auxiliary client's _resolve_custom_runtime()
rejected endpoints with empty keys — causing the auto-detection chain
to skip the user's local server entirely.  This broke compression,
summarization, and memory flush for users running local models without
an OpenRouter/cloud API key.

The main CLI already had this fix (PR #2556, 'no-key-required'
placeholder), but the auxiliary client's resolution path was missed.

Two fixes:
- _resolve_custom_runtime(): use 'no-key-required' placeholder instead
  of returning None when base_url is present but key is empty
- resolve_provider_client() custom branch: same placeholder fallback
  for explicit_base_url without explicit_api_key

Updates 2 tests that expected the old (broken) behavior.

angelburgosrosado pushed a commit to angelburgosrosado/hermes-agent that referenced this pull request


          Merge pull request NousResearch#2556 from NousResearch/hermes/hermes-…

54f3d2e

…fdcb4c4a

fix(cli): allow custom/local endpoints without API key

angelburgosrosado pushed a commit to angelburgosrosado/hermes-agent that referenced this pull request


          fix: auxiliary client uses placeholder key for local servers without …

600a762

…auth (NousResearch#3842)

Local inference servers (Ollama, llama.cpp, vLLM, LM Studio) don't
require API keys, but the auxiliary client's _resolve_custom_runtime()
rejected endpoints with empty keys — causing the auto-detection chain
to skip the user's local server entirely.  This broke compression,
summarization, and memory flush for users running local models without
an OpenRouter/cloud API key.

The main CLI already had this fix (PR NousResearch#2556, 'no-key-required'
placeholder), but the auxiliary client's resolution path was missed.

Two fixes:
- _resolve_custom_runtime(): use 'no-key-required' placeholder instead
  of returning None when base_url is present but key is empty
- resolve_provider_client() custom branch: same placeholder fallback
  for explicit_base_url without explicit_api_key

Updates 2 tests that expected the old (broken) behavior.

02356abc pushed a commit to 02356abc/hermes-agent that referenced this pull request


          Merge pull request NousResearch#2556 from NousResearch/hermes/hermes-…

f685b93

…fdcb4c4a

fix(cli): allow custom/local endpoints without API key

02356abc pushed a commit to 02356abc/hermes-agent that referenced this pull request


          fix: auxiliary client uses placeholder key for local servers without …

84cc40f

…auth (NousResearch#3842)

Local inference servers (Ollama, llama.cpp, vLLM, LM Studio) don't
require API keys, but the auxiliary client's _resolve_custom_runtime()
rejected endpoints with empty keys — causing the auto-detection chain
to skip the user's local server entirely.  This broke compression,
summarization, and memory flush for users running local models without
an OpenRouter/cloud API key.

The main CLI already had this fix (PR NousResearch#2556, 'no-key-required'
placeholder), but the auxiliary client's resolution path was missed.

Two fixes:
- _resolve_custom_runtime(): use 'no-key-required' placeholder instead
  of returning None when base_url is present but key is empty
- resolve_provider_client() custom branch: same placeholder fallback
  for explicit_base_url without explicit_api_key

Updates 2 tests that expected the old (broken) behavior.

olympus-terminal pushed a commit to olympus-terminal/hermes-agent that referenced this pull request


          Merge pull request NousResearch#2556 from NousResearch/hermes/hermes-…

4365c6f

…fdcb4c4a

fix(cli): allow custom/local endpoints without API key

olympus-terminal pushed a commit to olympus-terminal/hermes-agent that referenced this pull request


          fix: auxiliary client uses placeholder key for local servers without …

50120b3

…auth (NousResearch#3842)

Local inference servers (Ollama, llama.cpp, vLLM, LM Studio) don't
require API keys, but the auxiliary client's _resolve_custom_runtime()
rejected endpoints with empty keys — causing the auto-detection chain
to skip the user's local server entirely.  This broke compression,
summarization, and memory flush for users running local models without
an OpenRouter/cloud API key.

The main CLI already had this fix (PR NousResearch#2556, 'no-key-required'
placeholder), but the auxiliary client's resolution path was missed.

Two fixes:
- _resolve_custom_runtime(): use 'no-key-required' placeholder instead
  of returning None when base_url is present but key is empty
- resolve_provider_client() custom branch: same placeholder fallback
  for explicit_base_url without explicit_api_key

Updates 2 tests that expected the old (broken) behavior.

gweeteve pushed a commit to gweeteve/hermes-agent that referenced this pull request


          Merge pull request NousResearch#2556 from NousResearch/hermes/hermes-…

b1d5b61

…fdcb4c4a

fix(cli): allow custom/local endpoints without API key

gweeteve pushed a commit to gweeteve/hermes-agent that referenced this pull request


          fix: auxiliary client uses placeholder key for local servers without …

caec8e2

…auth (NousResearch#3842)

Local inference servers (Ollama, llama.cpp, vLLM, LM Studio) don't
require API keys, but the auxiliary client's _resolve_custom_runtime()
rejected endpoints with empty keys — causing the auto-detection chain
to skip the user's local server entirely.  This broke compression,
summarization, and memory flush for users running local models without
an OpenRouter/cloud API key.

The main CLI already had this fix (PR NousResearch#2556, 'no-key-required'
placeholder), but the auxiliary client's resolution path was missed.

Two fixes:
- _resolve_custom_runtime(): use 'no-key-required' placeholder instead
  of returning None when base_url is present but key is empty
- resolve_provider_client() custom branch: same placeholder fallback
  for explicit_base_url without explicit_api_key

Updates 2 tests that expected the old (broken) behavior.

Egavasyug pushed a commit to Egavasyug/hermes-agent that referenced this pull request


          Merge pull request NousResearch#2556 from NousResearch/hermes/hermes-…

e9b826b

…fdcb4c4a

fix(cli): allow custom/local endpoints without API key

Egavasyug pushed a commit to Egavasyug/hermes-agent that referenced this pull request


          fix: auxiliary client uses placeholder key for local servers without …

3f1c60a

…auth (NousResearch#3842)

Local inference servers (Ollama, llama.cpp, vLLM, LM Studio) don't
require API keys, but the auxiliary client's _resolve_custom_runtime()
rejected endpoints with empty keys — causing the auto-detection chain
to skip the user's local server entirely.  This broke compression,
summarization, and memory flush for users running local models without
an OpenRouter/cloud API key.

The main CLI already had this fix (PR NousResearch#2556, 'no-key-required'
placeholder), but the auxiliary client's resolution path was missed.

Two fixes:
- _resolve_custom_runtime(): use 'no-key-required' placeholder instead
  of returning None when base_url is present but key is empty
- resolve_provider_client() custom branch: same placeholder fallback
  for explicit_base_url without explicit_api_key

Updates 2 tests that expected the old (broken) behavior.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet