[Bug]: `think=False` incorrectly sent to all `provider=custom` endpoints, not just Ollama

### Bug Description

## Summary

When a user sets `reasoning_effort: none` (or disables reasoning via `/reasoning none`),
Hermes Agent sends `extra_body["think"] = false` to **every** custom provider endpoint. The
`think` parameter is Ollama-specific and is rejected with HTTP 422 by any non-Ollama provider
that validates its request body.

## Acceptance Criteria

- [ ] `think=False` is sent to Ollama/local endpoints when reasoning is disabled.
- [ ] `think=False` is **not** sent to cloud custom providers (Mistral, Fireworks, Together.ai, vLLM remote, etc.).
- [ ] OpenRouter and Anthropic reasoning paths are unaffected.
- [ ] `python -m pytest tests/ -q` passes with no regressions.

---

## Labels

`bug` `provider-compat` `reasoning` `ollama`


### Steps to Reproduce

1. Configure any non-Ollama endpoint as a custom provider, e.g. Mistral AI, Fireworks,
   Together.ai, or a vLLM server:
   ```yaml
   provider: custom
   base_url: https://api.mistral.ai/v1
   model: mistral-large-latest
   ```
2. Disable reasoning in the CLI or config:
   ```
   /reasoning none
   ```
   or in `config.yaml`:
   ```yaml
   reasoning_effort: none
   ```
3. Send any message.

### Expected Behavior

The unknown 'think' field should not have been sent to the provider. 

### Actual Behavior

**Result:** The provider returns HTTP 422 because it received an unknown `think` field:

```json
{
  "detail": [
    {
      "type": "extra_forbidden",
      "loc": ["body", "think"],
      "msg": "Extra inputs are not permitted"
    }
  ]
}
```


### Affected Component

Agent Core (conversation loop, context compression, memory)

### Messaging Platform (if gateway-related)

_No response_

### Debug Report

```shell
.
```

### Operating System

Linux 6.12.74+deb13+1-amd64 x86_64

### Python Version

3.11.15

### Hermes Version

0.9.0

### Additional Logs / Traceback (optional)

```shell

```

### Root Cause Analysis (optional)

**File:** `run_agent.py` — `_build_api_kwargs()`

The guard condition checks only `provider == "custom"`, which matches all custom providers
regardless of their URL:

```python
if self.provider == "custom" and self.reasoning_config and isinstance(self.reasoning_config, dict):
    _effort = (self.reasoning_config.get("effort") or "").strip().lower()
    _enabled = self.reasoning_config.get("enabled", True)
    if _effort == "none" or _enabled is False:
        extra_body["think"] = False  # sent to ALL custom providers
```

The `think` parameter is an Ollama-native API extension. It is **not** part of the OpenAI
Chat Completions specification and is not recognised by any cloud provider.


### Proposed Fix (optional)

## Fix

Replace the `provider == "custom"` guard with an Ollama-specific URL check:

```python
# Only send think=False to Ollama/local endpoints — it is an Ollama-native parameter
# and is rejected by cloud custom providers (Mistral, Fireworks, vLLM, etc.).
_is_ollama_endpoint = (
    "ollama" in self._base_url_lower
    or ":11434" in self._base_url_lower
    or is_local_endpoint(self.base_url or "")
)
if _is_ollama_endpoint and self.reasoning_config and isinstance(self.reasoning_config, dict):
    _effort = (self.reasoning_config.get("effort") or "").strip().lower()
    _enabled = self.reasoning_config.get("enabled", True)
    if _effort == "none" or _enabled is False:
        extra_body["think"] = False
```

`is_local_endpoint()` is already imported in `run_agent.py` and returns `True` for
`localhost`, `127.0.0.1`, and `0.0.0.0` addresses — which covers all local Ollama setups
regardless of port.

---

## Impact

- **Affected:** Any user with `provider=custom` pointing to a non-Ollama endpoint who has
  `reasoning_effort` set to `none` or has disabled reasoning.
- **Not affected:** Ollama users (fix preserves existing behaviour).
- **Not affected:** OpenRouter, Anthropic, AWS Bedrock — these use separate code paths.

---

## Files Changed

| File | Change |
|---|---|
| `run_agent.py` | Replace `provider == "custom"` guard with Ollama URL detection |

---

### Are you willing to submit a PR for this?

- [ ] I'd like to fix this myself and submit a PR

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bug]: `think=False` incorrectly sent to all `provider=custom` endpoints, not just Ollama #11237

Bug Description

Summary

Acceptance Criteria

Labels

Steps to Reproduce

Expected Behavior

Actual Behavior

Affected Component

Messaging Platform (if gateway-related)

Debug Report

Operating System

Python Version

Hermes Version

Additional Logs / Traceback (optional)

Root Cause Analysis (optional)

Proposed Fix (optional)

Fix

Impact

Files Changed

Are you willing to submit a PR for this?

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

[Bug]: think=False incorrectly sent to all provider=custom endpoints, not just Ollama #11237

Description

Bug Description

Summary

Acceptance Criteria

Labels

Steps to Reproduce

Expected Behavior

Actual Behavior

Affected Component

Messaging Platform (if gateway-related)

Debug Report

Operating System

Python Version

Hermes Version

Additional Logs / Traceback (optional)

Root Cause Analysis (optional)

Proposed Fix (optional)

Fix

Impact

Files Changed

Are you willing to submit a PR for this?

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions

[Bug]: `think=False` incorrectly sent to all `provider=custom` endpoints, not just Ollama #11237