fix(vision): resolve Nous vision model correctly in auto-detect path by Ifkellx · Pull Request #12683 · NousResearch/hermes-agent

Ifkellx · 2026-04-19T20:37:31Z

Problem

The vision auto-detect chain calls resolve_provider_client() with the vision model from _PROVIDER_VISION_MODELS, but resolve_provider_client() always calls _try_nous() without vision=True. As a result, the Nous backend returns the default text model instead of the vision-capable xiaomi/mimo-v2-omni, and the Nous inference API responds with 404 errors when images are sent.

Additionally, _PROVIDER_VISION_MODELS was missing an entry for the nous provider.

Root Cause

The auto-detect path in resolve_vision_provider_client():

1. Looks up _PROVIDER_VISION_MODELS.get("nous") → returns xiaomi/mimo-v2-omni
2. Calls resolve_provider_client("nous", model="xiaomi/mimo-v2-omni")
3. resolve_provider_client calls _try_nous() without vision=True
4. _try_nous() ignores the passed model and returns the default text model

The fallback path (_resolve_strict_vision_backend) worked correctly because it called _try_nous(vision=True) directly.
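The difference between the two paths can be modelled in a few lines. This is a minimal sketch, not the hermes-agent code: the functions below are simplified stand-ins that return a model name instead of a client, their signatures are assumptions, and xiaomi/mimo-v2-pro is used as the default text model only because this PR mentions it as an example main model.

```python
from typing import Optional

# Simplified stand-ins for illustration. The real functions build API clients
# and take more parameters; only the model selection is modelled here.
DEFAULT_TEXT_MODEL = "xiaomi/mimo-v2-pro"  # assumed default text model
VISION_MODEL = "xiaomi/mimo-v2-omni"

_PROVIDER_VISION_MODELS = {"nous": VISION_MODEL}


def _try_nous(vision: bool = False) -> str:
    """Return the model the Nous backend would use."""
    return VISION_MODEL if vision else DEFAULT_TEXT_MODEL


def resolve_provider_client_before_fix(provider: str, model: Optional[str] = None) -> str:
    """Pre-fix behaviour: the requested model never reaches _try_nous()."""
    if provider != "nous":
        raise ValueError(f"unsupported provider in this sketch: {provider}")
    return _try_nous()  # vision=True is not passed, so `model` is ignored


def auto_detect_path(provider: str) -> str:
    """Auto-detect path: look up the vision model, then resolve the client."""
    model = _PROVIDER_VISION_MODELS.get(provider)
    return resolve_provider_client_before_fix(provider, model=model)


def strict_fallback_path() -> str:
    """Fallback path: calls _try_nous(vision=True) directly."""
    return _try_nous(vision=True)


if __name__ == "__main__":
    print("auto-detect:", auto_detect_path("nous"))
    print("fallback:   ", strict_fallback_path())
```

In this model the auto-detect path resolves to the text model even though the vision model was requested, while the fallback path resolves to the vision model.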

Fix

_PROVIDER_VISION_MODELS: Added "nous": "xiaomi/mimo-v2-omni" entry so the vision auto-detect chain picks the correct multimodal model.
resolve_provider_client: Auto-detects vision tasks by checking if the requested model matches a value in _PROVIDER_VISION_MODELS or is a known vision model name, then passes vision=True to _try_nous().

Verification

xiaomi/mimo-v2-omni returns HTTP 200 with image inputs on Nous inference API
google/gemini-3-flash-preview returns 404 with image inputs on Nous inference API
On free-tier Nous accounts only Xiaomi models are available, which makes this fix essential

Impact

Fixes browser_vision and vision_analyze tools for all Hermes users on Nous (both free and paid tiers).

When Nous Research is the main provider, vision tasks fail with 404 because _PROVIDER_VISION_MODELS has no entry for 'nous'. The auto-detect falls back to the main model (e.g. xiaomi/mimo-v2-pro) which doesn't support images, or to google/gemini-3-flash-preview which Nous also rejects for image inputs. This adds 'nous': 'xiaomi/mimo-v2-omni' to the vision model map, which is the multimodal model available on Nous inference API and confirmed to work with image inputs (HTTP 200). Closes vision failures for all Nous provider users.

Ifkellx closed this Apr 19, 2026

Ifkellx changed the title ~~fix(vision): add nous provider to vision model map~~ fix(vision): resolve Nous vision model correctly in auto-detect path Apr 19, 2026

alt-glitch mentioned this pull request Apr 21, 2026

fix(vision): route Nous main-provider vision through tier-aware backend #13696

Merged

19 tasks

Ifkellx wants to merge 1 commit into NousResearch:main from Ifkellx:fix/nous-vision-model
