fix(agent): use standard MiniMax-M2.7 for auxiliary model instead of highspeed by shannonsands · Pull Request #4115 · NousResearch/hermes-agent

shannonsands · 2026-03-31T00:56:20Z

What does this PR do?

Changes the default MiniMax auxiliary model from MiniMax-M2.7-highspeed to MiniMax-M2.7. The highspeed variant is the same model running on faster hardware at exactly 2x the price — users with MiniMax as their provider are silently paying double for every auxiliary call (summarization, compression, vision).

Variant	Input	Output	Throughput
MiniMax-M2.7 (standard)	$0.30/M	$1.20/M	~33 tok/s
MiniMax-M2.7-highspeed	$0.60/M	$2.40/M	~51 tok/s

For auxiliary side tasks where latency is not critical, the standard tier is the correct default.

Source: OpenRouter pricing, MiniMax pricing docs

Related Issue

Fixes #4082

Type of Change

🐛 Bug fix (non-breaking change that fixes an issue)
✨ New feature (non-breaking change that adds functionality)
🔒 Security fix
📝 Documentation update
✅ Tests (adding or improving test coverage)
♻️ Refactor (no behavior change)
🎯 New skill (bundled or hub)

Changes Made

agent/auxiliary_client.py lines 59-60: MiniMax-M2.7-highspeed → MiniMax-M2.7 for both minimax and minimax-cn entries in _API_KEY_PROVIDER_AUX_MODELS

How to Test

Configure MiniMax as your provider
Send a message that triggers an auxiliary call (e.g. context compression, web extract summarization)
Verify the auxiliary request targets MiniMax-M2.7 (not MiniMax-M2.7-highspeed) in logs
Confirm API billing reflects standard tier pricing

Checklist

Code

I've read the Contributing Guide
My commit messages follow Conventional Commits
I searched for existing PRs to make sure this isn't a duplicate
My PR contains only changes related to this fix (no unrelated commits)
I've run pytest tests/ -q and all tests pass — N/A (config value change, no logic affected)
I've added tests for my changes — N/A
I've tested on my platform: macOS 15 (Apple Silicon)

Documentation & Housekeeping

I've updated relevant documentation — N/A
I've updated cli-config.yaml.example — N/A
I've updated CONTRIBUTING.md or AGENTS.md — N/A
I've considered cross-platform impact — model name string, platform-agnostic
I've updated tool descriptions/schemas — N/A

…highspeed MiniMax-M2.7-highspeed is the same model running on faster hardware at 2x the price ($0.60/$2.40 per M tokens vs $0.30/$1.20). For auxiliary tasks like summarization and compression where throughput is not critical, the standard variant is the correct default. Closes #4082

teknium1 · 2026-04-08T09:21:08Z

Fixed in PR #6082 — aux model switched from highspeed to standard. Thanks!

kshitijk4poor mentioned this pull request Apr 8, 2026

fix(minimax): correct context lengths, model catalog, thinking guard, aux model, and config base_url #6046

Closed

8 tasks

teknium1 mentioned this pull request Apr 8, 2026

fix(minimax): correct context lengths, model catalog, thinking guard, aux model, and config base_url #6082

Merged

teknium1 closed this Apr 8, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(agent): use standard MiniMax-M2.7 for auxiliary model instead of highspeed#4115

fix(agent): use standard MiniMax-M2.7 for auxiliary model instead of highspeed#4115
shannonsands wants to merge 1 commit into
mainfrom
fix/minimax-aux-model-pricing

shannonsands commented Mar 31, 2026

Uh oh!

teknium1 commented Apr 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

shannonsands commented Mar 31, 2026

What does this PR do?

Related Issue

Type of Change

Changes Made

How to Test

Checklist

Code

Documentation & Housekeeping

Uh oh!

teknium1 commented Apr 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants