Skip to content

fix(agent): use standard MiniMax-M2.7 for auxiliary model instead of highspeed#4115

Closed
shannonsands wants to merge 1 commit into
mainfrom
fix/minimax-aux-model-pricing
Closed

fix(agent): use standard MiniMax-M2.7 for auxiliary model instead of highspeed#4115
shannonsands wants to merge 1 commit into
mainfrom
fix/minimax-aux-model-pricing

Conversation

@shannonsands

Copy link
Copy Markdown
Contributor

What does this PR do?

Changes the default MiniMax auxiliary model from MiniMax-M2.7-highspeed to MiniMax-M2.7. The highspeed variant is the same model running on faster hardware at exactly 2x the price — users with MiniMax as their provider are silently paying double for every auxiliary call (summarization, compression, vision).

Variant Input Output Throughput
MiniMax-M2.7 (standard) $0.30/M $1.20/M ~33 tok/s
MiniMax-M2.7-highspeed $0.60/M $2.40/M ~51 tok/s

For auxiliary side tasks where latency is not critical, the standard tier is the correct default.

Source: OpenRouter pricing, MiniMax pricing docs

Related Issue

Fixes #4082

Type of Change

  • 🐛 Bug fix (non-breaking change that fixes an issue)
  • ✨ New feature (non-breaking change that adds functionality)
  • 🔒 Security fix
  • 📝 Documentation update
  • ✅ Tests (adding or improving test coverage)
  • ♻️ Refactor (no behavior change)
  • 🎯 New skill (bundled or hub)

Changes Made

  • agent/auxiliary_client.py lines 59-60: MiniMax-M2.7-highspeedMiniMax-M2.7 for both minimax and minimax-cn entries in _API_KEY_PROVIDER_AUX_MODELS

How to Test

  1. Configure MiniMax as your provider
  2. Send a message that triggers an auxiliary call (e.g. context compression, web extract summarization)
  3. Verify the auxiliary request targets MiniMax-M2.7 (not MiniMax-M2.7-highspeed) in logs
  4. Confirm API billing reflects standard tier pricing

Checklist

Code

  • I've read the Contributing Guide
  • My commit messages follow Conventional Commits
  • I searched for existing PRs to make sure this isn't a duplicate
  • My PR contains only changes related to this fix (no unrelated commits)
  • I've run pytest tests/ -q and all tests pass — N/A (config value change, no logic affected)
  • I've added tests for my changes — N/A
  • I've tested on my platform: macOS 15 (Apple Silicon)

Documentation & Housekeeping

  • I've updated relevant documentation — N/A
  • I've updated cli-config.yaml.example — N/A
  • I've updated CONTRIBUTING.md or AGENTS.md — N/A
  • I've considered cross-platform impact — model name string, platform-agnostic
  • I've updated tool descriptions/schemas — N/A

…highspeed

MiniMax-M2.7-highspeed is the same model running on faster hardware at
2x the price ($0.60/$2.40 per M tokens vs $0.30/$1.20). For auxiliary
tasks like summarization and compression where throughput is not critical,
the standard variant is the correct default.

Closes #4082
@teknium1

teknium1 commented Apr 8, 2026

Copy link
Copy Markdown
Contributor

Fixed in PR #6082 — aux model switched from highspeed to standard. Thanks!

@teknium1 teknium1 closed this Apr 8, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Bug]: Minimax Default Auxiallry Model MiniMax-M2.7-highspeed is twice the price at API rates

2 participants