Skip to content

feat: add qwen/qwen3.6-plus (paid) to OpenRouter and KiloCode model lists#6599

Open
iRonin wants to merge 1 commit into
NousResearch:mainfrom
iRonin:feat/add-qwen36-plus-paid
Open

feat: add qwen/qwen3.6-plus (paid) to OpenRouter and KiloCode model lists#6599
iRonin wants to merge 1 commit into
NousResearch:mainfrom
iRonin:feat/add-qwen36-plus-paid

Conversation

@iRonin

@iRonin iRonin commented Apr 9, 2026

Copy link
Copy Markdown
Contributor

Summary

Add qwen/qwen3.6-plus (paid tier) to the model catalogs for both OpenRouter and KiloCode providers. The :free variant remains listed alongside it.

Changes

hermes_cli/models.py

  • Added qwen/qwen3.6-plus to OPENROUTER_MODELS with descriptor "1M context, thinking"
  • Added qwen/qwen3.6-plus to the kilocode provider list in _PROVIDER_MODELS

agent/model_metadata.py

  • Added "qwen3.6-plus": 1048576 to DEFAULT_CONTEXT_LENGTHS (1M context)

Model Specs

  • Context: 1,048,576 tokens (1M)
  • Thinking: Supported (extended thinking / reasoning)
  • Pricing: Fetched live from OpenRouter /v1/models endpoint
  • Availability: OpenRouter + KiloCode

Both providers support this model ID natively -- no additional normalisation needed beyond the existing qwen vendor prefix detection in model_normalize.py.

@iRonin

iRonin commented Apr 9, 2026

Copy link
Copy Markdown
Contributor Author

Closes #6601

iRonin added a commit to iRonin/hermes-agent-nous that referenced this pull request Apr 9, 2026
Preserves 58 fork feature commits aligned with v2026.4.8 upstream:
- Dual queue system (priority/standard)
- Subagent panel UI with /subagents command
- Multi-stash management
- Terminal title customization
- Per-skill model routing (model/provider/skill/skills)
- Browser tool enhancements with user-data-dir support
- Async delegation infrastructure
- Config system extensions (config.yaml, hermes_cli/config.py)

Conflict resolutions:
- delegate_tool.py: merged fork's schema (model/provider/skill/skills)
  with upstream's ACP support (acp_command/acp_args)
- cli.py: resolved Chrome candidate method naming conflict
- api_server.py: merged /v1/sessions endpoint documentation

Includes PR NousResearch#6599: Qwen 3.6-Plus support (1M context window)
iRonin added a commit to iRonin/hermes-agent-nous that referenced this pull request Apr 9, 2026
All custom features merged onto v2026.4.8 + PR NousResearch#6599 (Qwen 3.6-Plus):
- Terminal title chain (12 commits)
- Ctrl+D delete char, double ESC clear, Ctrl+P peek
- Arrow keys free cursor, Up/Down history
- Ctrl+P history pager, Ctrl+O full message toggle
- Subagent control panel (Ctrl+X)
- Multi-item stash with panel, stash_auto_restore
- Dual queue: followup (Alt+Enter) + steering dispatch
- Per-skill model routing (model/provider/skill/skills)
- Gateway: /v1/sessions, model override
- Browser: /browser connect with Chrome profile
- Session: /resume picker, session title
- Async delegation (delegate_task_async tools)

Conflicts resolved using previously-resolved versions.
@iRonin iRonin force-pushed the feat/add-qwen36-plus-paid branch from cf040be to cc277e9 Compare April 11, 2026 18:49
Add the paid tier of qwen3.6-plus alongside the existing :free variant
to the OpenRouter model list and KiloCode provider list. Also add
1M context (1,048,576) to the default context length lookup.
@iRonin iRonin force-pushed the feat/add-qwen36-plus-paid branch from cc277e9 to 82bb1ef Compare April 11, 2026 20:37
@alt-glitch alt-glitch added type/feature New feature or request P3 Low — cosmetic, nice to have comp/cli CLI entry point, hermes_cli/, setup wizard comp/agent Core agent loop, run_agent.py, prompt builder provider/openrouter OpenRouter aggregator provider/qwen Qwen / Alibaba Cloud (OAuth) labels Apr 30, 2026
@alt-glitch

Copy link
Copy Markdown
Collaborator

Implements #6601. Related closed PR: #8481.

teknium1 pushed a commit that referenced this pull request May 17, 2026
qwen3.6-plus did not have an explicit entry in DEFAULT_CONTEXT_LENGTHS,
so the longest-substring fallback matched the generic 'qwen': 131072
catch-all. That dropped the effective context limit from 1,048,576
tokens to 131,072, prematurely lowered the compression threshold, and
produced misleading warnings about main/compression context mismatch
in long sessions.

Add an explicit 'qwen3.6-plus': 1048576 entry before the catch-all and
cover it with a regression test (bare, qwen/, and dashscope/ prefixes).

Note: PR #6599 also mentions touching model_metadata.py but the actual
diff only edits hermes_cli/models.py, so this fix is independent and
not duplicated by that PR.

Closes #27008
gweeteve pushed a commit to gweeteve/hermes-agent that referenced this pull request Jun 2, 2026
qwen3.6-plus did not have an explicit entry in DEFAULT_CONTEXT_LENGTHS,
so the longest-substring fallback matched the generic 'qwen': 131072
catch-all. That dropped the effective context limit from 1,048,576
tokens to 131,072, prematurely lowered the compression threshold, and
produced misleading warnings about main/compression context mismatch
in long sessions.

Add an explicit 'qwen3.6-plus': 1048576 entry before the catch-all and
cover it with a regression test (bare, qwen/, and dashscope/ prefixes).

Note: PR NousResearch#6599 also mentions touching model_metadata.py but the actual
diff only edits hermes_cli/models.py, so this fix is independent and
not duplicated by that PR.

Closes NousResearch#27008
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

comp/agent Core agent loop, run_agent.py, prompt builder comp/cli CLI entry point, hermes_cli/, setup wizard P3 Low — cosmetic, nice to have provider/openrouter OpenRouter aggregator provider/qwen Qwen / Alibaba Cloud (OAuth) type/feature New feature or request

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants