
Commit 609171d

aallan and claude committed
Promote K2.6 to flagship, demote K2.5 to sonnet
The previous comment defended placing K2.6 in the sonnet slot 'for now' on the grounds that K2.6 is the new flagship — which is self-contradictory if you read it twice. Moonshot promoted K2.6 to its flagship line; the chart tier should reflect that.

K2.5 moves to sonnet, joining Claude Sonnet 4 and GPT-4o as 'previous-generation / secondary slot from each provider' — a tighter semantic grouping than the prior mix anyway.

No code or test surface affected; chart generation is unchanged in mechanics. Registry-only change.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
1 parent b35e4da commit 609171d

1 file changed

Lines changed: 7 additions & 8 deletions


scripts/plot_results.py

@@ -93,18 +93,17 @@ class ModelSpec:
 
 
 MODELS: list[ModelSpec] = [
-    # Flagship row
+    # Flagship row — current top-tier model from each provider.
     ModelSpec("Claude Opus 4", "claude-opus-4-20250514", "flagship"),
     ModelSpec("GPT-4.1", "gpt-4.1-2025-04-14", "flagship"),
-    ModelSpec("Kimi K2.5", "moonshot-kimi-k2.5", "flagship"),
-    # Sonnet row
+    ModelSpec("Kimi K2.6", "moonshot-kimi-k2.6", "flagship"),
+    # Sonnet row — previous-generation / secondary slot from each provider.
+    # Kimi K2.5 moves here from flagship after Moonshot promoted K2.6 to
+    # the active flagship-line slot (kimi-k2-turbo-preview deprecated
+    # 2026-05-25, see #68).
     ModelSpec("Claude Sonnet 4", "claude-sonnet-4-20250514", "sonnet"),
     ModelSpec("GPT-4o", "gpt-4o", "sonnet"),
-    # K2.6 lands in the sonnet slot for now (replacing kimi-k2-turbo-preview,
-    # deprecated 2026-05-25). Semantically K2.6 is the new flagship-line model
-    # rather than a "secondary/cheaper" variant; tier placement to be revisited
-    # in the next re-sweep — see issue #68.
-    ModelSpec("Kimi K2.6", "moonshot-kimi-k2.6", "sonnet"),
+    ModelSpec("Kimi K2.5", "moonshot-kimi-k2.5", "sonnet"),
 ]
 
 # Mode label -> glob pattern fragment inserted between prefix and bench-VER.
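
For readers who want to see how the registry's tier field translates into chart rows after this change, here is a minimal sketch. It is not taken from scripts/plot_results.py: the `ModelSpec` field names (`label`, `model_id`, `tier`) and the `group_by_tier` helper are assumptions for illustration, since the diff only shows three positional arguments per entry.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelSpec:
    # Field names are assumed; the diff only shows three positional values.
    label: str
    model_id: str
    tier: str


# Registry contents as they stand after this commit.
MODELS: list[ModelSpec] = [
    ModelSpec("Claude Opus 4", "claude-opus-4-20250514", "flagship"),
    ModelSpec("GPT-4.1", "gpt-4.1-2025-04-14", "flagship"),
    ModelSpec("Kimi K2.6", "moonshot-kimi-k2.6", "flagship"),
    ModelSpec("Claude Sonnet 4", "claude-sonnet-4-20250514", "sonnet"),
    ModelSpec("GPT-4o", "gpt-4o", "sonnet"),
    ModelSpec("Kimi K2.5", "moonshot-kimi-k2.5", "sonnet"),
]


def group_by_tier(models: list[ModelSpec]) -> dict[str, list[str]]:
    """Hypothetical helper: map each tier to its model labels, in registry order."""
    rows: dict[str, list[str]] = defaultdict(list)
    for spec in models:
        rows[spec.tier].append(spec.label)
    return dict(rows)


rows = group_by_tier(MODELS)
for tier, labels in rows.items():
    print(f"{tier}: {', '.join(labels)}")
```

Under these assumptions, each tier ends up with exactly one model per provider, which is the grouping the commit message describes.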
