
SOTA attempt (val_bpb=1.2064) #49

Merged
0hq merged 2 commits into openai:main from spokane-way:main on Mar 19, 2026

Conversation

@spokane-way
Contributor

@spokane-way spokane-way commented Mar 19, 2026

  • SEED=1337: 1.20576485
  • SEED=1338: 1.2061746
  • SEED=1339: 1.20715923
  • Sample mean across the three runs: 1.20636623 (reproduced in the snippet below)
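
For reference, the headline number is just the arithmetic mean of the three per-seed results above; a minimal, illustrative Python snippet (seed/value pairs copied from this comment, everything else hypothetical) reproduces it:

```python
from statistics import mean

# Per-seed val_bpb results reported in this PR.
runs = {1337: 1.20576485, 1338: 1.2061746, 1339: 1.20715923}

print(f"val_bpb mean across {len(runs)} seeds: {mean(runs.values()):.8f}")
# -> val_bpb mean across 3 seeds: 1.20636623
```

Rounded to four decimals, this gives the 1.2064 in the PR title.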

@spokane-way spokane-way changed the title from SOTA attempt (val_bpb=1.2166) to SOTA attempt (val_bpb=1.20637) on Mar 19, 2026
@spokane-way spokane-way changed the title from SOTA attempt (val_bpb=1.20637) to SOTA attempt (val_bpb=1.2064) on Mar 19, 2026
@0hq 0hq closed this Mar 19, 2026
@0hq 0hq reopened this Mar 19, 2026
@0hq
Contributor

0hq commented Mar 19, 2026

Great, thanks!

@0hq 0hq merged commit e89fcf8 into openai:main Mar 19, 2026
maxivione pushed a commit to maxivione/parameter-golf that referenced this pull request Mar 20, 2026
* SOTA attempt

* Improve score on SXM

---------

Co-authored-by: spokane-way <spokane@way>
scottspace pushed a commit to scottspace/parameter-golf that referenced this pull request Mar 21, 2026
* SOTA attempt

* Improve score on SXM

---------

Co-authored-by: spokane-way <spokane@way>
nedcut pushed a commit to nedcut/parameter-golf that referenced this pull request Mar 26, 2026
* SOTA attempt

* Improve score on SXM

---------

Co-authored-by: spokane-way <spokane@way>
taka6745 pushed a commit to taka6745/parameter-golf that referenced this pull request Apr 7, 2026
…ity plateau confirmed

Patches 15/16/21 still uncontested in 150+ open + 10 closed PRs (6 consecutive
audits). PR openai#1430 stable OPEN, 0 comments, no comp owner activity for 16h.

After 13 research fires and 6 audits, the picture is clear: training-time
tweaks are exhausted at our 22M/1500-step scale. All 4 post-fire-9 ports
(Mousse/MuonEq-R/Depth Recurrence/QK_GAIN=5.0) are neutral within the
champion noise band. The "neutrality plateau" at 3.27-3.30 is the empirical
ceiling for training-time changes at our compute budget.

Best remaining moves (in expected value order):
1. H100 escalation of CHAMP_L4_seed42+EL stack with EMA+Tilt+INT6 GPTQ bundle
2. Coprime stride implementation (task openai#58) — only data-side direction (sketched after this commit message)
3. BPE-8192 ngram tables build (task openai#49) — enables tokenizer A/B

Spend ~$3.55/$36 (10% utilization). Pod healthy at 7h uptime.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
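
The "coprime stride" direction in item 2 of the list above is not specified further in this thread; a common form of the idea is to walk a dataset of length N with a fixed stride s where gcd(s, N) = 1, which visits every offset exactly once in a scrambled order without materializing a permutation. A minimal sketch under that assumption (all names hypothetical):

```python
from math import gcd

def coprime_stride_order(n: int, stride: int, start: int = 0):
    """Yield each index in [0, n) exactly once by stepping a fixed stride.

    Full coverage holds iff gcd(stride, n) == 1, i.e. the stride is
    coprime with the dataset length.
    """
    if gcd(stride, n) != 1:
        raise ValueError("stride must be coprime with n for full coverage")
    idx = start % n
    for _ in range(n):
        yield idx
        idx = (idx + stride) % n

# Example: 10 sample offsets walked with stride 7 (gcd(7, 10) == 1).
print(list(coprime_stride_order(10, 7)))
# -> [0, 7, 4, 1, 8, 5, 2, 9, 6, 3]
```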
taka6745 pushed a commit to taka6745/parameter-golf that referenced this pull request Apr 7, 2026
…ker identified

First tokenizer-side fire (0/24 patches in this category). Subagent found 3
candidates (BPE-Dropout, Complementary Weighting, Three-Tier Classification)
but ALL are blocked by our pre-tokenized .bin file pipeline.

BPE-Dropout requires live re-tokenization at training time → infeasible.
The Complementary Weighting subagent incorrectly cited our MLX prototype,
not the H100 train_gpt.py. Three-Tier Classification is PR openai#1402,
pending validation.

Architectural insight: SP1024 may actually be optimal for our 22M architecture
(smaller embedding = more params for model body). Top PRs use SP8192 because
their depth-recurrence stack benefits from finer tokens. We may not need
BPE-8192. Task openai#49 deferred indefinitely.

Cross-domain coverage update (16 fires):
  training: 5, optimizer: 2, eval: 3, compression: 1, data: 2, tokenizer: 1,
  hardware: 0. Hardware still uncovered.

Per user instruction: queued, not shipped. No code patches.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
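
The "pre-tokenized .bin file pipeline" blocker in the commit above is easy to make concrete. In loaders of this shape (a sketch in the nanoGPT style, not the repository's actual train_gpt.py), token IDs are frozen into a flat binary file at build time and training only memory-maps those fixed IDs, so a technique like BPE-Dropout, which must re-segment raw text stochastically every time a sample is drawn, has no place to run:

```python
import numpy as np
import torch

def get_batch(bin_path: str, batch_size: int, block_size: int):
    """Sample input/target batches from a pre-tokenized .bin file.

    The token IDs were fixed once when the .bin was built, so any
    tokenizer-side randomness (e.g. BPE-Dropout) would have to happen
    upstream, before this file exists -- not here at training time.
    """
    data = np.memmap(bin_path, dtype=np.uint16, mode="r")
    starts = torch.randint(len(data) - block_size - 1, (batch_size,))
    x = torch.stack([torch.from_numpy(data[i:i + block_size].astype(np.int64)) for i in starts])
    y = torch.stack([torch.from_numpy(data[i + 1:i + 1 + block_size].astype(np.int64)) for i in starts])
    return x, y
```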
