docs: increase MLX smoke validation batch size #36
brendanboyle87 wants to merge 1 commit into openai:main
Conversation
- add a PR-audit research log entry covering the clean takeaways from pull requests openai#36 through openai#70
- promote long-context training plus matching long-context eval as a first-class clean branch based on PR openai#61 and PR openai#63
- refine mixed-precision export notes to emphasize using int6/int8 byte savings to fund wider MLP capacity, based on PR openai#65
- update the current snapshot and research thesis so future agents do not over-focus on exporter-only ideas after the broader PR sweep
??? this is increasing val batch size??
Sorry if I was off base here. This was based on the fact that this script is for local MLX dev; there was no intermediate output, so I was trying to figure out how long validation would take. Codex gave an estimate in hours vs. minutes: "On this machine, a full validation with the old VAL_BATCH_SIZE=8192 is roughly a 5 to 6+ hour job. With VAL_BATCH_SIZE=524288, it is about 5 minutes. The reason is in train_gpt_mlx.py:766: validation uses VAL_BATCH_SIZE // GRAD_ACCUM_STEPS. With GRAD_ACCUM_STEPS=8 and TRAIN_SEQ_LEN=1024, 8192 means only 1024 eval tokens per batch, which is exactly 1 sequence. 524288 means 65536 eval tokens, or 64 sequences per batch."
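The arithmetic in that estimate can be sketched in a few lines. This is not the actual code from train_gpt_mlx.py, just an illustrative helper using the constant names and values quoted in the comment above:

```python
# Sketch of the eval-batch arithmetic described in the comment above.
# GRAD_ACCUM_STEPS and TRAIN_SEQ_LEN mirror the settings quoted from
# train_gpt_mlx.py; eval_sequences_per_batch is a hypothetical helper.
GRAD_ACCUM_STEPS = 8
TRAIN_SEQ_LEN = 1024

def eval_sequences_per_batch(val_batch_size: int) -> int:
    """Number of sequences evaluated per validation batch."""
    # Validation divides the batch size across gradient-accumulation steps,
    # then each sequence consumes TRAIN_SEQ_LEN tokens.
    eval_tokens = val_batch_size // GRAD_ACCUM_STEPS
    return eval_tokens // TRAIN_SEQ_LEN

print(eval_sequences_per_batch(8192))    # old default: 1 sequence per batch
print(eval_sequences_per_batch(524288))  # proposed value: 64 sequences per batch
```

With only 1 sequence per validation batch, the old default forces far more batches (and hours of wall time) to cover the validation set; 64 sequences per batch brings it down to minutes.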
Summary
VAL_BATCH_SIZE=524288
Why
The default validation batch size in the README trial run takes a very long time on a local Mac (an M4 Max Mac Studio with 128GB), so this raises the documented MLX smoke-test value to a more practical local setting.