
Non-record: H-Net Dynamic Chunking — Learned Tokenization Layer (val_bpb 1.3587)#1191

Open

dentity007 wants to merge 3 commits into openai:main from NathanMaine:research/hnet-chunking

Conversation

@dentity007

Summary

Adds a learned dynamic chunking layer (inspired by H-Net) to the standard transformer baseline. The chunker predicts a soft boundary probability between each pair of adjacent token embeddings and, where that probability is low, blends each embedding with its neighbor — a differentiable approximation of H-Net's hard chunking.
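The soft-blending step described above can be sketched as follows. This is a minimal numpy illustration of the mechanism, not the PR's actual module: the function name, the use of a left-neighbor average, and the sigmoid parameterization are all assumptions.

```python
import numpy as np

def soft_chunk(x, boundary_logits):
    """Differentiable approximation of hard chunking (hypothetical sketch).

    x: (T, D) token embeddings.
    boundary_logits: (T,) scores for a boundary *before* each position.

    Where the boundary probability is high, the embedding passes through
    unchanged; where it is low, the embedding is blended with its left
    neighbor, softly merging the two positions into one chunk.
    """
    b = 1.0 / (1.0 + np.exp(-boundary_logits))   # sigmoid -> boundary prob in [0, 1]
    prev = np.roll(x, 1, axis=0)
    prev[0] = x[0]                               # position 0 has no left neighbor
    return b[:, None] * x + (1.0 - b[:, None]) * 0.5 * (x + prev)
```

With strongly positive logits every position is its own chunk and the layer reduces to the identity, which is the behavior the control run relies on.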

val_bpb: 1.3587 | 1×RTX 5090, 600s | TTT enabled

  • Nearly matches baseline (1.3577) with only ~263K extra parameters
  • Demonstrates learned tokenization is viable for parameter-constrained models
  • Implements one of OpenAI's explicitly requested research directions (H-net tokenization)
  • Setting HNET_ENABLED=0 produces identical behavior to base script

Changes

  • DynamicChunker module: boundary prediction + soft blending
  • Inserted after embedding norm, before transformer blocks
  • New env vars: HNET_ENABLED, HNET_LAYERS
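The env vars above might be wired up along these lines. The defaults and parsing here are assumptions, not the PR's actual code; the key property (stated in the summary) is that `HNET_ENABLED=0` must reproduce the baseline exactly, so the chunker should only be constructed when the flag is on.

```python
import os

def hnet_config(env=os.environ):
    """Hypothetical parsing of the PR's env vars (names from the PR;
    defaults assumed). Returns (enabled, num_chunker_layers)."""
    enabled = env.get("HNET_ENABLED", "1") != "0"
    layers = int(env.get("HNET_LAYERS", "1"))
    return enabled, layers
```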

Test plan

  • Verified on 1×RTX 5090 (600s wallclock)
  • Control run with HNET_ENABLED=0 matches baseline
  • Model fits in 16MB after int8+zlib compression
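The 16MB size check could be reproduced with something like the sketch below. The per-tensor symmetric int8 quantization scheme is an assumption — the PR only states "int8+zlib", not the exact scheme.

```python
import zlib
import numpy as np

def compressed_size_bytes(tensors):
    """Rough int8+zlib size estimate (sketch; per-tensor symmetric
    int8 quantization assumed, which may differ from the PR's scheme)."""
    total = 0
    for w in tensors:
        scale = max(float(np.abs(w).max()) / 127.0, 1e-12)
        q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
        total += len(zlib.compress(q.tobytes(), level=9))
    return total
```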

🤖 Generated with Claude Code

@dentity007 dentity007 closed this Apr 1, 2026
@dentity007 dentity007 reopened this Apr 1, 2026