Context
The cookbook architecture-demos spec tracks StarCoder2 as status: blocked. Issue #311 ("qualify: SafeTensors non-LLaMA architecture support (GPT-2, GPT-NeoX, OPT, StarCoder, BERT)", closed 2026-02-27) added SafeTensors handling for several non-LLaMA architectures but didn't ship a StarCoder2-specific loader YAML.
Family
- Name: starcoder2
- Vendor: BigCode
- HF architectures:
Starcoder2ForCausalLM
- HF pattern:
bigcode/starcoder2-*
- Reference checkpoints:
bigcode/starcoder2-3b, bigcode/starcoder2-7b, bigcode/starcoder2-15b
Acceptance criteria
Unblock impact
Cookbook reference
- manifest.yaml entry —
name: starcoder2 block
- Predecessor: aprender#311 (added SafeTensors but not the loader)
Context
The cookbook architecture-demos spec tracks StarCoder2 as
status: blocked. Issue #311 ("qualify: SafeTensors non-LLaMA architecture support (GPT-2, GPT-NeoX, OPT, StarCoder, BERT)", closed 2026-02-27) added SafeTensors handling for several non-LLaMA architectures but didn't ship a StarCoder2-specific loader YAML.Family
Starcoder2ForCausalLMbigcode/starcoder2-*bigcode/starcoder2-3b,bigcode/starcoder2-7b,bigcode/starcoder2-15bAcceptance criteria
contracts/model-families/starcoder2.yamlexists with size_variants for 3b, 7b, 15brope_theta=1000000(similar to Llama-3 long-context)Starcoder2ForCausalLMarchitecture string + GQA pattern)starcoder2-3b(smallest variant)Unblock impact
blockedtocertifiedCookbook reference
name: starcoder2block