Summary
APR format inference shows "[N tokens generated, tokenizer not found]" instead of decoded text.
Evidence
apr run model.apr --prompt "What is 2+2?"
Output: "[10 tokens generated, tokenizer not found]"
Root Cause
APR format doesn't bundle the tokenizer. The AprV2Model::load_tokenizer() returns None.
Acceptance Criteria
Proposed Fix
- Option A: Embed tokenizer.json in APR format (increases file size)
- Option B: Load tokenizer from sibling file (requires tokenizer.json next to .apr)
- Option C: Download tokenizer from HuggingFace if model has hf_repo metadata
Files to Modify
realizar/src/apr/mod.rs - AprV2Model::load_tokenizer
realizar/src/infer/mod.rs - run_apr_inference
aprender/src/format/apr.rs - APR format spec
Labels
bug, P1, tokenizer, apr-format
Summary
APR format inference shows "[N tokens generated, tokenizer not found]" instead of decoded text.
Evidence
Root Cause
APR format doesn't bundle the tokenizer. The
AprV2Model::load_tokenizer()returns None.Acceptance Criteria
apr run model.aprshows decoded text outputProposed Fix
Files to Modify
realizar/src/apr/mod.rs- AprV2Model::load_tokenizerrealizar/src/infer/mod.rs- run_apr_inferenceaprender/src/format/apr.rs- APR format specLabels
bug, P1, tokenizer, apr-format