Refactor core GGUF loading to use a shared backend-aware reader policy across Qwen and LFM2 models by zinyando · Pull Request #70 · izwi-ai/izwi

zinyando · 2026-03-05T19:08:45Z

Add centralized GGUF mmap/reader primitives, route Qwen3/Qwen3.5/LFM2 text loaders, Qwen3.5 mmproj loading, and shared GGUF utilities through them, unify device-to-backend mapping, and remove the unused mmproj_inspect binary.

…g through backend policy

zinyando added 5 commits March 5, 2026 20:30

core: add centralized GGUF mmap policy primitives

654a47f

core: add shared GGUF reader with mmap fallback

d86c6b0

core: route qwen35 gguf loading through backend io policy

c479d6e

core: centralize qwen3 and lfm2 gguf reader policy

10fc24a

refactor(core): unify device-to-backend mapping and route GGUF loadin…

b9e5e1a

…g through backend policy

zinyando merged commit 13b39d6 into main Mar 5, 2026

zinyando deleted the centralized-model-io branch March 12, 2026 09:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor core GGUF loading to use a shared backend-aware reader policy across Qwen and LFM2 models#70

Refactor core GGUF loading to use a shared backend-aware reader policy across Qwen and LFM2 models#70
zinyando merged 5 commits intomainfrom
centralized-model-io

zinyando commented Mar 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

zinyando commented Mar 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant