Hivemind-driven maintenance sprint for ht-llama.cpp. Pick items in triaged order; cross off as done; append "saved for later" at the bottom.
Triage criteria (per ~/IDLE.md)
- Obviously wanted (lowest risk of decision drift)
- Low-hanging fruit
- How much we want it
- Difficulty
Active sprint — top 5
Saved for later
Out of scope (not maintenance)
- Anything that adds new features without an issue / RFC backing it
- Reviewing PRs from non-approved authors (none currently; all 4 open PRs are by
marksverdhei)
Created per ~/IDLE.md guidance. Will close when sprint cleared.
Hivemind-driven maintenance sprint for ht-llama.cpp. Pick items in triaged order; cross off as done; append "saved for later" at the bottom.
Triage criteria (per ~/IDLE.md)
Active sprint — top 5
origin/ht—build-sycl.ymlhas been failing on every recent commit (09b2124fb,b0daec55b). Investigate whether it's a pre-existing upstream issue or a real ht regression. If pre-existing, file a tracking task; if our problem, fix.--fit-print-plan(PR feat(fit-params): --fit-print-plan emits per-device byte plan as JSON (#66 step 2 prep) #72 added the flag with no test coverage). Add a smoke test that verifies the JSON output format. Either intests/as a small new test, or as a shellcheck-style script.tools/server/README.md— the heierchat decoupling left some pointers that may be stale. Verify all cross-repo links resolve.tools/server/server-models.cpp— 1500+ LOC of router state machine handling subprocess lifecycle, the area where heierchat-500s incident lives. Look for obvious logical bugs (race conditions, missing error paths, off-by-one onreserved-style accounting).llama-speculative-simpleagainst gemma-4-31B target. Once titan rolls,scripts/bench-dflash.sh --target Q8_0measures throughput; understand which hotspot we'd target if accept rate stays at ~8%.Saved for later
Out of scope (not maintenance)
marksverdhei)Created per
~/IDLE.mdguidance. Will close when sprint cleared.