Skip to content

fix(cli): point 7B qwen models to single-file GGUF artifacts and align caches#1891

Closed
noahgift wants to merge 3 commits into
mainfrom
fix/cli-qwen-alias-and-caching
Closed

fix(cli): point 7B qwen models to single-file GGUF artifacts and align caches#1891
noahgift wants to merge 3 commits into
mainfrom
fix/cli-qwen-alias-and-caching

Conversation

@noahgift

Copy link
Copy Markdown
Contributor
  • Updated configs/aliases.yaml to point qwen2.5 to q3_k_m.gguf and qwen2.5-coder to q4_k_m.gguf, utilizing the fast single-file .gguf bypass rather than triggering sharded resolution issues or unsupported quantization types in the inference engine.
  • Fixed apr run and apr chat to resolve aliases using the same module as apr pull.
  • Aligned cache checking logic so apr run queries the Pacha download directory, eliminating cache misses after a successful apr pull.
  • Fixed the CLI's file path resolution logic to avoid mistaking local directories for remote Hugging Face repositories.
  • Corrected apr serve usage in README.md to apr serve run.

- Added qwen2.5-coder-1.5b alias to configs/aliases.yaml to fix apr pull qwen2.5-coder-1.5b
- Fixed apr run and apr chat to resolve short name aliases using the same logic as apr pull
- Fixed apr run cache checking logic to look in the Pacha streaming cache directory, ensuring apr run uses models successfully fetched by apr pull without attempting to re-download.
- Fixed apr serve usage in README.md to apr serve run.
@noahgift noahgift force-pushed the fix/cli-qwen-alias-and-caching branch from 58b07c3 to 014b44e Compare May 23, 2026 04:43
- Update configs/aliases.yaml to point qwen2.5 to the Q3_K_M GGUF file and qwen2.5-coder to the Q4_K_M GGUF file.
- Prevent apr run from mistaking local file paths containing slashes for Hugging Face repos.
- Fix pull logic for selecting the best single-file artifact when shards are mixed in.
@noahgift noahgift enabled auto-merge (squash) May 23, 2026 04:44
@noahgift noahgift force-pushed the fix/cli-qwen-alias-and-caching branch from 014b44e to 75e7341 Compare May 23, 2026 04:45
@noahgift

Copy link
Copy Markdown
Contributor Author

Subsumed by #1898 (mega-bundle hiatus close-out). Squash-merge preserves the per-PR commit message — see #1898 commit log.

@noahgift noahgift closed this May 23, 2026
auto-merge was automatically disabled May 23, 2026 07:09

Pull request was closed

noahgift added a commit that referenced this pull request May 23, 2026
PR #1891 wrapped all path_arg through HF alias resolution. For inputs
that look like filesystem paths (absolute or starts with ./, ../) and
don't exist, the alias resolver was rewriting them as hf:// URIs and
returning a mangled path in the FileNotFound error.

Fix: short-circuit with the original path_arg in the error BEFORE alias
resolution kicks in. Preserves the contract that test_run_file_not_found
and test_run_nonexistent_path_without_trace assert.

Closes the workspace-test failure on bundle PR #1898.
noahgift added a commit that referenced this pull request May 23, 2026
, #1896, #1897) (#1898)

* docs(spec): SPEC-DISTILL-001 §87 — PMAT-704 post-mortem on Bug B wrong turn (#1880)

* chore(distill): Stage D dispatch wrapper with PMAT-701 lessons baked in (#1883)

* chore(distill): Phase 5 HumanEval dispatch wrapper (#1886)

* chore: bundle PMAT-702..705 distill cascade + clippy fix (#1897)

* fix(cli): point 7B qwen models to single-file GGUF artifacts and align caches (#1891)

* fix(chat): preserve original path in FileNotFound for filesystem paths

PR #1891 wrapped all path_arg through HF alias resolution. For inputs
that look like filesystem paths (absolute or starts with ./, ../) and
don't exist, the alias resolver was rewriting them as hf:// URIs and
returning a mangled path in the FileNotFound error.

Fix: short-circuit with the original path_arg in the error BEFORE alias
resolution kicks in. Preserves the contract that test_run_file_not_found
and test_run_nonexistent_path_without_trace assert.

Closes the workspace-test failure on bundle PR #1898.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant