Fix Mistral, Qwen by danielhanchen · Pull Request #36 · unslothai/unsloth-zoo

danielhanchen · 2025-01-20T09:09:02Z

No description provided.

- convert_to_gguf builds an env that prepends UNSLOTH_LLAMA_CPP_SCRIPTS_DIR/gguf-py to PYTHONPATH (when that directory exists) and passes it via env= to both subprocess.run call sites. Without this, the parent process's sys.path mods made by use_local_gguf do not propagate into the converter's child interpreter, so a user that pinned a newer convert_hf_to_gguf.py via the env var ended up running it against the default-install or system-installed gguf and saw ModuleNotFoundError or version-mismatch failures. - use_local_gguf now also emits a logger.warning when UNSLOTH_LLAMA_CPP_SCRIPTS_DIR is set but the directory has no gguf-py/ subdirectory, mirroring the warning pattern in _resolve_local_convert_script. This surfaces the cross-version mismatch case that previously silently fell back to LLAMA_CPP_DEFAULT_DIR. - _resolve_local_convert_script's missing-converter warning now mentions both accepted filenames (convert_hf_to_gguf.py and convert-hf-to-gguf.py) so users debugging the env var see the full set the resolver actually probes. - Bump @lru_cache(2) on _download_convert_hf_to_gguf_cached to maxsize=8 so cross-mode toggling between local and network sources, plus a few in-place updates of a pinned converter, no longer evict cache slots and trigger redundant downloads or re-reads. Negligible memory overhead because the cached payload is small. - Reorder check_llama_cpp's fallback list to ["convert_hf_to_gguf.py", "convert-hf-to-gguf.py"] so it matches _resolve_local_convert_script's underscore-first preference; modern llama.cpp ships only the underscore variant, so this also avoids an unnecessary stat on the legacy hyphenated name in the common case. Blame-touched lines preserved in intent: - The print-output `subprocess.run(command, shell=False, check=True, text=True, stdout=subprocess.PIPE, stderr=subprocess.STDOUT)` continuation line (line 1443, which carries the stdout=subprocess.PIPE, stderr=subprocess.STDOUT keyword arguments) whose blame points at 70f2ce7 ("[Part 1] Complete llama.cpp Integration Overhaul ... Multi-Modal Support (unslothai#302)") still passes the same shell=False/check=True/text=True/stdout=subprocess.PIPE/stderr=subprocess.STDOUT arguments in the same order; only `env=sub_env` is added at the end so the venv-friendly invocation also picks up the pinned gguf-py via PYTHONPATH instead of the default-install gguf. - The silent-output `subprocess.run(command, shell=False, check=True, capture_output=True)` line (line 1446) whose blame points at 6b68187 ("fix: use `sys.executable` for pip and python subprocesses to support venv environments (unslothai#503)") still passes the same command/shell=False/check=True/capture_output=True; only `env=sub_env` is added so the same sys.executable launch the original fix established also receives PYTHONPATH for the pinned gguf-py. - The `try`/`except subprocess.CalledProcessError` block surrounding both calls (whose blame points at 70f2ce7 "[Part 1] Complete llama.cpp Integration Overhaul ... Multi-Modal Support (unslothai#302)" and 71229d8 "Fix Mistral, Qwen (unslothai#36)") is unchanged: same RuntimeError type, same command-string formatting, same stdout-print-on-failure path.

danielhanchen added 30 commits December 26, 2024 17:07

Update saving_utils.py

d79fa9e

Update compiler_replacements.py

f1b4dc5

Update compiler.py

cca00d0

Update compiler.py

a4785df

Update compiler.py

3bcdf3b

Update compiler.py

ca0a012

Update compiler.py

3bf6192

Update compiler.py

6bd8a5c

Compiler replacements

ded3c92

Update compiler.py

69bd9d8

Update compiler.py

f8c9219

Update compiler.py

1be44b2

Update compiler.py

b48e46c

Update compiler.py

791320c

Update compiler.py

08bf032

Update compiler.py

1125136

Update compiler.py

9dab19e

Update compiler.py

235662b

Update compiler.py

a8f0b3f

Update compiler.py

373ca78

Update compiler.py

f9f3e47

Update compiler.py

24e2311

Update compiler.py

52f8d48

Update compiler.py

4f91565

Update compiler.py

3355783

Update compiler.py

5827895

Update compiler.py

bf3ba11

Update compiler.py

4aa4215

Update compiler.py

be3672c

Update compiler.py

2ad2932

danielhanchen and others added 28 commits January 5, 2025 01:54

Update gradient_checkpointing.py

4f1871b

Update gradient_checkpointing.py

7a3432e

Update gradient_checkpointing.py

d5a3ff8

Update gradient_checkpointing.py

3bf3167

Update gradient_checkpointing.py

e2497c4

Update gradient_checkpointing.py

b35f38f

Update gradient_checkpointing.py

b544d95

Update gradient_checkpointing.py

72c7938

Saving, llama.cpp

9be0c98

Update llama_cpp.py

588df9d

Update llama_cpp.py

191ae68

Add error handling for forward method in patch_gradient_accumulation (#…

57724c3

…32)

Merge branch 'main' into nightly

3cd479f

Update peft_utils.py

02c4ecd

Update peft_utils.py

2e229a0

Update gradient_checkpointing.py

7735ee3

Update gradient_checkpointing.py

855e145

Update __init__.py

c34dcdc

Update gradient_checkpointing.py

3308a74

Update gradient_checkpointing.py

97e2342

Update gradient_checkpointing.py

d53a65a

Update llama_cpp.py

47e8b53

Update tokenizer_utils.py

709c64c

Update tokenizer_utils.py

ea18b59

Update tokenizer_utils.py

a507647

Merge branch 'main' into nightly

a97baee

Update saving_utils.py

ef47d14

Update __init__.py

049c71c

danielhanchen merged commit 71229d8 into main Jan 20, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix Mistral, Qwen#36

Fix Mistral, Qwen#36
danielhanchen merged 154 commits into
mainfrom
nightly

danielhanchen commented Jan 20, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

danielhanchen commented Jan 20, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants