Fix llama.cpp CMake build detection in save.py by ashzak · Pull Request #5957 · unslothai/unsloth

ashzak · 2026-06-02T17:20:40Z

Summary

Fixes #5832: when llama.cpp migrated from Makefile to CMake, the build detection in unsloth/save.py tried make clean first and failed with a confusing "Build system changed: The Makefile build has been replaced by CMake" error.

Add _is_cmake_only_llama_cpp() helper that detects the post-migration Makefile deprecation stub (or a missing Makefile)
Detect the build system before attempting make clean in install_llama_cpp_make_non_blocking(), install_llama_cpp_old(), and install_llama_cpp_blocking()
Suppress stderr when probing make clean as a fallback check

The modern GGUF path goes through unsloth_zoo.llama_cpp.install_llama_cpp(); the same guard for that path is in unslothai/unsloth-zoo#763.

A docs-download endpoint originally bundled in this PR was removed; #5821 should be solved by bundling real docs at build time in its own PR.

Testing

Helper verified against the actual deprecation stub from llama.cpp master (detected), a missing Makefile (detected), and a legacy buildable Makefile (not detected)
py_compile on unsloth/save.py passes

Fixes unslothai#5832: llama.cpp build fails due to CMake migration - Add _is_cmake_only_llama_cpp() helper to detect CMake-only builds - Check Makefile for deprecation notice before trying `make clean` - Suppress confusing "Build system changed" error by detecting early - Update all three build functions to use the new detection Fixes unslothai#5821: Add documentation download for offline RAG - Add /api/settings/docs/download endpoint to serve bundled docs - Generate comprehensive markdown quick-reference guide - Add download button to About tab in settings - Add i18n translations (English and Chinese)

for more information, see https://pre-commit.ci

gemini-code-assist

Code Review

This pull request introduces a new documentation download endpoint and a corresponding "Download" button in the frontend "About" tab to support offline use and local RAG systems. Additionally, it improves the llama.cpp build system detection by checking for CMake-only configurations before executing make. Feedback on the changes highlights a potential crash in unsloth/save.py if the make executable is missing from the system, suggesting wrapping the subprocess.run call in a try-except block to handle FileNotFoundError and fallback gracefully to CMake.

Important

The consumer version of Gemini Code Assist on GitHub is being sunset. Starting June 18, 2026, new organization installations will be blocked, and all code review activity will officially cease on July 17, 2026.
For more details on the timeline and next steps, please review the Help Documentation.

gemini-code-assist · 2026-06-02T17:22:20Z

+        result = subprocess.run(
+            ["make", "clean", "-C", "llama.cpp"],
+            stdout = subprocess.DEVNULL,
+            stderr = subprocess.DEVNULL,
+        )
+        IS_CMAKE = result.returncode != 0


Using subprocess.run with shell=False (the default) to execute make will raise a FileNotFoundError if the make executable is not installed on the system (e.g., on Windows or minimal Docker containers). This will crash the GGUF export process.

To prevent this, wrap the execution in a try...except FileNotFoundError block and fallback to IS_CMAKE = True if make is unavailable.

Suggested change

result = subprocess.run(

["make", "clean", "-C", "llama.cpp"],

stdout = subprocess.DEVNULL,

stderr = subprocess.DEVNULL,

)

IS_CMAKE = result.returncode != 0

try:

result = subprocess.run(

["make", "clean", "-C", "llama.cpp"],

stdout=subprocess.DEVNULL,

stderr=subprocess.DEVNULL,

)

IS_CMAKE = result.returncode != 0

except FileNotFoundError:

IS_CMAKE = True

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 7181d9c4fc

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2026-06-02T17:22:43Z

+
+
+# Documentation export endpoint for RAG and offline use
+_DOCS_CACHE: str | None = None


Avoid Python 3.10-only union syntax

In Python 3.9 environments, which are allowed by the root pyproject.toml (requires-python = ">=3.9,<3.15"), importing this route evaluates str | None immediately and raises TypeError: unsupported operand type(s) for |: 'type' and 'NoneType' before the Studio server can start. Use Optional[str] or add postponed annotations so the new settings route remains importable on the supported Python range.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-06-02T17:22:43Z

+  if (token) headers.set("Authorization", `Bearer ${token}`);
+
+  try {
+    const res = await fetch(apiUrl("/api/settings/docs/download"), { headers });


Use authenticated fetch for the protected download

Because /api/settings/docs/download is protected by Depends(get_current_subject), this raw fetch fails whenever the access token has expired but the refresh token is still valid: it receives a 401 and only logs an error, unlike the existing settings API helpers that use authFetch to refresh/retry and handle Tauri auto-auth. In that common expired-token scenario the new Download button does nothing until the user refreshes or logs in again.

Useful? React with 👍 / 👎.

Address Gemini review feedback - wrap subprocess.run in try-except to handle FileNotFoundError when make is not installed, falling back to CMake build system.

- Use Optional[str] instead of str | None for Python 3.9 compatibility - Use authFetch instead of raw fetch for token refresh handling

# Conflicts: # studio/backend/routes/settings.py

…ion for PR unslothai#5957

danielhanchen · 2026-06-11T14:15:38Z

Thanks for the PR. I synced the branch with main and made two changes.

Kept the CMake detection fix (#5832). The _is_cmake_only_llama_cpp helper is the right approach. I verified it against the actual deprecation stub on llama.cpp master (detects it correctly), a missing Makefile, and a legacy buildable Makefile. Also trimmed the comments a bit.

Removed the docs download endpoint (#5821). The endpoint shipped a large hand-written markdown summary hardcoded inside routes/settings.py. That content duplicates docs.unsloth.ai and will silently drift out of date, and stale numbers presented as official documentation are worse than no file. The right shape for #5821 is bundling the real docs into the app at build time (or generating the file from the docs source) so there is a single source of truth. Happy to review that as its own PR.

Retitled the PR to match the remaining scope.

ashzak requested review from danielhanchen and rolandtannous as code owners June 2, 2026 17:20

[pre-commit.ci] auto fixes from pre-commit.com hooks

bc7144e

for more information, see https://pre-commit.ci

gemini-code-assist Bot reviewed Jun 2, 2026

View reviewed changes

chatgpt-codex-connector Bot reviewed Jun 2, 2026

View reviewed changes

ashzak added 2 commits June 2, 2026 12:23

fix: handle missing make executable gracefully

cb409b8

Address Gemini review feedback - wrap subprocess.run in try-except to handle FileNotFoundError when make is not installed, falling back to CMake build system.

fix: address review feedback for docs download

5428ac4

- Use Optional[str] instead of str | None for Python 3.9 compatibility - Use authFetch instead of raw fetch for token refresh handling

danielhanchen self-assigned this Jun 11, 2026

danielhanchen added 2 commits June 11, 2026 14:13

Merge remote-tracking branch 'origin/main' into fix/quick-wins-5832-5821

ee57cee

# Conflicts: # studio/backend/routes/settings.py

Drop hardcoded docs endpoint, keep and tighten llama.cpp CMake detect…

319ee6c

…ion for PR unslothai#5957

danielhanchen changed the title ~~fix: llama.cpp CMake build detection + docs download feature~~ Fix llama.cpp CMake build detection in save.py Jun 11, 2026

danielhanchen mentioned this pull request Jun 11, 2026

Skip make on CMake-only llama.cpp checkouts in install_llama_cpp unslothai/unsloth-zoo#763

Merged

Merge remote-tracking branch 'origin/main' into fix/quick-wins-5832-5821

087f9d0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix llama.cpp CMake build detection in save.py#5957

Fix llama.cpp CMake build detection in save.py#5957
ashzak wants to merge 7 commits into
unslothai:mainfrom
ashzak:fix/quick-wins-5832-5821

ashzak commented Jun 2, 2026 •

edited by danielhanchen

Loading

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot Jun 2, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot Jun 2, 2026

Uh oh!

chatgpt-codex-connector Bot Jun 2, 2026

Uh oh!

danielhanchen commented Jun 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants



		# Documentation export endpoint for RAG and offline use
		_DOCS_CACHE: str \| None = None

Uh oh!

Conversation

ashzak commented Jun 2, 2026 • edited by danielhanchen Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Testing

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot Jun 2, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot Jun 2, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot Jun 2, 2026

Choose a reason for hiding this comment

Uh oh!

danielhanchen commented Jun 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ashzak commented Jun 2, 2026 •

edited by danielhanchen

Loading