Guard torch.compile on ROCm when triton_key is missing by hnxnq7 · Pull Request #3923 · unslothai/unsloth

hnxnq7 · 2026-01-22T21:17:26Z

Summary

On ROCm, some Triton builds don’t expose triton_key, which makes the torch.compile/Inductor path crash during LoRA backward.
This change detects that case and disables torch.compile/Inductor only on ROCm, avoiding the backend crash while leaving CUDA behavior unchanged.

Changes

unsloth/import_fixes.py: add fix_rocm_triton_key_error() to detect missing triton_key on ROCm and set TORCHINDUCTOR_DISABLE + TORCH_COMPILE_DISABLE.
unsloth/init.py: invoke the new fix during import.

Behavior

ROCm only; no change on NVIDIA/CUDA.
If Triton already has triton_key, nothing is changed.

for more information, see https://pre-commit.ci

gemini-code-assist · 2026-01-22T21:17:41Z

Summary of Changes

Hello @hnxnq7, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request implements a targeted solution to prevent crashes in ROCm environments when using torch.compile or Inductor, which can occur if the installed Triton version lacks the triton_key symbol. By conditionally disabling these features only when the specific problematic condition is met on ROCm, the change ensures stability without affecting CUDA users or ROCm users with compatible Triton builds. This enhances the robustness of the system by gracefully handling a known platform-specific dependency issue.

Highlights

ROCm Compatibility Fix: Introduces a fix specifically for ROCm environments where torch.compile and Inductor might crash due to a missing triton_key in some Triton builds.
Conditional Disablement: The system now detects if triton_key is absent on ROCm and, if so, automatically disables torch.compile and Inductor by setting specific environment variables (TORCHINDUCTOR_DISABLE, TORCH_COMPILE_DISABLE).
No Impact on CUDA: This change is strictly for ROCm and does not alter the behavior or functionality on NVIDIA/CUDA platforms.
Idempotent Behavior: If Triton already exposes triton_key on ROCm, the fix does nothing, ensuring no unnecessary changes to working configurations.

🧠 New Feature in Public Preview: You can now enable Memory to help Gemini Code Assist learn from your team's feedback. This makes future code reviews more consistent and personalized to your project's style. Click here to enable Memory in your admin console.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request introduces a safeguard for ROCm environments where an incomplete Triton installation (missing triton_key) can cause torch.compile to crash. The fix correctly identifies this scenario and disables torch.compile and Inductor by setting environment variables, which is a clean and effective approach. The changes are well-isolated. My review includes a suggestion to improve the robustness of the import checks in the new fix_rocm_triton_key_error function by using more specific exception handling.

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

for more information, see https://pre-commit.ci

The function (introduced in unslothai#3923) assumed that the absence of `triton.runtime.triton_key` on ROCm means torch.compile will crash. Investigation shows this is incorrect: 1. `triton.runtime.triton_key` was renamed/removed in the ROCm Triton fork — it does not exist at that path. However, `triton.compiler.compiler.triton_key` (the path torch._inductor actually imports) EXISTS and works correctly on ROCm. 2. Both call-sites in torch._inductor (codecache.py and async_compile.py) already wrap the import in try/except, so even a genuinely missing triton_key would be handled gracefully. 3. Comprehensive testing on ROCm 7.1 + Triton 3.4.0 + gfx1100 confirms torch.compile works correctly for matmul, cross-entropy, RMSNorm, multi-layer transformer forward+backward, and LoRA — all without triton.runtime.triton_key. The original code was also ineffective (environment variables set after torch import have no effect on torch._dynamo config), so removing it has zero behavioral change on existing installations. Supersedes the compile-disable portion of unslothai#3923.

unslothai#4125) The function (introduced in unslothai#3923) assumed that the absence of `triton.runtime.triton_key` on ROCm means torch.compile will crash. Investigation shows this is incorrect: 1. `triton.runtime.triton_key` was renamed/removed in the ROCm Triton fork — it does not exist at that path. However, `triton.compiler.compiler.triton_key` (the path torch._inductor actually imports) EXISTS and works correctly on ROCm. 2. Both call-sites in torch._inductor (codecache.py and async_compile.py) already wrap the import in try/except, so even a genuinely missing triton_key would be handled gracefully. 3. Comprehensive testing on ROCm 7.1 + Triton 3.4.0 + gfx1100 confirms torch.compile works correctly for matmul, cross-entropy, RMSNorm, multi-layer transformer forward+backward, and LoRA — all without triton.runtime.triton_key. The original code was also ineffective (environment variables set after torch import have no effect on torch._dynamo config), so removing it has zero behavioral change on existing installations. Supersedes the compile-disable portion of unslothai#3923.

* Guard torch.compile on ROCm when triton_key missing * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update unsloth/import_fixes.py Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Tighten ROCm Triton import handling * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Co-authored-by: Rachel Li <rachelliqx07@gmail.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

unslothai#4125) The function (introduced in unslothai#3923) assumed that the absence of `triton.runtime.triton_key` on ROCm means torch.compile will crash. Investigation shows this is incorrect: 1. `triton.runtime.triton_key` was renamed/removed in the ROCm Triton fork — it does not exist at that path. However, `triton.compiler.compiler.triton_key` (the path torch._inductor actually imports) EXISTS and works correctly on ROCm. 2. Both call-sites in torch._inductor (codecache.py and async_compile.py) already wrap the import in try/except, so even a genuinely missing triton_key would be handled gracefully. 3. Comprehensive testing on ROCm 7.1 + Triton 3.4.0 + gfx1100 confirms torch.compile works correctly for matmul, cross-entropy, RMSNorm, multi-layer transformer forward+backward, and LoRA — all without triton.runtime.triton_key. The original code was also ineffective (environment variables set after torch import have no effect on torch._dynamo config), so removing it has zero behavioral change on existing installations. Supersedes the compile-disable portion of unslothai#3923.

Rachel Li and others added 2 commits January 22, 2026 13:16

Guard torch.compile on ROCm when triton_key missing

ab9bda2

[pre-commit.ci] auto fixes from pre-commit.com hooks

22bc96e

for more information, see https://pre-commit.ci

gemini-code-assist Bot reviewed Jan 22, 2026

View reviewed changes

Comment thread unsloth/import_fixes.py Outdated

hnxnq7 and others added 6 commits January 22, 2026 16:19

Update unsloth/import_fixes.py

0b13822

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

[pre-commit.ci] auto fixes from pre-commit.com hooks

0667ddd

for more information, see https://pre-commit.ci

Merge branch 'main' into rocm-triton-key-fix

c04a293

[pre-commit.ci] auto fixes from pre-commit.com hooks

de48d88

for more information, see https://pre-commit.ci

Tighten ROCm Triton import handling

a544390

[pre-commit.ci] auto fixes from pre-commit.com hooks

502d07e

for more information, see https://pre-commit.ci

danielhanchen merged commit 0c50a51 into unslothai:main Jan 22, 2026
1 check passed

GoldenGrapeGentleman mentioned this pull request Feb 28, 2026

fix(ROCm): remove dead code fix_rocm_triton_key_error #4125

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Guard torch.compile on ROCm when triton_key is missing#3923

Guard torch.compile on ROCm when triton_key is missing#3923
danielhanchen merged 8 commits into
unslothai:mainfrom
hnxnq7:rocm-triton-key-fix

hnxnq7 commented Jan 22, 2026

Uh oh!

gemini-code-assist Bot commented Jan 22, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

hnxnq7 commented Jan 22, 2026

Uh oh!

gemini-code-assist Bot commented Jan 22, 2026

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants