
[RelEng] Define BUILD_BUNDLE_PTXAS (#119750)#119988

Merged

atalman merged 1 commit into pytorch:release/2.2 from atalman:cherry_pick_ptxas on Feb 15, 2024

Conversation


@atalman atalman commented Feb 15, 2024

This change bundles ptxas into a `bin` folder.

When compiling for Triton, define `TRITON_PTXAS_PATH` if `ptxas` is bundled with PyTorch. This is needed to make PyTorch compiled against CUDA 11.8 usable with an 11.8 driver, since Triton is bundled with the latest ptxas (CUDA 12.3 at the time of the PyTorch 2.2 release).
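The wiring described above can be sketched as a small helper. This is a hypothetical illustration, not the actual PyTorch code: `use_bundled_ptxas` and its `package_dir` parameter are made-up names, standing in for the installed torch package directory that would contain the bundled `bin/ptxas` binary.

```python
import os
from pathlib import Path


def use_bundled_ptxas(package_dir: str):
    """Point Triton at a bundled ptxas, if one exists.

    Hypothetical helper: the real logic lives in PyTorch's Triton
    integration. If <package_dir>/bin/ptxas is present, export
    TRITON_PTXAS_PATH so Triton uses it instead of its own copy.
    Returns the path that was set, or None if no bundled ptxas exists.
    """
    candidate = Path(package_dir) / "bin" / "ptxas"
    if candidate.is_file():
        os.environ["TRITON_PTXAS_PATH"] = str(candidate)
        return str(candidate)
    return None
```

With this in place, a wheel built against CUDA 11.8 would hand Triton the matching ptxas rather than the newer one Triton ships with.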

Needs pytorch/builder@5c814e2 to produce valid binary builds

Test plan:

  • Create a dummy ptxas in the `torch/bin` folder and observe `torch.compile` fail with a backtrace in the Triton module.
  • Run the following script (to be added to binary tests) against the CUDA 11.8 wheel:
```python
import torch
import triton

@torch.compile
def foo(x: torch.Tensor) -> torch.Tensor:
    return torch.sin(x) + torch.cos(x)

x = torch.rand(3, 3, device="cuda")
print(foo(x))

# And check that the CUDA versions match
cuda_version = torch.version.cuda
ptxas_version = triton.backends.nvidia.compiler.get_ptxas_version().decode("ascii")
assert cuda_version in ptxas_version, f"CUDA version mismatch: torch built with {cuda_version}, but Triton uses ptxas {ptxas_version}"
```
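The substring check in the script relies on ptxas embedding its CUDA release in its version text. As a hedged sketch (the `extract_cuda_release` helper and the sample output format are illustrative, based on the typical `ptxas --version` banner), the release number could also be parsed out explicitly:

```python
import re


def extract_cuda_release(ptxas_version_output: str) -> str:
    """Extract the CUDA release (e.g. '11.8') from ptxas version text.

    ptxas --version typically prints a line such as
    'Cuda compilation tools, release 11.8, V11.8.89'.
    Returns '' when no release number is found.
    """
    match = re.search(r"release (\d+\.\d+)", ptxas_version_output)
    return match.group(1) if match else ""
```

Comparing the parsed release against `torch.version.cuda` would make the mismatch message more precise than a raw substring test.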

Fixes #119054

Pull Request resolved: #119750
Approved by: https://github.com/jansel, https://github.com/atalman


cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @peterbell10 @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @aakhundov @ColinPeppler


pytorch-bot bot commented Feb 15, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/119988

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (3 Unrelated Failures)

As of commit b86d77c with merge base a8bd593:

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@atalman atalman merged commit 6c8c5ad into pytorch:release/2.2 Feb 15, 2024
