[CD] Add pytorch xpu wheel build in nightly #129560
chuanqi129 wants to merge 6 commits into pytorch:main
Dr. CI: see artifacts and rendered test results at hud.pytorch.org/pr/129560.
1 active SEV. 2 new failures as of commit daf563e with merge base 46c5266.
Force-pushed from 70eec88 to 092324e.
ROCM_ARCHES = ["6.0", "6.1"]

XPU_ARCHES = ["xpu"]
@chuanqi129 please separate this PR into two.
- The first PR builds the triton-xpu wheel. The changes in this file and generated-linux-binary-manywheel-nightly.yml are not required at that point.
- The second PR actually adds the manywheel build. At that point, add the changes to this file and generated-linux-binary-manywheel-nightly.yml.
The xpu 3.8 wheel build failure during numpy 1.15 installation should be fixed by PR pytorch/builder#1909.
PACKAGE_TYPE: manywheel
# TODO: This is a legacy variable that we eventually want to get rid of in
# favor of GPU_ARCH_VERSION
DESIRED_CUDA: xpu
What's the meaning of DESIRED_CUDA?
Same here. DESIRED_CUDA is shared by all devices, including cpu, rocm, etc.
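To illustrate how one legacy variable ends up covering every device, here is a minimal sketch of the mapping from accelerator type and version to a `DESIRED_CUDA` value. The function name `desired_cuda` is hypothetical, and the rules are inferred from the values in the generated matrix quoted later in this thread (`cpu`, `cu118`, `rocm6.0`, `xpu`); it is not copied from the PyTorch scripts.

```python
def desired_cuda(gpu_arch_type: str, gpu_arch_version: str = "") -> str:
    """Return the legacy DESIRED_CUDA string for a device (illustrative only)."""
    if gpu_arch_type == "cuda":
        # CUDA drops the dot: "12.1" becomes "cu121".
        return "cu" + gpu_arch_version.replace(".", "")
    if gpu_arch_type == "rocm":
        # ROCm keeps the dot: "6.0" becomes "rocm6.0".
        return "rocm" + gpu_arch_version
    if gpu_arch_type in ("cpu", "xpu"):
        # CPU and XPU carry no version, so the type itself is the value.
        return gpu_arch_type
    raise ValueError(f"unknown gpu_arch_type: {gpu_arch_type!r}")


if __name__ == "__main__":
    for arch, version in [("cpu", ""), ("cuda", "11.8"), ("rocm", "6.1"), ("xpu", "")]:
        print(arch, version or "-", "->", desired_cuda(arch, version))
```

Under these assumptions the variable is effectively a device tag rather than a CUDA version, which is why the TODO in the diff proposes replacing it with `GPU_ARCH_VERSION`.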
conda-pytorchbot-token-test: ${{ secrets.CONDA_PYTORCHBOT_TOKEN_TEST }}
uses: ./.github/workflows/_binary-upload.yml

manywheel-py3_9-xpu-build:
Most of the code is copy-paste. Compared with Py3.8, is the Python version the only difference? I'm just wondering if we can reuse the code with YAML anchors (https://yaml.org/spec/1.2.2/#rule-c-ns-anchor-property).
Hi @EikanWang, this workflow is generated by the script .github/scripts/generate_ci_workflows.py from the template .github/templates/linux_binary_build_workflow.yml.j2, so the repeated jobs are not written by hand.
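As a rough illustration of why the generated YAML repeats itself, the sketch below renders one job per Python version from a single template. It uses the standard library's `string.Template` as a stand-in for Jinja2, and the template text, `JOB_TEMPLATE`, and `render_jobs` are hypothetical, trimmed-down assumptions rather than the contents of the real script or template.

```python
from string import Template

# Hypothetical, heavily trimmed stand-in for the real .yml.j2 template.
JOB_TEMPLATE = Template(
    "manywheel-py${py_tag}-${arch}-build:\n"
    "  env:\n"
    "    PACKAGE_TYPE: manywheel\n"
    "    DESIRED_CUDA: ${arch}\n"
)


def render_jobs(python_versions, arch):
    """Render one job stanza per Python version for the given arch."""
    jobs = []
    for version in python_versions:
        # "3.9" becomes "3_9", matching job names such as manywheel-py3_9-xpu-build.
        py_tag = version.replace(".", "_")
        jobs.append(JOB_TEMPLATE.substitute(py_tag=py_tag, arch=arch))
    return "\n".join(jobs)


workflow = render_jobs(["3.8", "3.9"], "xpu")

if __name__ == "__main__":
    print(workflow)
```

Because the duplication lives in the generated output and not in the template, deduplicating with YAML anchors would mean changing the generator, not hand-editing the workflow file.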
The build crash will be fixed by PR #130333.
Hi @atalman, all test jobs related to the PR changes have passed, in particular the XPU manylinux wheel build and test jobs. There are some unrelated test job failures caused by conda. Could you please double-check and review the PR again? Thanks.
@pytorchbot merge
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours).
@pytorchmergebot merge -f "lint is successful, previous run was fully green"
The merge job was canceled or timed out. This most often happens if two merge requests were issued for the same PR, or if the merge job waited more than 6 hours for tests to finish. In the latter case, please do not hesitate to reissue the merge command.
Merge started. Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes).
The Torch XPU nightly wheel build was enabled when pytorch/pytorch#129560 landed; add XPU to binary build generation. Works for pytorch/pytorch#114850.

```bash
> python tools/scripts/generate_binary_build_matrix.py --with-xpu enable
{
  "include": [
    {
      "python_version": "3.8",
      "gpu_arch_type": "cpu",
      "gpu_arch_version": "",
      "desired_cuda": "cpu",
      "container_image": "pytorch/manylinux-builder:cpu",
      "package_type": "manywheel",
      "build_name": "manywheel-py3_8-cpu",
      "validation_runner": "linux.2xlarge",
      "installation": "pip3 install --pre torch torchvision torchaudio --index-url https://download.pytorch.org/whl/nightly/cpu",
      "channel": "nightly",
      "upload_to_base_bucket": "no",
      "stable_version": "2.3.1",
      "use_split_build": false
    },
    {
      "python_version": "3.8",
      "gpu_arch_type": "cuda",
      "gpu_arch_version": "11.8",
      "desired_cuda": "cu118",
      "container_image": "pytorch/manylinux-builder:cuda11.8",
      "package_type": "manywheel",
      "build_name": "manywheel-py3_8-cuda11_8",
      "validation_runner": "linux.g5.4xlarge.nvidia.gpu",
      "installation": "pip3 install --pre torch torchvision torchaudio --index-url https://download.pytorch.org/whl/nightly/cu118",
      "channel": "nightly",
      "upload_to_base_bucket": "no",
      "stable_version": "2.3.1",
      "use_split_build": false
    },
    {
      "python_version": "3.8",
      "gpu_arch_type": "cuda",
      "gpu_arch_version": "12.1",
      "desired_cuda": "cu121",
      "container_image": "pytorch/manylinux-builder:cuda12.1",
      "package_type": "manywheel",
      "build_name": "manywheel-py3_8-cuda12_1",
      "validation_runner": "linux.g5.4xlarge.nvidia.gpu",
      "installation": "pip3 install --pre torch torchvision torchaudio --index-url https://download.pytorch.org/whl/nightly/cu121",
      "channel": "nightly",
      "upload_to_base_bucket": "no",
      "stable_version": "2.3.1",
      "use_split_build": false
    },
    {
      "python_version": "3.8",
      "gpu_arch_type": "cuda",
      "gpu_arch_version": "12.4",
      "desired_cuda": "cu124",
      "container_image": "pytorch/manylinux-builder:cuda12.4",
      "package_type": "manywheel",
      "build_name": "manywheel-py3_8-cuda12_4",
      "validation_runner": "linux.g5.4xlarge.nvidia.gpu",
      "installation": "pip3 install --pre torch torchvision torchaudio --index-url https://download.pytorch.org/whl/nightly/cu124",
      "channel": "nightly",
      "upload_to_base_bucket": "no",
      "stable_version": "2.3.1",
      "use_split_build": false
    },
    {
      "python_version": "3.8",
      "gpu_arch_type": "rocm",
      "gpu_arch_version": "6.0",
      "desired_cuda": "rocm6.0",
      "container_image": "pytorch/manylinux-builder:rocm6.0",
      "package_type": "manywheel",
      "build_name": "manywheel-py3_8-rocm6_0",
      "validation_runner": "linux.2xlarge",
      "installation": "pip3 install --pre torch torchvision torchaudio --index-url https://download.pytorch.org/whl/nightly/rocm6.0",
      "channel": "nightly",
      "upload_to_base_bucket": "no",
      "stable_version": "2.3.1",
      "use_split_build": false
    },
    {
      "python_version": "3.8",
      "gpu_arch_type": "rocm",
      "gpu_arch_version": "6.1",
      "desired_cuda": "rocm6.1",
      "container_image": "pytorch/manylinux-builder:rocm6.1",
      "package_type": "manywheel",
      "build_name": "manywheel-py3_8-rocm6_1",
      "validation_runner": "linux.2xlarge",
      "installation": "pip3 install --pre torch torchvision torchaudio --index-url https://download.pytorch.org/whl/nightly/rocm6.1",
      "channel": "nightly",
      "upload_to_base_bucket": "no",
      "stable_version": "2.3.1",
      "use_split_build": false
    },
    {
      "python_version": "3.8",
      "gpu_arch_type": "xpu",
      "gpu_arch_version": "",
      "desired_cuda": "xpu",
      "container_image": "pytorch/manylinux2_28-builder:xpu",
      "package_type": "manywheel",
      "build_name": "manywheel-py3_8-xpu",
      "validation_runner": "linux.2xlarge",
      "installation": "pip3 install --pre torch torchvision torchaudio --index-url https://download.pytorch.org/whl/nightly/xpu",
      "channel": "nightly",
      "upload_to_base_bucket": "no",
      "stable_version": "2.3.1",
      "use_split_build": false
    }
  ]
}
```
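For readers who want to consume this matrix, here is a minimal sketch of selecting the XPU entries from the generated JSON. The field names and values are taken from the output above; `MATRIX_JSON` is a hand-trimmed copy with only two entries and a few fields, and the helper `select_builds` is a hypothetical name, not part of the PyTorch tooling.

```python
import json

# Trimmed copy of the generated matrix: two entries, a subset of fields.
MATRIX_JSON = """
{
  "include": [
    {
      "python_version": "3.8",
      "gpu_arch_type": "cpu",
      "desired_cuda": "cpu",
      "container_image": "pytorch/manylinux-builder:cpu",
      "build_name": "manywheel-py3_8-cpu"
    },
    {
      "python_version": "3.8",
      "gpu_arch_type": "xpu",
      "desired_cuda": "xpu",
      "container_image": "pytorch/manylinux2_28-builder:xpu",
      "build_name": "manywheel-py3_8-xpu"
    }
  ]
}
"""


def select_builds(matrix: dict, gpu_arch_type: str) -> list:
    """Return the matrix entries whose gpu_arch_type matches."""
    return [e for e in matrix["include"] if e["gpu_arch_type"] == gpu_arch_type]


xpu_builds = select_builds(json.loads(MATRIX_JSON), "xpu")

if __name__ == "__main__":
    for build in xpu_builds:
        print(build["build_name"], "->", build["container_image"])
```

Note that the XPU entry is the only one in the output above that uses the `manylinux2_28-builder` image; the other entries use `manylinux-builder`.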
Add pytorch xpu wheel build in nightly after the xpu build image enabling PR pytorch/builder#1879 merged

Pull Request resolved: pytorch#129560
Approved by: https://github.com/atalman
Add the pytorch xpu wheel build to nightly, now that the xpu build image enabling PR pytorch/builder#1879 has merged. Link to #114850