Skip to content

Install magma from a tarball#140417

Closed
afrittoli wants to merge 1 commit into
pytorch:mainfrom
afrittoli:magma_from_tarball
Closed

Install magma from a tarball#140417
afrittoli wants to merge 1 commit into
pytorch:mainfrom
afrittoli:magma_from_tarball

Conversation

@afrittoli

@afrittoli afrittoli commented Nov 12, 2024

Copy link
Copy Markdown
Collaborator

Magma is built for specific CUDA versions and stored in the ossci-linux bucket. Install it from there rather than the deprecated conda package.

There are two places where magma is installed today:

  • install_conda.sh: extract the magma package in the same exact location where conda would install it, using a dedicated install_magma_conda.sh script. The new script is included in the relevant Dockerfiles where CUDA+magma is needed
  • install_magma.sh: this script already uses a tarball. Use the new tarball instead of the tarball from the conda package. The format of the new tarball is compatible with the old one, so changes here are minimal:wq

Fixes #140538
Test PR: #141584

@afrittoli afrittoli requested a review from jeffdaily as a code owner November 12, 2024 16:50
@pytorch-bot

pytorch-bot Bot commented Nov 12, 2024

Copy link
Copy Markdown

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/140417

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (4 Unrelated Failures)

As of commit cb46b69 with merge base 9012e7a (image):

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@pytorch-bot pytorch-bot Bot added the topic: not user facing topic category label Nov 12, 2024
@afrittoli afrittoli force-pushed the magma_from_tarball branch 4 times, most recently from aad2e03 to a132112 Compare November 12, 2024 22:30
@soulitzer soulitzer requested a review from a team November 13, 2024 04:13
@soulitzer soulitzer added the triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module label Nov 13, 2024
@afrittoli afrittoli force-pushed the magma_from_tarball branch 2 times, most recently from 6f707b7 to e2b9fa0 Compare November 13, 2024 10:47
afrittoli added a commit to afrittoli/pytorch that referenced this pull request Nov 26, 2024
This is a test PR to validate the images produced by installing magma
from tarball (instead of conda), introduced by pytorch#140417

Signed-off-by: Andrea Frittoli <andrea.frittoli@uk.ibm.com>
@afrittoli

Copy link
Copy Markdown
Collaborator Author

@atalman Tested at #141584.
The s390x and onnx failures definitely seem unrelated.
I'm unsure about the linux-jammy-py3-clang12-executorch failure. WDYT?

atalman
atalman previously approved these changes Nov 26, 2024

@atalman atalman left a comment

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@atalman

atalman commented Nov 27, 2024

Copy link
Copy Markdown
Collaborator

@pytorchmergebot merge -f "Failures are not related"

@pytorchmergebot

Copy link
Copy Markdown
Collaborator

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

@atalman

atalman commented Nov 27, 2024

Copy link
Copy Markdown
Collaborator

@pytorchmergebot revert -c nosignal -m "Caused failures in calculate docker image"

@pytorchmergebot

Copy link
Copy Markdown
Collaborator

@pytorchbot successfully started a revert job. Check the current status here.
Questions? Feedback? Please reach out to the PyTorch DevX Team

pytorchmergebot added a commit that referenced this pull request Nov 27, 2024
This reverts commit 30ab102.

Reverted #140417 on behalf of https://github.com/atalman due to Caused failures in calculate docker image ([comment](#140417 (comment)))
@pytorchmergebot

Copy link
Copy Markdown
Collaborator

@afrittoli your PR has been successfully reverted.

@pytorchmergebot pytorchmergebot added the ci-no-td Do not run TD on this PR label Nov 27, 2024
@pytorch-bot pytorch-bot Bot dismissed atalman’s stale review November 27, 2024 23:22

This PR was reopened (likely due to being reverted), so your approval was removed. Please request another review.

@atalman

atalman commented Nov 27, 2024

Copy link
Copy Markdown
Collaborator

@afrittoli sorry for revert. Will try to reland this tomorrow. Looks like the issue after landing this PR was caused by CI flakiness, not PR itself

@atalman

atalman commented Nov 28, 2024

Copy link
Copy Markdown
Collaborator

@pytorchmergebot rebase -b main

@pytorchmergebot

Copy link
Copy Markdown
Collaborator

@pytorchbot started a rebase job onto refs/remotes/origin/main. Check the current status here

@pytorchmergebot

Copy link
Copy Markdown
Collaborator

Rebase failed due to

Aborting rebase because rebasing the branch resulted in the same sha as the target branch.
This usually happens because the PR has already been merged.  Please rebase locally and push.

Raised by https://github.com/pytorch/pytorch/actions/runs/12070880702

Magma is built for specific CUDA versions and stored in the
ossci-linux bucket. Install it from there rather than the
deprecated conda package.

There are two places where magma is installed today:
- install_conda.sh: extract the magma package in the same
  exact location where conda would install it, using a
  dedicated install_magma_conda.sh script
- install_magma.sh: this script already uses a tarball.
  Use the new tarball instead of the tarball from the conda
  package. The format of the new tarball is compatible with
  the old one, so changes here are minimal:wq

Signed-off-by: Andrea Frittoli <andrea.frittoli@uk.ibm.com>

@atalman atalman left a comment

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

pobin6 pushed a commit to pobin6/pytorch that referenced this pull request Dec 5, 2024
Magma is built for specific CUDA versions and stored in the ossci-linux bucket. Install it from there rather than the deprecated conda package.

There are two places where magma is installed today:
- `install_conda.sh`: extract the magma package in the same exact location where conda would install it, using a dedicated `install_magma_conda.sh` script. The new script is included in the relevant Dockerfiles where CUDA+magma is needed
- `install_magma.sh`: this script already uses a tarball. Use the new tarball instead of the tarball from the conda package. The format of the new tarball is compatible with the old one, so changes here are minimal:wq

Fixes pytorch#140538
Test PR: pytorch#141584

Pull Request resolved: pytorch#140417
Approved by: https://github.com/atalman
pobin6 pushed a commit to pobin6/pytorch that referenced this pull request Dec 5, 2024
This reverts commit 30ab102.

Reverted pytorch#140417 on behalf of https://github.com/atalman due to Caused failures in calculate docker image ([comment](pytorch#140417 (comment)))
@atalman

atalman commented Dec 5, 2024

Copy link
Copy Markdown
Collaborator

@pytorchmergebot merge -f "All required jobs are green"

@pytorchmergebot

Copy link
Copy Markdown
Collaborator

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

AmdSampsa pushed a commit to AmdSampsa/pytorch that referenced this pull request Dec 9, 2024
Magma is built for specific CUDA versions and stored in the ossci-linux bucket. Install it from there rather than the deprecated conda package.

There are two places where magma is installed today:
- `install_conda.sh`: extract the magma package in the same exact location where conda would install it, using a dedicated `install_magma_conda.sh` script. The new script is included in the relevant Dockerfiles where CUDA+magma is needed
- `install_magma.sh`: this script already uses a tarball. Use the new tarball instead of the tarball from the conda package. The format of the new tarball is compatible with the old one, so changes here are minimal:wq

Fixes pytorch#140538
Test PR: pytorch#141584

Pull Request resolved: pytorch#140417
Approved by: https://github.com/atalman
pytorchmergebot pushed a commit that referenced this pull request Apr 22, 2025
…ll (#147476)

Related to: #138506 we migrated magma-cuda build from anaconda to aws
Last version of magma-cuda published was 12.6 https://anaconda.org/pytorch/magma-cuda126

Here is the PR that moved from anaconda to tarball: #140417

Pull Request resolved: #147476
Approved by: https://github.com/albanD
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ci-no-td Do not run TD on this PR Merged open source Reverted topic: not user facing topic category triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Update build jobs that use magma-cuda to use the new package

5 participants