
ROCm enable sparse_sampled_addmm#86401

Closed
jpvillam-amd wants to merge 3 commits into pytorch:master from ROCm:ROCm-SDDMM

Conversation


@jpvillam-amd jpvillam-amd commented Oct 6, 2022

Enables:
test_comprehensive_sparse_sampled_addmm_cuda_complex128
test_comprehensive_sparse_sampled_addmm_cuda_complex64
test_comprehensive_sparse_sampled_addmm_cuda_float32
test_comprehensive_sparse_sampled_addmm_cuda_float64
test_dispatch_meta_sparse_sampled_addmm_cuda_complex128
test_dispatch_meta_sparse_sampled_addmm_cuda_complex64
test_dispatch_meta_sparse_sampled_addmm_cuda_float32
test_dispatch_meta_sparse_sampled_addmm_cuda_float64
test_meta_sparse_sampled_addmm_cuda_complex128
test_meta_sparse_sampled_addmm_cuda_complex64
test_meta_sparse_sampled_addmm_cuda_float32
test_meta_sparse_sampled_addmm_cuda_float64

cc @jeffdaily @sunway513 @jithunnair-amd @ROCmSupport
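For context, `torch.sparse.sampled_addmm` computes `beta * input + alpha * (mat1 @ mat2)`, evaluated only at the locations stored in the sparse CSR `input`. A minimal pure-Python sketch of those semantics (a reference model only, not the hipSPARSE/cuSPARSE kernel these tests exercise):

```python
def sampled_addmm_ref(crow_indices, col_indices, values, mat1, mat2,
                      alpha=1.0, beta=1.0):
    """Reference semantics of sampled_addmm over a CSR pattern:
    out[i, j] = beta * input[i, j] + alpha * (mat1 @ mat2)[i, j],
    computed only where (i, j) is a stored element of the CSR input.
    Returns the new CSR values in storage order."""
    out_values = []
    k_dim = len(mat2)  # inner dimension shared by mat1 and mat2
    for i in range(len(crow_indices) - 1):
        for idx in range(crow_indices[i], crow_indices[i + 1]):
            j = col_indices[idx]
            dot = sum(mat1[i][k] * mat2[k][j] for k in range(k_dim))
            out_values.append(beta * values[idx] + alpha * dot)
    return out_values

# Mask stores (0, 0) and (1, 1); mat1 @ mat2 = [[19, 22], [43, 50]]
print(sampled_addmm_ref([0, 1, 2], [0, 1], [1.0, 1.0],
                        [[1, 2], [3, 4]], [[5, 6], [7, 8]]))  # [20.0, 51.0]
```

Entries of `mat1 @ mat2` outside the sparsity pattern are never materialized, which is the point of the SDDMM primitive the backend provides.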


pytorch-bot bot commented Oct 6, 2022

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/86401

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures, 1 Pending

As of commit ddac27b:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@ngimel ngimel left a comment


Lint errors are real, otherwise lgtm

@pytorch-bot pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Oct 6, 2022
@jpvillam-amd jpvillam-amd changed the title from "Added hip version checking and hipification code" to "ROCm enable sparse_sampled_addmm" Oct 6, 2022
@pytorch-bot pytorch-bot bot added the module: rocm AMD GPU support for Pytorch label Oct 6, 2022
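The original title mentions "hipification code": PyTorch's build translates CUDA sources for ROCm largely by textual renaming (the real tooling lives in torch/utils/hipify, with a much larger mapping table). A toy sketch of that idea, with illustrative mapping entries:

```python
# Toy sketch of hipify-style renaming. The real mapping lives in
# torch/utils/hipify/cuda_to_hip_mappings.py and is far larger;
# these three entries are just illustrative examples.
CUDA_TO_HIP = {
    "cusparseSDDMM": "hipsparseSDDMM",
    "cudaMalloc": "hipMalloc",
    "cuComplex": "hipComplex",
}

def hipify(source: str) -> str:
    """Apply the renames longest-match-first so that longer CUDA
    identifiers are not clobbered by shorter overlapping ones."""
    for cuda_name in sorted(CUDA_TO_HIP, key=len, reverse=True):
        source = source.replace(cuda_name, CUDA_TO_HIP[cuda_name])
    return source

print(hipify("cusparseSDDMM(handle, ...); cudaMalloc(&buf, n);"))
# hipsparseSDDMM(handle, ...); hipMalloc(&buf, n);
```

Version guards in the C++ sources then select the hipSPARSE path only on ROCm releases that ship the SDDMM routine.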
@jpvillam-amd

    test_comprehensive_sparse_sampled_addmm_cuda_complex128 (__main__.TestDecompCUDA) ... ok (0.187s)
    test_comprehensive_sparse_sampled_addmm_cuda_complex64 (__main__.TestDecompCUDA) ... ok (0.091s)
    test_comprehensive_sparse_sampled_addmm_cuda_float32 (__main__.TestDecompCUDA) ... ok (0.072s)
    test_comprehensive_sparse_sampled_addmm_cuda_float64 (__main__.TestDecompCUDA) ... ok (0.074s)

From: https://github.com/pytorch/pytorch/actions/runs/3201111231/jobs/5228949868#step:8:22000

    test_dispatch_meta_sparse_sampled_addmm_cuda_complex128 (__main__.TestMetaCUDA) ... ok (0.029s)
    test_dispatch_meta_sparse_sampled_addmm_cuda_complex64 (__main__.TestMetaCUDA) ... ok (0.010s)
    test_dispatch_meta_sparse_sampled_addmm_cuda_float32 (__main__.TestMetaCUDA) ... ok (0.009s)
    test_dispatch_meta_sparse_sampled_addmm_cuda_float64 (__main__.TestMetaCUDA) ... ok (0.009s)

From: https://github.com/pytorch/pytorch/actions/runs/3201111231/jobs/5228949868#step:8:6521

    test_meta_sparse_sampled_addmm_cuda_complex128 (__main__.TestMetaCUDA) ... ok (0.010s)
    test_meta_sparse_sampled_addmm_cuda_complex64 (__main__.TestMetaCUDA) ... ok (0.010s)
    test_meta_sparse_sampled_addmm_cuda_float32 (__main__.TestMetaCUDA) ... ok (0.009s)
    test_meta_sparse_sampled_addmm_cuda_float64 (__main__.TestMetaCUDA) ... ok (0.010s)

From: https://github.com/pytorch/pytorch/actions/runs/3201111231/jobs/5228949868#step:8:11747

@jpvillam-amd

I think the one failed check is from an XML reporting tool segfault:


OK (skipped=12, expected failures=2)

Generating XML reports...
Generated XML report: test-reports/python-unittest/test_autograd/TEST-TestAutograd-20221007003900.xml
Generated XML report: test-reports/python-unittest/test_autograd/TEST-autograd.test_complex.TestAutogradComplex-20221007003900.xml
Generated XML report: test-reports/python-unittest/test_autograd/TEST-TestAutogradDeviceTypeCUDA-20221007003900.xml
Generated XML report: test-reports/python-unittest/test_autograd/TEST-TestAutogradForwardMode-20221007003900.xml
Generated XML report: test-reports/python-unittest/test_autograd/TEST-TestAutogradForwardModeBatchedGrad-20221007003900.xml
Generated XML report: test-reports/python-unittest/test_autograd/TEST-autograd.test_functional.TestAutogradFunctional-20221007003900.xml
Generated XML report: test-reports/python-unittest/test_autograd/TEST-TestAutogradInferenceMode-20221007003900.xml
Generated XML report: test-reports/python-unittest/test_autograd/TEST-TestAutogradMultipleDispatchCUDA-20221007003900.xml
Generated XML report: test-reports/python-unittest/test_autograd/TEST-TestMultithreadAutograd-20221007003900.xml
[db1a429837ff:11768:0:11842] Caught signal 11 (Segmentation fault: address not mapped to object at address 0x9)

FINISHED PRINTING LOG FILE of test_autograd (/var/lib/jenkins/workspace/test/test-reports/test_autograd_u5hojsgw)

Traceback (most recent call last):
  File "/var/lib/jenkins/workspace/test/run_test.py", line 1236, in <module>
    main()
  File "/var/lib/jenkins/workspace/test/run_test.py", line 1211, in main
    raise RuntimeError(err_message)
RuntimeError: test_autograd failed! Received signal: SIGSEGV

I don't think that is related to this PR. @jithunnair-amd

@jithunnair-amd

@pytorchbot rebase

@pytorchmergebot

@pytorchbot successfully started a rebase job. Check the current status here

@pytorchmergebot

Successfully rebased ROCm-SDDMM onto refs/remotes/origin/viable/strict, please pull locally before adding more changes (for example, via git checkout ROCm-SDDMM && git pull --rebase)

@jithunnair-amd

@pytorchbot merge

@pytorchmergebot

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team


kulinseth pushed a commit to kulinseth/pytorch that referenced this pull request Nov 5, 2022
Enables:
test_comprehensive_sparse_sampled_addmm_cuda_complex128
test_comprehensive_sparse_sampled_addmm_cuda_complex64
test_comprehensive_sparse_sampled_addmm_cuda_float32
test_comprehensive_sparse_sampled_addmm_cuda_float64
test_dispatch_meta_sparse_sampled_addmm_cuda_complex128
test_dispatch_meta_sparse_sampled_addmm_cuda_complex64
test_dispatch_meta_sparse_sampled_addmm_cuda_float32
test_dispatch_meta_sparse_sampled_addmm_cuda_float64
test_meta_sparse_sampled_addmm_cuda_complex128
test_meta_sparse_sampled_addmm_cuda_complex64
test_meta_sparse_sampled_addmm_cuda_float32
test_meta_sparse_sampled_addmm_cuda_float64

Pull Request resolved: pytorch#86401
Approved by: https://github.com/ngimel
kulinseth pushed a commit to kulinseth/pytorch that referenced this pull request Dec 10, 2022
Pull Request resolved: pytorch#86401
Approved by: https://github.com/ngimel

Labels

ciflow/trunk (Trigger trunk jobs on your pull request), cla signed, Merged, module: rocm (AMD GPU support for Pytorch), open source, release notes: sparse


6 participants