Add sparse marlin 2:4 gemm op by Diogo-V · Pull Request #733 · pytorch/ao

Diogo-V · 2024-08-22T22:41:54Z

Description

This PR is a more concise version of #621 where only the gemm op and surrounding functions are implemented for a 2:4 sparse marlin kernel.

What was done:

Tests to validate the gemm op as well as with opcheck() to check if torch.compile will work out of the box with it
Implemented functions to pack an int4 quantized tensor into a sparse marlin representation
Implemented functions to reverse the above process (to be later used when dequantize() is called)

Notes:

The cuda kernel was extracted from this repo

cc @jcaip

pytorch-bot · 2024-08-22T22:41:57Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/733

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 8699877 with merge base 0ed3090 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

jcaip

Thanks @Diogo-V!

msaroufim · 2024-08-23T17:23:54Z

that was fast haha, 1 shot green ci ;)

Diogo-V · 2024-08-23T18:24:03Z

Glad I could be of help!
Now, I have to maintain the streak ;)

feat: add sparse marlin 2:4 kernel

c18f6bd

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Aug 22, 2024

Diogo-V mentioned this pull request Aug 22, 2024

Add sparse marlin AQT layout #621

Merged

Merge branch 'main' into feat/sparse-marlin-gemm-op

8699877

jcaip self-requested a review August 23, 2024 16:52

jcaip approved these changes Aug 23, 2024

View reviewed changes

jcaip merged commit 614c667 into pytorch:main Aug 23, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add sparse marlin 2:4 gemm op#733

Add sparse marlin 2:4 gemm op#733
jcaip merged 2 commits into
pytorch:mainfrom
Diogo-V:feat/sparse-marlin-gemm-op

Diogo-V commented Aug 22, 2024 •

edited

Loading

Uh oh!

pytorch-bot Bot commented Aug 22, 2024 •

edited

Loading

Uh oh!

jcaip left a comment

Uh oh!

msaroufim commented Aug 23, 2024

Uh oh!

Diogo-V commented Aug 23, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

Diogo-V commented Aug 22, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

What was done:

Notes:

Uh oh!

pytorch-bot Bot commented Aug 22, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/733

✅ No Failures

Uh oh!

jcaip left a comment

Choose a reason for hiding this comment

Uh oh!

msaroufim commented Aug 23, 2024

Uh oh!

Diogo-V commented Aug 23, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Diogo-V commented Aug 22, 2024 •

edited

Loading

pytorch-bot Bot commented Aug 22, 2024 •

edited

Loading