[FlexAttention] Enable different qk and v head-dims #134043
drisspg wants to merge 14 commits into gh/drisspg/36/base
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/134043
Note: Links to docs will display an error until the docs builds have been completed. ✅ No Failures as of commit 067c895 with merge base 5f3d22a. This comment was automatically generated by Dr. CI and updates every 15 minutes.
# Summary

Adds the option for the head dims to differ between the QK and V tensors. Local testing shows that when QK_HEAD_DIM > V head dim this works great for the forward pass; not when V > QK, still debugging.
# Summary

Adds the option for the head dims to differ between the QK and V tensors. Fixes issue: #133674
Review comments were left on these kernel lines:

```python
q_range = stride_qg * off_g[:, None, None] + stride_qm * off_m[None, :, None] + stride_qk * offs_d[None, None, :]
```

```python
Q_block_ptr = tl.make_block_ptr(
```
@pytorchbot merge
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team
@pytorchbot merge -f "stucked ROCM jobs, flex attention unit tests only on CUDA" |
The merge job was canceled or timed out. This most often happens if two merge requests were issued for the same PR, or if the merge job was waiting for more than 6 hours for tests to finish. In the latter case, please do not hesitate to reissue the merge command.

@pytorchbot merge -f "stucked ROCM jobs, flex attention unit tests only on CUDA"
Merge started. Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team
@pytorchbot revert -m "Need to revert, in order to be able to revert #133373, feel free to reland this after solving conflicts" -c ghfirst
@pytorchbot successfully started a revert job. Check the current status here.
This reverts commit e847b6b. Reverted #134043 on behalf of https://github.com/jeanschmidt due to: Need to revert, in order to be able to revert #133373, feel free to reland this after solving conflicts.
@drisspg your PR has been successfully reverted. |
@pytorchbot merge
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team
# Summary

Adds the option for the head dims to differ between the QK and V tensors. Fixes issue: pytorch#133674. V_DIM > QK_DIM is blocked on landing triton-lang/triton#4138 / triton-lang/triton#4540 into PyTorch's Triton branch.

Pull Request resolved: pytorch#134043
Approved by: https://github.com/Chillee
…134043)"

This reverts commit e847b6b. Reverted pytorch#134043 on behalf of https://github.com/jeanschmidt due to: Need to revert, in order to be able to revert pytorch#133373, feel free to reland this after solving conflicts.
Stack from ghstack (oldest at bottom):
Summary
Adds the option for the head dims to be different between QK and V tensors.
Fixes issue: #133674
V_DIM > QK_DIM is blocked on landing triton-lang/triton#4138 / triton-lang/triton#4540 into PyTorch's Triton branch.
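As a minimal sketch of why the two head dims are independent (plain-PyTorch reference attention, not the PR's FlexAttention kernel; shapes and names are illustrative): the QK head dim only enters the score matrix, while the output head dim is inherited from V.

```python
import torch

def attention_ref(q, k, v):
    # q, k: (B, H, S, D_qk); v: (B, H, S, D_v) -> output: (B, H, S, D_v)
    # Scores are (B, H, S, S) regardless of D_qk; the output head dim comes from V.
    scores = (q @ k.transpose(-2, -1)) / (q.shape[-1] ** 0.5)
    return scores.softmax(dim=-1) @ v

# QK_HEAD_DIM > V_HEAD_DIM: the direction this PR supports today.
B, H, S, D_qk, D_v = 2, 4, 16, 128, 64
q = torch.randn(B, H, S, D_qk)
k = torch.randn(B, H, S, D_qk)
v = torch.randn(B, H, S, D_v)

out = attention_ref(q, k, v)
print(out.shape)  # torch.Size([2, 4, 16, 64])
```

The same shape contract is what `flex_attention` exposes after this change, with the fused kernel handling the differing dims internally.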
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @ColinPeppler @amjames @desertfire @chauhang