Skip to content

[release/2.3] [ROCM] Properly disable Flash Attention/Efficient Attention with environment variables#1571

Merged
pruthvistony merged 6 commits intorelease/2.3from
xinyazhang/internal-2.3-nofa
Sep 11, 2024
Merged

[release/2.3] [ROCM] Properly disable Flash Attention/Efficient Attention with environment variables#1571
pruthvistony merged 6 commits intorelease/2.3from
xinyazhang/internal-2.3-nofa

Conversation

@xinyazhang
Copy link

Now USE_FLASH_ATTENTION=0 USE_MEM_EFF_ATTENTION=0 python setup.py can compile correctly.

This is cherry-picked version of pytorch#133866

@jithunnair-amd jithunnair-amd changed the title [ROCM] Properly disable Flash Attention/Efficient Attention with environment variables [release/2.3] [ROCM] Properly disable Flash Attention/Efficient Attention with environment variables Sep 11, 2024
"Mem Efficient attention was not compiled for current AMD GPU architecture. Attempting to run on architecture ", dprops->gcnArchName);
}
return false;
}
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@pruthvistony Isn't this where we would need to add a return true to address the "Control reaching end of non-void function" error?

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No its NOT here.

@pruthvistony
Copy link
Collaborator

For 2.3 mem_efficient should always be false.
@xinyazhang , please update the PR.

@xinyazhang xinyazhang force-pushed the xinyazhang/internal-2.3-nofa branch from 56f999f to 2c4693b Compare September 11, 2024 15:56
@pruthvistony pruthvistony merged commit 1b935e2 into release/2.3 Sep 11, 2024
@pruthvistony pruthvistony deleted the xinyazhang/internal-2.3-nofa branch September 11, 2024 18:39
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants