Skip to content

[BUGFIX] Corrected types for strides in triton FA (#274)#276

Merged
maleksan85 merged 1 commit intomainfrom
llama32_crash_in_fa_fix
Nov 13, 2024
Merged

[BUGFIX] Corrected types for strides in triton FA (#274)#276
maleksan85 merged 1 commit intomainfrom
llama32_crash_in_fa_fix

Conversation

@maleksan85
Copy link

Plus removed KV truncation for cross attention case

Co-authored-by: Aleksandr Malyshev <maleksan@amd.com>
(cherry picked from commit 9a46e97)
@maleksan85 maleksan85 merged commit efb0432 into main Nov 13, 2024
@maleksan85 maleksan85 deleted the llama32_crash_in_fa_fix branch November 13, 2024 18:22
maleksan85 added a commit that referenced this pull request Nov 19, 2024
Co-authored-by: Aleksandr Malyshev <maleksan@amd.com>
(cherry picked from commit 9a46e97)
shajrawi pushed a commit that referenced this pull request Dec 3, 2024
* corrected types for strides in triton FA (#274) (#276)

Co-authored-by: Aleksandr Malyshev <maleksan@amd.com>
(cherry picked from commit 9a46e97)

* fused_moe configs for MI325X

New fused_moe configs for Mixtral-8x7B and Mixtral-8x22B with
TP=1,2,4,8 for both FP8 and FP16 on the recently announced MI325X.

---------

Co-authored-by: Aleksandr Malyshev <164964928+maleksan85@users.noreply.github.com>
gshtras pushed a commit that referenced this pull request Dec 9, 2024
Co-authored-by: Aleksandr Malyshev <maleksan@amd.com>
(cherry picked from commit 9a46e97)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants