
[ragged-paged-attn] Combine k_pages and v_pages on num_kv_head#8892

Merged
vanbasten23 merged 9 commits into pytorch:master from bythew3i:ragged-attn-v2 on Mar 27, 2025

Conversation

@bythew3i (Contributor) commented Mar 26, 2025

This PR

  • Combines k_pages and v_pages on num_kv_head to support sharding num_kv_heads down to 1 (multi-chip) while keeping the kernel and scatter performant (see the sketch below).
  • Merges the sliding_window and soft_cap changes.
  • Integrates the kernel as a PyTorch custom kernel.
  • Refactors the tests and improves test coverage.
  • Tests dynamo compilation with the dynamic grid from Pallas.

Tested:

python test/test_pallas.py -v -k PallasTest.test_ragged_paged_attention_wrapper
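
For intuition, here is a minimal sketch of combining the K and V pages along the kv-head axis. The shapes, names, and the interleaved layout are assumptions for illustration, not necessarily the kernel's exact layout:

import torch

# Hypothetical shapes for illustration only.
num_pages, page_size, num_kv_heads, head_dim = 4, 16, 8, 128
k_pages = torch.randn(num_pages, page_size, num_kv_heads, head_dim)
v_pages = torch.randn(num_pages, page_size, num_kv_heads, head_dim)

# Interleave K and V on the kv-head axis: head h's K lands at index 2*h
# and its V at 2*h + 1, so even after sharding num_kv_heads down to 1,
# each chip keeps a contiguous K/V pair per page.
kv_pages = torch.stack((k_pages, v_pages), dim=3).reshape(
    num_pages, page_size, num_kv_heads * 2, head_dim)

assert torch.equal(kv_pages[:, :, 0::2, :], k_pages)
assert torch.equal(kv_pages[:, :, 1::2, :], v_pages)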

mask_value: float,
sliding_window: int | None = None,
soft_cap: float | None = None,
mask_value: float | None = DEFAULT_MASK_VALUE,
Collaborator

left some comments in your original cl for the kernel.

Contributor Author

Thx! I will resolve them there!

Comment thread: test/test_pallas.py
page_indices_xla = page_indices.to("xla")
cu_q_lens_xla = cu_q_lens.to("xla")
num_seqs_xla = torch.tensor([num_seqs], dtype=torch.int32).to("xla")
sliding_window = sliding_window
Collaborator

nit, no need: lines 672-673?

Contributor Author

Good point! Looks like this PR has already been merged; let me resolve it in a separate PR.

Comment thread: test/test_pallas.py
sliding_window = sliding_window
soft_cap = soft_cap
# Test mask_value
mask_value = None
Collaborator

imo, we can just use the default mask value rather than letting the user choose one.

Contributor Author

I will remove that in a follow-up PR since this PR is already merged.

@vanbasten23 (Collaborator)

Mostly LGTM, pending on CI. Thanks Jevin!

_, page_size, kv_hidden_size = k_pages.shape
num_kv_heads = kv_hidden_size // head_dim
check_inputs_shapes(q, kv_pages, kv_lens, page_indices, cu_q_lens, num_seqs)
if mask_value is None:
Collaborator

Is this still needed, given that mask_value is assigned a default value: "mask_value: float | None = DEFAULT_MASK_VALUE"?

Contributor Author

Yes, the float | None just lists None as an allowed type. So if mask_value is passed explicitly as None, it won't use DEFAULT_MASK_VALUE.

Collaborator

But you have "= DEFAULT_MASK_VALUE", which means that if mask_value is None, it will use DEFAULT_MASK_VALUE, right?

Contributor Author

No, as mentioned, the None in float | None is just an allowed type; you can do a simple test:

>>> def f(a, b: float | None = 1.0):
...     print(a, b)
...
>>> f(2, None)
2 None
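
In other words, the explicit "if mask_value is None" guard in the wrapper (see the diff above) is what restores the fallback when a caller passes None. A minimal runnable sketch (the function name and constant value here are illustrative, not the kernel's actual definitions):

import numpy as np

# Illustrative constant; the real DEFAULT_MASK_VALUE lives in the kernel module.
DEFAULT_MASK_VALUE = -0.7 * float(np.finfo(np.float32).max)

def attn_stub(mask_value: float | None = DEFAULT_MASK_VALUE):
    # An explicit None bypasses the default argument, so this guard is
    # what restores the fallback value.
    if mask_value is None:
        mask_value = DEFAULT_MASK_VALUE
    return mask_value

print(attn_stub())      # default argument used
print(attn_stub(None))  # explicit None; the guard restores the default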

@vanbasten23 merged commit 7a3c051 into pytorch:master on Mar 27, 2025
23 checks passed