
[Pallas] Introduce make_kernel_from_pallas #6713

Merged
alanwaketan merged 5 commits into master from alanwaketan/pallas_api
Mar 13, 2024

Conversation

@alanwaketan
Collaborator

Summary:
This pull request introduces the make_kernel_from_pallas API, the top-level API for interacting with the Pallas integration. It takes a pallas_call wrapper and turns it into a custom PyTorch op.

Test Plan:
python test/test_pallas.py
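As a rough illustration of the API's contract (not the actual implementation, which lowers the pallas_call wrapper to a Mosaic payload and registers a custom PyTorch op), a pure-Python mock of the (kernel, output_shape_dtype_fn) pairing might look like this; FakeTensor and the mock itself are illustrative stand-ins:

```python
# Pure-Python mock of the make_kernel_from_pallas contract; FakeTensor and
# make_kernel_from_pallas_mock are stand-ins, not the real torch_xla API.
class FakeTensor:
    def __init__(self, shape, dtype):
        self.shape, self.dtype = shape, dtype

def make_kernel_from_pallas_mock(kernel, output_shape_dtype_fn):
    # The real API compiles the pallas_call wrapper into a payload and wraps
    # it as a custom op; here we just thread the shape/dtype function through.
    def wrapped(*args):
        out_shape, out_dtype = output_shape_dtype_fn(*args)
        return kernel(*args), (out_shape, out_dtype)
    return wrapped

# Usage mirroring the lambda seen in test_pallas.py: (x.shape, x.dtype).
add = make_kernel_from_pallas_mock(
    lambda x, y: FakeTensor(x.shape, x.dtype),
    lambda x, y: (x.shape, x.dtype))
out, shape_dtype = add(FakeTensor((8,), "float32"),
                       FakeTensor((8,), "float32"))
assert shape_dtype == ((8,), "float32")
```

The output_shape_dtype_fn exists because the op's output metadata must be known before the kernel runs on device.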

@alanwaketan alanwaketan requested review from JackCaoG and qihqi March 11, 2024 19:04
return None


def convert_torch_dtype_to_jax(dtype: torch.dtype) -> jnp.dtype:
Collaborator

@qihqi do we already have such a conversion somewhere in torchxla2?

Collaborator Author

It's generated by copilot, lol
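For reference, a hedged sketch of the kind of mapping such a helper performs, written here over dtype names rather than the actual torch.dtype/jnp.dtype objects (the real helper operates on those objects directly):

```python
# Sketch of a torch->jax dtype mapping keyed by dtype name. The table and
# helper are illustrative assumptions, not the PR's convert_torch_dtype_to_jax.
_TORCH_TO_JAX_DTYPE = {
    "float32": "float32",
    "float": "float32",      # torch.float is an alias for torch.float32
    "bfloat16": "bfloat16",
    "float16": "float16",
    "int32": "int32",
    "int64": "int64",
}

def convert_torch_dtype_to_jax(torch_dtype_name: str) -> str:
    try:
        return _TORCH_TO_JAX_DTYPE[torch_dtype_name]
    except KeyError:
        raise ValueError(f"Unsupported dtype: {torch_dtype_name}")
```

An explicit table like this makes the unsupported cases (e.g. the dtypes flagged in the test TODO below) fail loudly instead of silently casting.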

Comment thread test/test_pallas.py
(x.shape, x.dtype))

dtypes = [torch.float32, torch.float
         ]  # TODO: torch.float64, torch.bfloat16, torch.float16 don't work.
Collaborator

why won't bf16 work?

Collaborator Author

Mosaic complains. Need to dig more into it.

Comment thread test/test_pallas.py
Comment on lines +141 to +143
import jax
import jax.numpy as jnp
import jax._src.pallas.mosaic.pallas_call_registration
Collaborator

seems like this is repeated on multiple tests, maybe just move to the top?

Collaborator Author

There is a compatibility issue where jax will try to lock tpu devices if we import them before any pt/xla computations... I will need to resolve that...

Collaborator

@JackCaoG JackCaoG left a comment

Do you need this pr in 2.3?

@alanwaketan
Collaborator Author

Do you need this pr in 2.3?

Yea, will also need a couple for the TODOs.

Comment thread test/test_pallas.py
@unittest.skipIf(xr.device_type() != 'TPU', "This test only works on TPU.")
# TODO: This test cannot be run individually; let's fix it.
def test_tpu_custom_call_pallas_wrap_add_payload(self):
import jax
Collaborator

@miladm miladm Mar 12, 2024

I am concerned JAX-based tests cause failures due to libtpu version inconsistencies, and in turn CI hiccups. How do we resolve this concern?

Collaborator Author

That's resolved in the last PR: #6696



def make_kernel_from_pallas(kernel: Callable, output_shape_dtype_fn: Callable):
# TODO: Maybe we can cache the payload for the same input.
Collaborator

The payload may change if the input is dynamic. We need to confirm this with pallas folks.

Collaborator Author

Right, the cache itself should deal with the dynamism.

@alanwaketan alanwaketan force-pushed the alanwaketan/pallas_api branch from 8b8be2e to ae6b62b Compare March 12, 2024 23:38
@alanwaketan
Collaborator Author

Can I get any reviews?

Collaborator

@JackCaoG JackCaoG left a comment

I still think we should refactor convert_torch_dtype_to_jax and investigate bf16 (which I assume most people will use). Approving to unblock.

@alanwaketan
Collaborator Author

I still think we should refactor convert_torch_dtype_to_jax and investigate bf16 (which I assume most people will use). Approving to unblock.

Yea, for sure. Let me follow up with that.

@alanwaketan alanwaketan merged commit 1bbe333 into master Mar 13, 2024
@alanwaketan alanwaketan deleted the alanwaketan/pallas_api branch March 13, 2024 18:39
lsy323 pushed a commit that referenced this pull request Mar 13, 2024
Summary:
This pull request introduces the make_kernel_from_pallas API, the top-level API for interacting with the Pallas integration. It takes a pallas_call wrapper and turns it into a custom PyTorch op.

Test Plan:
python test/test_pallas.py
lsy323 added a commit that referenced this pull request Mar 13, 2024
Co-authored-by: Jiewen Tan <jwtan@google.com>
3 participants