Enable eager mode for PyTorch/XLA by JackCaoG · Pull Request #7234 · pytorch/xla

JackCaoG · 2024-06-10T22:20:36Z

Add a eager mode for PyTorch/XLA. We already have an eager mode for debug which is UseEagerDebugMode that's controlled by a env var. This was contributed by the AWS folks(cc @aws-rhsoln @amithrm @jeffhataws). I decided to make a offical top level api for it.

In this pr, I

add api use_eager_mode along with all the pybinds
make sure that PT_XLA_DEBUG and PT_XLA_DEBUG_LEVEL does not print analysis for eager execution
add new metrics for eager mode compilation and eager mode executions

For future pr

increase the maximum number of queued task (for eager mode)
add api/decorator to compile a specified function in the eager mode.
add docs
test eager + SPMD

For a 2 layer decoder only model, eager mode can acheive ~40% of throuput compared to the full compiled mode.

lsy323 · 2024-06-10T23:21:50Z

Let's add a test, otherwise LGTM

JackCaoG · 2024-06-10T23:48:04Z

The correctness check is handled by run_eager_debug "$CDIR/test_operations.py" "$@" --verbosity=$VERBOSITY. I will expand this in the future.

JackCaoG · 2024-06-11T02:13:08Z

Ah.. I need to move this api under experimental.. Let me update the pr...

JackCaoG · 2024-06-11T20:12:27Z

I am going to skip the gpu tests and it takes forever to get resouce. From last few runs of the CI in this pr, there is nothing specified about the GPU.

Enable eager mode for PyTorch/XLA

94224d5

JackCaoG added the usability Bugs/features related to improving the usability of PyTorch/XLA label Jun 10, 2024

JackCaoG requested review from alanwaketan, lsy323 and will-cromar June 10, 2024 22:20

lsy323 approved these changes Jun 10, 2024

View reviewed changes

JackCaoG added the tpuci label Jun 10, 2024

JackCaoG added 3 commits June 10, 2024 23:28

run an example for eager

64b6acf

add PT_XLA_DEBUG=1 test

4a14378

test eager metrics

b15d8ad

qihqi approved these changes Jun 11, 2024

View reviewed changes

JackCaoG and others added 6 commits June 11, 2024 03:09

move eager mode to the experimental

f9c8765

Update train_decoder_only_eager.py

d10ca67

add missing file

b04a0f7

linter

a966010

rename api to eager_mode

1fc06f9

fix test

e6ef457

JackCaoG merged commit 4d0b94f into master Jun 11, 2024

JackCaoG deleted the JackCaoG/eager_mode branch June 11, 2024 20:12

JackCaoG mentioned this pull request Jun 12, 2024

[RFC] PyTorch/XLA eager mode as default #7253

Open

JackCaoG added the eager PyTorch/XLA eager-mode label Jun 20, 2024

JackCaoG mentioned this pull request Jul 2, 2024

2.4 backport PR request list #7242

Closed

JackCaoG added a commit that referenced this pull request Jul 2, 2024

Enable eager mode for PyTorch/XLA (#7234)

1320364

JackCaoG added a commit that referenced this pull request Jul 8, 2024

Enable eager mode for PyTorch/XLA (#7234)

6d48ff5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable eager mode for PyTorch/XLA#7234

Enable eager mode for PyTorch/XLA#7234
JackCaoG merged 10 commits intomasterfrom
JackCaoG/eager_mode

JackCaoG commented Jun 10, 2024 •

edited

Loading

Uh oh!

lsy323 commented Jun 10, 2024

Uh oh!

JackCaoG commented Jun 10, 2024

Uh oh!

JackCaoG commented Jun 11, 2024

Uh oh!

JackCaoG commented Jun 11, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

JackCaoG commented Jun 10, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

lsy323 commented Jun 10, 2024

Uh oh!

JackCaoG commented Jun 10, 2024

Uh oh!

JackCaoG commented Jun 11, 2024

Uh oh!

JackCaoG commented Jun 11, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

JackCaoG commented Jun 10, 2024 •

edited

Loading