[1/2] Introduce at::accelerator::Graph as a unified Graph interface#171269
guangyey wants to merge 20 commits into gh/guangyey/265/base
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/171269
Note: Links to docs will display an error until the docs builds have been completed. ✅ You can merge normally! (2 Unrelated Failures) As of commit 62ab0c3 with merge base f72a552. (FLAKY: the following jobs failed but were likely due to flakiness present on trunk.)
This comment was automatically generated by Dr. CI and updates every 15 minutes.
Thanks @eellison
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
@pytorchbot revert -m 'This is currently breaking some internal build, I need to revert and reland this' -c ghfirst
@pytorchbot successfully started a revert job. Check the current status here.
Revert "[1/2] Introduce at::accelerator::Graph as a unified Graph interface (#171269)". This reverts commit 2afd3c1. Reverted #171269 on behalf of https://github.com/huydhn due to: This is currently breaking some internal build, I need to revert and reland this.
@guangyey your PR has been successfully reverted.
@guangyey Please help do a rebase and reland this change.
Try to reland.
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
ghstack-source-id: d3b7f3f
Pull Request resolved: pytorch/pytorch#171269
Pull Request resolved: pytorch#171269. Approved by: https://github.com/EikanWang, https://github.com/eellison
Stack from ghstack (oldest at bottom):
# Motivation

The original goal was to generalize `CUDAGraph` and share implementations and logic across different backends, as mentioned in #158827. However, after further offline discussions, we decided to take a more incremental approach: start by defining a unified interface, while allowing each backend to maintain its own implementation. This avoids premature coupling and addresses backend-specific concerns.

This PR introduces `GraphImplInterface`, a lightweight, backend-agnostic interface that defines a unified API for graph capture and replay. Each backend (e.g., `CUDA`, `XPU`, `PrivateUse1`) provides its own implementation and registers it via `REGISTER_GRAPH_IMPL`.

On top of this interface, we provide a unified graph API, `at::accelerator::Graph`, which transparently maps to:

- `CUDAGraph` on CUDA
- `XPUGraph` on XPU
- the corresponding implementations for other backends (including `PrivateUse1`)

This design establishes a common abstraction layer while preserving backend autonomy, and lays the groundwork for future sharing of logic once the interface and use cases have stabilized.
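To make the shape of this design concrete, here is a minimal standalone sketch of the pattern described above: a backend-agnostic interface, a registry of per-backend factories, and a unified front-end that forwards to whichever implementation is registered for the device. This is not the actual ATen code; the `DeviceType` enum, the registry map, and the exact method set are simplified assumptions for illustration (the method names mirror the familiar capture/replay vocabulary of `CUDAGraph`).

```cpp
#include <cassert>
#include <functional>
#include <map>
#include <memory>
#include <stdexcept>

// Hypothetical stand-in for ATen's device types; the real code would key
// the registry on c10::DeviceType.
enum class DeviceType { CUDA, XPU, PrivateUse1 };

// Backend-agnostic interface for graph capture and replay, mirroring the
// role of GraphImplInterface in the PR (the method set is illustrative).
struct GraphImplInterface {
  virtual ~GraphImplInterface() = default;
  virtual void capture_begin() = 0;
  virtual void capture_end() = 0;
  virtual void replay() = 0;
};

// Simplified registry: each backend registers a factory for its impl.
using GraphImplFactory = std::function<std::unique_ptr<GraphImplInterface>()>;

std::map<DeviceType, GraphImplFactory>& graph_impl_registry() {
  static std::map<DeviceType, GraphImplFactory> registry;
  return registry;
}

// Unified front-end: plays the role of at::accelerator::Graph, transparently
// mapping to the backend implementation registered for the device.
class Graph {
 public:
  explicit Graph(DeviceType device) {
    auto it = graph_impl_registry().find(device);
    if (it == graph_impl_registry().end()) {
      throw std::runtime_error("no graph impl registered for this backend");
    }
    impl_ = it->second();
  }
  void capture_begin() { impl_->capture_begin(); }
  void capture_end() { impl_->capture_end(); }
  void replay() { impl_->replay(); }

 private:
  std::unique_ptr<GraphImplInterface> impl_;
};

// Toy backend impl standing in for a real CUDAGraph implementation.
struct ToyCudaGraphImpl : GraphImplInterface {
  void capture_begin() override {}
  void capture_end() override {}
  void replay() override {}
};

// Registration at static-initialization time, so Graph(DeviceType::CUDA)
// resolves without any explicit setup by the caller.
static const bool toy_cuda_registered = [] {
  graph_impl_registry()[DeviceType::CUDA] = [] {
    return std::unique_ptr<GraphImplInterface>(new ToyCudaGraphImpl());
  };
  return true;
}();
```

The key property is that user code only ever names `Graph`; which concrete implementation runs is decided entirely by what the backend registered, which is what lets the same front-end cover CUDA, XPU, and out-of-tree devices.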
An additional benefit is that, for `CUDA` and `XPU`, the backend-specific graph types (e.g., `cuda::CUDAGraph` and `xpu::XPUGraph`) can share the same underlying implementation as `accelerator::Graph` on each backend, avoiding code duplication and ensuring consistent behavior. For `PrivateUse1`, `accelerator::Graph` can be supported with minimal effort by reusing the existing `PU1Graph` implementation.

cc @albanD @eellison @EikanWang
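A macro like `REGISTER_GRAPH_IMPL` typically works by expanding to a static object whose initializer inserts the backend's factory into the registry at program start-up. The sketch below shows how an out-of-tree `PrivateUse1` backend could plug in an existing graph type this way. The macro body and the supporting registry scaffolding are guesses at the mechanism for illustration, not the actual ATen definitions, and `PU1GraphImpl` is a hypothetical stand-in for a backend's existing `PU1Graph`.

```cpp
#include <cassert>
#include <functional>
#include <map>
#include <memory>

// Minimal re-declaration of the registry pieces so this sketch compiles on
// its own; in the real tree these would live in ATen alongside the macro.
enum class DeviceType { CUDA, XPU, PrivateUse1 };

struct GraphImplInterface {
  virtual ~GraphImplInterface() = default;
  virtual void capture_begin() = 0;
  virtual void capture_end() = 0;
  virtual void replay() = 0;
};

using GraphImplFactory = std::function<std::unique_ptr<GraphImplInterface>()>;

std::map<DeviceType, GraphImplFactory>& graph_impl_registry() {
  static std::map<DeviceType, GraphImplFactory> registry;
  return registry;
}

// A REGISTER_GRAPH_IMPL-style macro (illustrative definition): expands to a
// static bool whose initializer runs at load time and inserts the factory.
#define REGISTER_GRAPH_IMPL(device, ImplType)                       \
  static const bool registered_##ImplType = [] {                    \
    graph_impl_registry()[device] = [] {                            \
      return std::unique_ptr<GraphImplInterface>(new ImplType());   \
    };                                                              \
    return true;                                                    \
  }()

// Hypothetical adapter wrapping an out-of-tree backend's existing graph
// implementation (the PU1Graph of the PR description) to the interface.
struct PU1GraphImpl : GraphImplInterface {
  bool captured = false;
  int replays = 0;
  void capture_begin() override {}
  void capture_end() override { captured = true; }
  void replay() override { ++replays; }
};

// One line in the backend's own translation unit is all the integration
// the unified front-end needs.
REGISTER_GRAPH_IMPL(DeviceType::PrivateUse1, PU1GraphImpl);
```

This registration-at-load-time pattern is what keeps the coupling one-directional: ATen never needs to name the backend's types, which is why `PrivateUse1` backends can adopt the unified API with minimal effort.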