Make run_dtensor_rng_op compatible with compile_on_one_rank#177447
Make run_dtensor_rng_op compatible with compile_on_one_rank#177447aorenste wants to merge 9 commits intogh/aorenste/220/basefrom
Conversation
Use mesh._sym_get_coordinate() in _compute_rng_offsets so that RNG offset values become symbolic SymInts (via _runtime_compute_coordinate_on_dim) when compile_on_one_rank is active. Previously, mesh.get_coordinate() returned concrete rank-specific integers that got baked into the compiled graph, producing different graphs on different ranks. Also refactors test_compile_on_one_rank.py to extract graph-comparison helpers (_assert_graphs_identical_across_ranks, _compile_and_capture_graph) and adds a test for DTensor random op graph consistency. Authored with Claude. [ghstack-poisoned]
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/177447
Note: Links to docs will display an error until the docs builds have been completed. ❌ 1 New Failure, 1 Pending, 2 Unrelated FailuresAs of commit 1c8fbdc with merge base 417a890 ( NEW FAILURE - The following job has failed:
BROKEN TRUNK - The following job failed but were present on the merge base:👉 Rebase onto the `viable/strict` branch to avoid these failures
UNSTABLE - The following job is marked as unstable, possibly due to flakiness on trunk:
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
Use mesh._sym_get_coordinate() in _compute_rng_offsets so that RNG offset values become symbolic SymInts (via _runtime_compute_coordinate_on_dim) when compile_on_one_rank is active. Previously, mesh.get_coordinate() returned concrete rank-specific integers that got baked into the compiled graph, producing different graphs on different ranks. Also refactors test_compile_on_one_rank.py to extract graph-comparison helpers (_assert_graphs_identical_across_ranks, _compile_and_capture_graph) and adds a test for DTensor random op graph consistency. Authored with Claude. ghstack-source-id: 71bd32b Pull Request resolved: #177447
Use mesh._sym_get_coordinate() in _compute_rng_offsets so that RNG offset values become symbolic SymInts (via _runtime_compute_coordinate_on_dim) when compile_on_one_rank is active. Previously, mesh.get_coordinate() returned concrete rank-specific integers that got baked into the compiled graph, producing different graphs on different ranks. Also refactors test_compile_on_one_rank.py to extract graph-comparison helpers (_assert_graphs_identical_across_ranks, _compile_and_capture_graph) and adds a test for DTensor random op graph consistency. Authored with Claude. [ghstack-poisoned]
Use mesh._sym_get_coordinate() in _compute_rng_offsets so that RNG offset values become symbolic SymInts (via _runtime_compute_coordinate_on_dim) when compile_on_one_rank is active. Previously, mesh.get_coordinate() returned concrete rank-specific integers that got baked into the compiled graph, producing different graphs on different ranks. Also refactors test_compile_on_one_rank.py to extract graph-comparison helpers (_assert_graphs_identical_across_ranks, _compile_and_capture_graph) and adds a test for DTensor random op graph consistency. Authored with Claude. ghstack-source-id: 7a7bbae Pull Request resolved: #177447
Use mesh._sym_get_coordinate() in _compute_rng_offsets so that RNG offset values become symbolic SymInts (via _runtime_compute_coordinate_on_dim) when compile_on_one_rank is active. Previously, mesh.get_coordinate() returned concrete rank-specific integers that got baked into the compiled graph, producing different graphs on different ranks. Also refactors test_compile_on_one_rank.py to extract graph-comparison helpers (_assert_graphs_identical_across_ranks, _compile_and_capture_graph) and adds a test for DTensor random op graph consistency. Authored with Claude. [ghstack-poisoned]
Use mesh._sym_get_coordinate() in _compute_rng_offsets so that RNG offset values become symbolic SymInts (via _runtime_compute_coordinate_on_dim) when compile_on_one_rank is active. Previously, mesh.get_coordinate() returned concrete rank-specific integers that got baked into the compiled graph, producing different graphs on different ranks. Also refactors test_compile_on_one_rank.py to extract graph-comparison helpers (_assert_graphs_identical_across_ranks, _compile_and_capture_graph) and adds a test for DTensor random op graph consistency. Authored with Claude. ghstack-source-id: 0f5f81d Pull Request resolved: #177447
Use mesh._sym_get_coordinate() in _compute_rng_offsets so that RNG offset values become symbolic SymInts (via _runtime_compute_coordinate_on_dim) when compile_on_one_rank is active. Previously, mesh.get_coordinate() returned concrete rank-specific integers that got baked into the compiled graph, producing different graphs on different ranks. Also refactors test_compile_on_one_rank.py to extract graph-comparison helpers (_assert_graphs_identical_across_ranks, _compile_and_capture_graph) and adds a test for DTensor random op graph consistency. Authored with Claude. [ghstack-poisoned]
Use mesh._sym_get_coordinate() in _compute_rng_offsets so that RNG offset values become symbolic SymInts (via _runtime_compute_coordinate_on_dim) when compile_on_one_rank is active. Previously, mesh.get_coordinate() returned concrete rank-specific integers that got baked into the compiled graph, producing different graphs on different ranks. Also refactors test_compile_on_one_rank.py to extract graph-comparison helpers (_assert_graphs_identical_across_ranks, _compile_and_capture_graph) and adds a test for DTensor random op graph consistency. Authored with Claude. ghstack-source-id: 328f827 Pull Request resolved: #177447
Use mesh._sym_get_coordinate() in _compute_rng_offsets so that RNG offset values become symbolic SymInts (via _runtime_compute_coordinate_on_dim) when compile_on_one_rank is active. Previously, mesh.get_coordinate() returned concrete rank-specific integers that got baked into the compiled graph, producing different graphs on different ranks. Also refactors test_compile_on_one_rank.py to extract graph-comparison helpers (_assert_graphs_identical_across_ranks, _compile_and_capture_graph) and adds a test for DTensor random op graph consistency. Authored with Claude. [ghstack-poisoned]
Use mesh._sym_get_coordinate() in _compute_rng_offsets so that RNG offset values become symbolic SymInts (via _runtime_compute_coordinate_on_dim) when compile_on_one_rank is active. Previously, mesh.get_coordinate() returned concrete rank-specific integers that got baked into the compiled graph, producing different graphs on different ranks. Also refactors test_compile_on_one_rank.py to extract graph-comparison helpers (_assert_graphs_identical_across_ranks, _compile_and_capture_graph) and adds a test for DTensor random op graph consistency. Authored with Claude. ghstack-source-id: 991e325 Pull Request resolved: #177447
Use mesh._sym_get_coordinate() in _compute_rng_offsets so that RNG offset values become symbolic SymInts (via _runtime_compute_coordinate_on_dim) when compile_on_one_rank is active. Previously, mesh.get_coordinate() returned concrete rank-specific integers that got baked into the compiled graph, producing different graphs on different ranks. Also refactors test_compile_on_one_rank.py to extract graph-comparison helpers (_assert_graphs_identical_across_ranks, _compile_and_capture_graph) and adds a test for DTensor random op graph consistency. Authored with Claude. [ghstack-poisoned]
Use mesh._sym_get_coordinate() in _compute_rng_offsets so that RNG offset values become symbolic SymInts (via _runtime_compute_coordinate_on_dim) when compile_on_one_rank is active. Previously, mesh.get_coordinate() returned concrete rank-specific integers that got baked into the compiled graph, producing different graphs on different ranks. Also refactors test_compile_on_one_rank.py to extract graph-comparison helpers (_assert_graphs_identical_across_ranks, _compile_and_capture_graph) and adds a test for DTensor random op graph consistency. Authored with Claude. ghstack-source-id: 52b8581 Pull Request resolved: #177447
Use mesh._sym_get_coordinate() in _compute_rng_offsets so that RNG offset values become symbolic SymInts (via _runtime_compute_coordinate_on_dim) when compile_on_one_rank is active. Previously, mesh.get_coordinate() returned concrete rank-specific integers that got baked into the compiled graph, producing different graphs on different ranks. Also refactors test_compile_on_one_rank.py to extract graph-comparison helpers (_assert_graphs_identical_across_ranks, _compile_and_capture_graph) and adds a test for DTensor random op graph consistency. Authored with Claude. [ghstack-poisoned]
Use mesh._sym_get_coordinate() in _compute_rng_offsets so that RNG offset values become symbolic SymInts (via _runtime_compute_coordinate_on_dim) when compile_on_one_rank is active. Previously, mesh.get_coordinate() returned concrete rank-specific integers that got baked into the compiled graph, producing different graphs on different ranks. Also refactors test_compile_on_one_rank.py to extract graph-comparison helpers (_assert_graphs_identical_across_ranks, _compile_and_capture_graph) and adds a test for DTensor random op graph consistency. Authored with Claude. ghstack-source-id: a6c27a8 Pull Request resolved: #177447
Use mesh._sym_get_coordinate() in _compute_rng_offsets so that RNG offset values become symbolic SymInts (via _runtime_compute_coordinate_on_dim) when compile_on_one_rank is active. Previously, mesh.get_coordinate() returned concrete rank-specific integers that got baked into the compiled graph, producing different graphs on different ranks. Also refactors test_compile_on_one_rank.py to extract graph-comparison helpers (_assert_graphs_identical_across_ranks, _compile_and_capture_graph) and adds a test for DTensor random op graph consistency. Authored with Claude. [ghstack-poisoned]
Use mesh._sym_get_coordinate() in _compute_rng_offsets so that RNG offset values become symbolic SymInts (via _runtime_compute_coordinate_on_dim) when compile_on_one_rank is active. Previously, mesh.get_coordinate() returned concrete rank-specific integers that got baked into the compiled graph, producing different graphs on different ranks. Also refactors test_compile_on_one_rank.py to extract graph-comparison helpers (_assert_graphs_identical_across_ranks, _compile_and_capture_graph) and adds a test for DTensor random op graph consistency. Authored with Claude. ghstack-source-id: 8d87e47 Pull Request resolved: #177447
Use mesh._sym_get_coordinate() in _compute_rng_offsets so that RNG offset values become symbolic SymInts (via _runtime_compute_coordinate_on_dim) when compile_on_one_rank is active. Previously, mesh.get_coordinate() returned concrete rank-specific integers that got baked into the compiled graph, producing different graphs on different ranks. Also refactors test_compile_on_one_rank.py to extract graph-comparison helpers (_assert_graphs_identical_across_ranks, _compile_and_capture_graph) and adds a test for DTensor random op graph consistency. Authored with Claude. [ghstack-poisoned]
Use mesh._sym_get_coordinate() in _compute_rng_offsets so that RNG offset values become symbolic SymInts (via _runtime_compute_coordinate_on_dim) when compile_on_one_rank is active. Previously, mesh.get_coordinate() returned concrete rank-specific integers that got baked into the compiled graph, producing different graphs on different ranks. Also refactors test_compile_on_one_rank.py to extract graph-comparison helpers (_assert_graphs_identical_across_ranks, _compile_and_capture_graph) and adds a test for DTensor random op graph consistency. Authored with Claude. ghstack-source-id: af73b79 Pull Request resolved: #177447
|
test/distributed/tensor/test_random_ops.py::DistTensorRandomOpTest::test_pipeline_parallel_manual_seed is a pre-existing failure in trunk |
Merge startedYour change will be merged while ignoring the following 3 checks: inductor / inductor-cpu-test / test (cpu_inductor_torchbench, 1, 2, linux.2xlarge.amx, unstable), inductor / inductor-test / test (inductor_torchbench, 2, 2, linux.g5.4xlarge.nvidia.gpu), dtensor / dtensor-test / test (dtensor, 1, 1, lf.linux.g5.12xlarge.nvidia.gpu) Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team |
…177447) Use mesh._sym_get_coordinate() in _compute_rng_offsets so that RNG offset values become symbolic SymInts (via _runtime_compute_coordinate_on_dim) when compile_on_one_rank is active. Previously, mesh.get_coordinate() returned concrete rank-specific integers that got baked into the compiled graph, producing different graphs on different ranks. Also refactors test_compile_on_one_rank.py to extract graph-comparison helpers (_assert_graphs_identical_across_ranks, _compile_and_capture_graph) and adds a test for DTensor random op graph consistency. Authored with Claude. Pull Request resolved: pytorch#177447 Approved by: https://github.com/yiming0416 ghstack dependencies: pytorch#177446
…177447) Use mesh._sym_get_coordinate() in _compute_rng_offsets so that RNG offset values become symbolic SymInts (via _runtime_compute_coordinate_on_dim) when compile_on_one_rank is active. Previously, mesh.get_coordinate() returned concrete rank-specific integers that got baked into the compiled graph, producing different graphs on different ranks. Also refactors test_compile_on_one_rank.py to extract graph-comparison helpers (_assert_graphs_identical_across_ranks, _compile_and_capture_graph) and adds a test for DTensor random op graph consistency. Authored with Claude. Pull Request resolved: pytorch#177447 Approved by: https://github.com/yiming0416 ghstack dependencies: pytorch#177446
Stack from ghstack (oldest at bottom):
Use mesh._sym_get_coordinate() in _compute_rng_offsets so that RNG
offset values become symbolic SymInts (via _runtime_compute_coordinate_on_dim)
when compile_on_one_rank is active. Previously, mesh.get_coordinate()
returned concrete rank-specific integers that got baked into the compiled
graph, producing different graphs on different ranks.
Also refactors test_compile_on_one_rank.py to extract graph-comparison
helpers (_assert_graphs_identical_across_ranks, _compile_and_capture_graph)
and adds a test for DTensor random op graph consistency.
Authored with Claude.