Refactor TensorAccessor for headeronly. #166855
pearu wants to merge 33 commits into gh/pearu/150/base from
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/166855
Note: Links to docs will display an error until the docs builds have been completed.
✅ You can merge normally! (1 Unrelated Failure)
As of commit 966eea7 with merge base d01a7b0. UNSTABLE - the following job is marked as unstable, possibly due to flakiness on trunk.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
janeyx99
left a comment
Thank you! This looks pretty good to me, but let's rename the headeronly APIs so they don't share names, to minimize confusion.
 /// at::TensorAccessor, for instance) when `final` specifier is used.
 template <typename T>
-class ArrayRef final : public HeaderOnlyArrayRef<T> {
+class ArrayRef : public HeaderOnlyArrayRef<T> {
The reason is given in the note before the class definition:
/// NOTE: ArrayRef cannot be derived from. Normally, we would use
/// `final` specifier to force this constraint at compile time.
/// However, Intel compiler does not recognize ArrayRef as a class
/// template (which is required in the definition of
/// at::TensorAccessor, for instance) when `final` specifier is
/// used. So, we cannot define ArrayRef as final because of the Intel
/// compiler issue.

When ArrayRef is defined as final, the xpu CI fails with:
2025-11-12T12:58:10.7402385Z [7169/8004] Building SYCL (Device) object torch_xpu_ops_gen_SparseSoftmaxKernels.cpp.o
2025-11-12T12:58:10.7405002Z FAILED: caffe2/aten_xpu/src/CMakeFiles/torch_xpu_ops.dir/ATen/native/sparse/xpu/sycl/torch_xpu_ops_gen_SparseSoftmaxKernels.cpp.o /var/lib/jenkins/workspace/build/caffe2/aten_xpu/src/CMakeFiles/torch_xpu_ops.dir/ATen/native/sparse/xpu/sycl/torch_xpu_ops_gen_SparseSoftmaxKernels.cpp.o
2025-11-12T12:58:10.7409903Z cd /var/lib/jenkins/workspace/build/caffe2/aten_xpu/src/CMakeFiles/torch_xpu_ops.dir/ATen/native/sparse/xpu/sycl && /opt/conda/envs/py_3.10/lib/python3.10/site-packages/cmake/data/bin/cmake -E make_directory /var/lib/jenkins/workspace/build/caffe2/aten_xpu/src/CMakeFiles/torch_xpu_ops.dir/ATen/native/sparse/xpu/sycl/. && /opt/conda/envs/py_3.10/lib/python3.10/site-packages/cmake/data/bin/cmake -D verbose:BOOL=OFF -D generated_file:STRING=/var/lib/jenkins/workspace/build/caffe2/aten_xpu/src/CMakeFiles/torch_xpu_ops.dir/ATen/native/sparse/xpu/sycl/./torch_xpu_ops_gen_SparseSoftmaxKernels.cpp.o -P /var/lib/jenkins/workspace/build/caffe2/aten_xpu/src/CMakeFiles/torch_xpu_ops.dir/ATen/native/sparse/xpu/sycl/torch_xpu_ops_gen_SparseSoftmaxKernels.cpp.o.Release.cmake
2025-11-12T12:58:10.7413388Z In file included from <command-line>:
2025-11-12T12:58:10.7414602Z /tmp/icx-8814ad46d2/SparseSoftmaxKernels-header-221199.h:1621:248: error: ‘c10::ArrayRef’ is not a template
2025-11-12T12:58:10.7416885Z 1621 | template <> struct KernelInfo<::sycl::detail::RoundedRangeKernel<::sycl::item<1, true>, 1, ::at::native::xpu::MaxRowKernelFunctor<double, long *, ::torch::headeronly::detail::GenericPackedTensorAccessor<::torch::headeronly::detail::TensorAccessor<::c10::ArrayRef<long>, double, 1, torch::headeronly::DefaultPtrTraits, long>, ::at::IndexBoundsCheck<2, long>, double, 2, torch::headeronly::DefaultPtrTraits, long>, double *>>> {
2025-11-12T12:58:10.7418863Z | ^~
cc @swolchok does the final specifier matter to keep around?
Sounds like a compiler bug, but no, I don't think `final` is gating optimization opportunities in the absence of virtual methods.
janeyx99
left a comment
Can you also add a kernel use case in libtorch_agnostic_extension in kernel.cpp that uses the HeaderOnlyPackedTensorAccessor in a more real-world way?
Yes, I have added mv_tensor_accessor with CPU and CUDA kernels using headeronly tensor accessors.
@pytorchbot merge
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
ghstack-source-id: a901b1d Pull Request resolved: pytorch/pytorch#166855
This PR moves the implementations of Tensor accessor classes to headeronly with the following modifications:
- Add ArrayRef and IndexBoundsCheck template parameters to refactor out the usages of `IntArrayRef` and `TORCH_CHECK_INDEX` from Tensor accessor implementations.
- Eliminate usage of `c10::irange` as it is not headeronly-compatible.
- Introduce `torch::headeronly::{TensorAccessorBase, TensorAccessor, GenericPackedTensorAccessorBase, GenericPackedTensorAccessor}` that are headeronly-equivalent to `at::{TensorAccessorBase, TensorAccessor, GenericPackedTensorAccessorBase, GenericPackedTensorAccessor}`. Both these sets of template classes use the original implementations from `torch::headeronly::detail`, which gain new template parameters `ArrayRefCls` and `IndexBoundsCheck` to facilitate the `at` and `torch::headeronly` implementations of ArrayRef and index checking.
TODO:
- ~when pytorch#164991 lands, eliminate the placeholder class HeaderOnlyArrayRef~ UPDATE: done.
Pull Request resolved: pytorch#166855
Approved by: https://github.com/janeyx99
Stack from ghstack (oldest at bottom):
cc @jbschlosser