
Test that TORCH_FEATURE_VERSION guards are used where needed#167962

Closed
mikaylagawarecki wants to merge 4 commits into gh/mikaylagawarecki/383/base from gh/mikaylagawarecki/383/head

Conversation

@mikaylagawarecki
Contributor

@mikaylagawarecki mikaylagawarecki commented Nov 17, 2025

Splits each torch library registration in the 2.10 folder into its own file -- I had a script that parsed kernel.cpp to do this, but I felt like forcing this responsibility on the user might be less error-prone.

Compiles each file targeting 2.9 and asserts that compilation fails. (There are two 2.9 kernels we use as negative tests where compilation is expected to succeed.)
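The per-file check described above can be sketched as follows (a minimal illustration; `classify_result` and its argument names are hypothetical, not the actual harness code in this PR):

```python
# Hypothetical sketch of the per-file decision: each kernel .cpp from the
# 2.10 extension is compiled with TORCH_TARGET_VERSION pinned to 2.9.0,
# and the expected outcome depends on whether the file is a negative test.

def classify_result(compiled_ok: bool, expects_2_9_success: bool) -> str:
    """Decide what a compile attempt against 2.9 means for one file."""
    if expects_2_9_success:
        # Negative-test kernels use only 2.9-era APIs, so targeting 2.9
        # must succeed; a failure here means the harness itself is broken.
        return "ok" if compiled_ok else "harness-error"
    # Kernels exercising 2.10 features must fail to compile when the
    # target is pinned to 2.9; success means a missing version guard
    # (or the kernel belongs in the 2.9 extension instead).
    return "missing-guard" if compiled_ok else "ok"

# A 2.10-only kernel that still compiles against 2.9 is exactly the bug
# this test exists to catch:
print(classify_result(compiled_ok=True, expects_2_9_success=False))  # → missing-guard
```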

Stack from ghstack (oldest at bottom):

@pytorch-bot

pytorch-bot bot commented Nov 17, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/167962

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (1 Unrelated Failure)

As of commit f11e191 with merge base 1c04a43:

UNSTABLE - The following job is marked as unstable, possibly due to flakiness on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@@ -0,0 +1,40 @@
// This is duplicated from the libtorch_agnostic_2_9_extension
Contributor

mv_tensor_accessor_cpu was added in 2_10, no?

Contributor

yea hmmm should this code be in 2_10 from the start? Or no, because it's header only?

This is a bit of a confusing counterexample because it is headeronly so it retroactively works, but a more clear example might be something that was definitely landed in 2_9?

Contributor Author

It works in 2.9 because it only uses things in headeronly

There's nothing being passed across the shim boundary, and stable::Tensor.sizes() and stable::Tensor.strides() also work in 2.9, is my understanding

My workflow in the below PR ensures that this test is able to run on 2.9

Contributor Author

@mikaylagawarecki mikaylagawarecki Nov 18, 2025

@janeyx99

This is a bit of a confusing counterexample because it is headeronly so it retroactively works, but a more clear example might be something that was definitely landed in 2_9?

I agree that it is confusing, but I disagree that it's not a good sanity-check test; this function demonstrates the exact case that we want this test class to catch:

  • Current version is 2.x, user adds test and adds it to libtorch_agnostic_2_x_extension thinking this is a 2.x feature
  • We want this test to catch this (due to compilation succeeding) and prompt the user to either add version guards or move this test to libtorch_agnostic_2_(x-1)_extension
  • If the libtorch_agnostic_targetting workflow succeeds, we've confirmed the test belongs in libtorch_agnostic_2_(x-1)_extension, otherwise we have signal for the user that they missed version guards

wdyt, does that motivate the rationale for choosing this kernel better? Also happy to add others in a followup if you disagree

@@ -0,0 +1,47 @@
// This is duplicated from the libtorch_agnostic_2_9_extension
Contributor

same

using torch::stable::Tensor;

// Declare my__foreach_mul (defined in my__foreach_mul.cpp)
extern std::vector<Tensor> my__foreach_mul(
Contributor

Is this code resilient to linking order? e.g., if this code is linked before my__foreach_mul.cpp, will it break?

Contributor Author

Hm good question, I didn't run into any issues with this so far, according to claude it should not matter :)

  • **setup.py** compiles all .cpp files in the csrc directory as separate object files
  • The files are linked together into a single shared library (_C.so)
  • The linker resolves the symbol **my__foreach_mul** regardless of which object file comes first in the link order


relevant_errors = self._extract_relevant_errors(error_msg)
if relevant_errors:
    print("\n Unexpected compilation errors for mv_tensor_accessor_cpu:")
    for err in relevant_errors[:10]:
Contributor

is the 10 arbitrary haha

Contributor Author

yea removed all these

success,
f"mv_tensor_accessor_cpu.cpp failed to compile with TORCH_TARGET_VERSION=2.9.0. "
f"This file is expected to work with 2.9.0 since it doesn't use 2.10+ features. "
f"Error: {error_msg[:500]}",
Contributor

and 500 here too--what's the normal max length you'd expect?

Contributor Author

removed

return relevant_errors


# Dynamically create test methods for each .cpp and .cu file
Contributor

Can we still run an individual test with -k for dynamically created test methods? (I'm guessing yes)

Contributor Author

Yep, I was able to run

python test/cpp_extensions/libtorch_agnostic_2_10_extension/test_version_compatibility.py -k test_my_view_requires_2_10
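The dynamic test-method generation discussed above can be sketched like this (class, helper, and file names here are illustrative, not the actual harness code): methods attached to the class via `setattr` before discovery look like ordinary named tests, which is why `-k` filtering works.

```python
# Illustrative sketch: generate one test method per kernel source file.
import unittest

class FunctionVersionCompatibilityTest(unittest.TestCase):
    pass

def _make_test(filename):
    def test(self):
        # The real harness would compile `filename` with
        # TORCH_TARGET_VERSION=2.9.0 and assert on the result;
        # here we just stand in with a trivial check.
        self.assertTrue(filename.endswith(".cpp"))
    return test

for name in ["my_view.cpp", "my_hypothetical_kernel.cpp"]:
    test_name = f"test_{name.removesuffix('.cpp')}_requires_2_10"
    # Attaching before test discovery means unittest's -k pattern
    # matching sees each generated method as a normal named test.
    setattr(FunctionVersionCompatibilityTest, test_name, _make_test(name))

print(hasattr(FunctionVersionCompatibilityTest, "test_my_view_requires_2_10"))  # → True
```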

@mikaylagawarecki mikaylagawarecki added the ciflow/trunk Trigger trunk jobs on your pull request label Nov 17, 2025
Khanaksahu pushed a commit to Khanaksahu/pytorch-fork that referenced this pull request Nov 17, 2025
mikaylagawarecki added a commit that referenced this pull request Nov 18, 2025
This is tested by #167962 which ensures we get compilation errors when using functions that convert Device/HeaderOnlyArrayRef to StableIValue and target 2.9




[ghstack-poisoned]
mikaylagawarecki added a commit that referenced this pull request Nov 18, 2025
@mikaylagawarecki
Contributor Author

@pytorchbot merge

@pytorchmergebot
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status here.

pytorchmergebot pushed a commit that referenced this pull request Nov 18, 2025
This is tested by #167962 which ensures we get compilation errors when using functions that convert Device/HeaderOnlyArrayRef to StableIValue and target 2.9

Pull Request resolved: #167802
Approved by: https://github.com/janeyx99
ghstack dependencies: #168025
@jeffdaily
Collaborator

This broke ROCm, but forward fix incoming.

jeffdaily added a commit to ROCm/pytorch that referenced this pull request Nov 18, 2025
Unclear which PR in the ghstack caused the ROCm failure.
Stack was (oldest at bottom):
 - pytorch#167962
 - pytorch#167804
 - pytorch#167803
 - pytorch#167802
 - pytorch#168025
pytorchmergebot pushed a commit that referenced this pull request Nov 19, 2025
Unclear which PR in the ghstack caused the ROCm failure. Stack was (oldest at bottom):
 - #167962
 - #167804
 - #167803
 - #167802
 - #168025

Fixes the following test:

PYTORCH_TEST_WITH_ROCM=1 python test/cpp_extensions/libtorch_agnostic_2_10_extension/test_version_compatibility.py FunctionVersionCompatibilityTest.test_mv_tensor_accessor_cuda_works_with_2_9

Pull Request resolved: #168087
Approved by: https://github.com/jeffdaily, https://github.com/janeyx99

Co-authored-by: Jeff Daily <jeff.daily@amd.com>
Co-authored-by: Jane (Yuan) Xu <31798555+janeyx99@users.noreply.github.com>
pytorchmergebot pushed a commit that referenced this pull request Nov 20, 2025
~This PR does change the semantics of the >> operator by using STD_TORCH_CHECK to throw the error instead of TORCH_CHECK. Jane (who is writing this message) thinks it is okay because it is the error case when an invalid MemoryFormat or Layout is getting passed into >>, so the UX benefits of TORCH_CHECK over STD_TORCH_CHECK there are not significant enough to warrant making a new copy of Layout and MemoryFormat's >> APIs.~

Never mind! We shouldn't change TORCH_CHECK to STD_TORCH_CHECK for core usage ever, cuz the traceback info and c10::Error is very much desired!! So the solution is to not migrate the >>s. I pushed new commits to the stack to remove the >> code, but for reference, 8a30179 has all the code that I ended up deleting.

Pull Request resolved: #168034
Approved by: https://github.com/janeyx99
ghstack dependencies: #168025, #167802, #167803, #167804, #167962

Co-authored-by: Jane Xu <janeyx@meta.com>
JacobSzwejbka pushed a commit that referenced this pull request Dec 8, 2025
tiendatngcs pushed a commit to tiendatngcs/pytorch-Dec25 that referenced this pull request Dec 10, 2025
@github-actions github-actions bot deleted the gh/mikaylagawarecki/383/head branch December 19, 2025 02:20

Labels

ciflow/trunk (Trigger trunk jobs on your pull request) · Merged · topic: not user facing

4 participants