
Introduce int32 index_fill and index_copy indices #142160

Open
rpsilva-aws wants to merge 1 commit into pytorch:main from rpsilva-aws:rpsilva_pt_int32_v2

Conversation

@rpsilva-aws
Contributor

Fixes #142090
This PR extends index_fill and index_copy operations to support int32 indices in addition to the existing int64 support:

  1. Memory efficiency and potential performance gains when handling a large number of indices, particularly when transferring them to accelerator backends. In some cases the backend compiler may not support 64-bit integers, and in others 64-bit indices can cause conflicting casts when used with TorchXLA. I do not see an immediate need to extend this to the dim argument, since it is a scalar tied to the API and its cost is negligible.
  2. Framework interoperability: as mentioned above, this gives more flexibility when working with TorchXLA, since some operations require the tensors' physical (raw) representation to match that of the XLA tensors. For Neuron in particular, XLA generates S32 types that are not compatible with some operations (e.g. casts) when converting across tensors.
  3. Consistency with other APIs, such as index_add and index_select, which already accept int32 indices (see the usage sketch below).
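
A minimal usage sketch of what this change enables (assuming the PR is applied; on current releases these calls require int64 indices):

    import torch

    x = torch.zeros(5, 3)
    src = torch.ones(2, 3)

    # With this PR, int32 index tensors are accepted in addition to int64.
    idx = torch.tensor([0, 4], dtype=torch.int32)

    x.index_copy_(0, idx, src)   # copy rows of src into rows 0 and 4 of x
    x.index_fill_(0, idx, -1.0)  # then fill those same rows with -1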

cc @jgong5 @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10

@pytorch-bot

pytorch-bot Bot commented Dec 5, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/142160

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure

As of commit 763b7e4 with merge base 0318589:

NEW FAILURE - The following job has failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@pytorch-bot pytorch-bot Bot added the module: cpu CPU specific problem (e.g., perf, algorithm) label Dec 5, 2024
@rpsilva-aws
Contributor Author

@pytorchbot label "release notes: cuda"

@pytorch-bot pytorch-bot Bot added the release notes: cuda release notes category label Dec 5, 2024
@rpsilva-aws
Contributor Author

FYI: @albanD, reopened #142090

@rpsilva-aws
Contributor Author

cc: @miladm for visibility, since this is related to (and somewhat an extension of) pytorch/xla#8450.

@albanD
Collaborator

albanD commented Dec 5, 2024

@eqy can you take a look at this?

@rpsilva-aws
Contributor Author

rpsilva-aws commented Dec 6, 2024

@albanD, @eqy is there any way we can get this into 2.6? We need it for our TorchXLA counterpart PRs.

@netlify

netlify Bot commented Dec 6, 2024

Deploy Preview for chimerical-cranachan-793287 ready!

Name Link
🔨 Latest commit 763b7e4
🔍 Latest deploy log https://app.netlify.com/sites/chimerical-cranachan-793287/deploys/67539246757645000865b5a8
😎 Deploy Preview https://deploy-preview-142160--chimerical-cranachan-793287.netlify.app

@rpsilva-aws
Contributor Author

rpsilva-aws commented Dec 6, 2024

Fixed a typo in the CUDA file, PTAL.

@rpsilva-aws
Contributor Author

@eqy Can you help restart the workflow? Thank you.

Collaborator

@eqy eqy left a comment


Would it make sense to add some trivial asserts that guarantee that the number of elements of the tensor doesn't exceed the max possible value of the provided index type?

@rpsilva-aws
Contributor Author

rpsilva-aws commented Dec 7, 2024

Would it make sense to add some trivial asserts that guarantee that the number of elements of the tensor doesn't exceed the max possible value of the provided index type?

The indices select elements along a specific dimension, and their values are implicitly capped by the index type. The index tensor can contain repeated values if I am not mistaken, so an assertion on its total number of elements wouldn't necessarily hold. I also thought of aligning the indexed dimension's size with the index type, but that doesn't necessarily need to hold either; we do already have a bounds check for each index:

          TORCH_CHECK_INDEX(idx >= -self_dim_size && idx < self_dim_size,
                            "index ", idx, " is out of bounds for dimension ",
                            dim, " with size ", self_dim_size);


@eqy
Collaborator

eqy commented Dec 7, 2024

Would it make sense to add some trivial asserts that guarantee that the number of elements of the tensor doesn't exceed the max possible value of the provided index type?

The indices select elements along a specific dimension, and their values are implicitly capped by the index type. The index tensor can contain repeated values if I am not mistaken, so an assertion on its total number of elements wouldn't necessarily hold. I also thought of aligning the indexed dimension's size with the index type, but that doesn't necessarily need to hold either; we do already have a bounds check for each index:

          TORCH_CHECK_INDEX(idx >= -self_dim_size && idx < self_dim_size,
                            "index ", idx, " is out of bounds for dimension ",
                            dim, " with size ", self_dim_size);

What I mean is, does it make sense to add a check against a dim with size > INT_MAX when 32-bit indices are used?

@rpsilva-aws
Contributor Author

rpsilva-aws commented Dec 8, 2024

Would it make sense to add some trivial asserts that guarantee that the number of elements of the tensor doesn't exceed the max possible value of the provided index type?

The indices select elements along a specific dimension, and their values are implicitly capped by the index type. The index tensor can contain repeated values if I am not mistaken, so an assertion on its total number of elements wouldn't necessarily hold. I also thought of aligning the indexed dimension's size with the index type, but that doesn't necessarily need to hold either; we do already have a bounds check for each index:

          TORCH_CHECK_INDEX(idx >= -self_dim_size && idx < self_dim_size,
                            "index ", idx, " is out of bounds for dimension ",
                            dim, " with size ", self_dim_size);

What I mean is, does it make sense to add a check against a dim with size > INT_MAX when 32-bit indices are used?

I see, thanks for the suggestion - if I am not mistaken, this is already in place. In the current kernel implementation, as we iterate through the indices, the existing TORCH_CHECK_INDEX macro already bounds-checks each index against the size of the indexed dimension (and an int32 index can never exceed INT_MAX in the first place). This should already make handling 32-bit indices safe: https://github.com/pytorch/pytorch/pull/142160/files#diff-8aa1a200ec63d23db422aa31b6dca1e6cb372887c43b064ef435210b1b0dec0aR3446. If you instead mean the dim argument itself, that only selects the dimension we operate on, so it isn't directly related. Let me know if this answers the question.
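
To illustrate the per-index bounds check discussed above in Python terms (an illustration only, not code added by this PR; the exact error type and message come from the TORCH_CHECK_INDEX call quoted earlier):

    import torch

    x = torch.zeros(4)
    # Each index is validated against the size of the indexed dimension,
    # so an out-of-range value fails loudly regardless of the index dtype.
    try:
        x.index_fill_(0, torch.tensor([7]), 1.0)  # 7 >= size 4 along dim 0
    except (IndexError, RuntimeError) as e:
        print(e)  # e.g. "index 7 is out of bounds for dimension 0 with size 4"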

@janeyx99 janeyx99 added the triaged This issue has been looked at by a team member, and triaged and prioritized into an appropriate module label Dec 9, 2024
@rpsilva-aws
Contributor Author

@eqy do you have any other concerns here? Let me know, in case this could still make it into 2.6.

@eqy
Collaborator

eqy commented Dec 10, 2024

My concern was not memory safety, but rather warning/alerting the user in cases where the index width could not be used to fully address a given dimension, e.g., dim size > 2**32 but index type int. In other words, does the "implicit capping" lead to potentially nonsensical use cases that should raise a warning or exception?

@eqy
Collaborator

eqy commented Dec 10, 2024

A trivial code example:

>>> import torch
>>> a = torch.empty(2**32, device='cuda', dtype=torch.uint8)
>>> a.index_fill_(0, torch.tensor([2**32 - 1], dtype=torch.int64, device='cuda'), -1)
tensor([  0,   0,   0,  ...,   0,   0, 255], device='cuda:0',
       dtype=torch.uint8)
>>> a.index_fill_(0, torch.tensor([2**32 - 1], dtype=torch.int64, device='cuda').to(torch.int32), -1)
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
IndexError: index_fill_(): Expected dtype int64 for index.

What would the second case do here? IMO relying on potential UB (e.g., assuming the conversion rolls over to a negative value) seems flaky.
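
One possible shape for the kind of guard being discussed (a hypothetical Python-level sketch; the helper name and error message are illustrative and not part of this PR):

    import torch

    def check_index_dtype_fits_dim(self: torch.Tensor, dim: int, index: torch.Tensor) -> None:
        # Hypothetical guard: if 32-bit indices are used, the indexed dimension
        # must be fully addressable with int32 values.
        if index.dtype == torch.int32 and self.size(dim) > torch.iinfo(torch.int32).max:
            raise IndexError(
                f"dimension {dim} has size {self.size(dim)}, which cannot be fully "
                f"addressed with int32 indices; use int64 indices instead"
            )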

@github-actions
Contributor

github-actions Bot commented Feb 8, 2025

Looks like this PR hasn't been updated in a while so we're going to go ahead and mark this as Stale.
Feel free to remove the Stale label if you feel this was a mistake.
If you are unable to remove the Stale label please contact a maintainer in order to do so.
If you want the bot to never mark this PR stale again, add the no-stale label.
Stale pull requests will automatically be closed after 30 days of inactivity.

@github-actions github-actions Bot added the Stale label Feb 8, 2025
@rpsilva-aws
Contributor Author

@pytorchbot label "no-stale"

@pytorch-bot pytorch-bot Bot added the no-stale label Feb 8, 2025
@jeffhataws

Keep this alive.

@jeffhataws

See #141994 for a related RFC.

@jeffhataws

My concern was not memory safety, but rather warning/alerting the user in cases where the index width could not be used to fully address a given dimension, e.g., dim size > 2**32 but index type int. In other words, does the "implicit capping" lead to potentially nonsensical use cases that should raise a warning or exception?

#142160 (comment)

@rpsilva-aws @eqy can the following be used to check the index bounds?

canUse32BitIndexMath

bool canUse32BitIndexMath(const TensorBase& t, int64_t max_elem) {

can_use_32bit_indexing

bool TensorIteratorBase::can_use_32bit_indexing() const {

@jeffhataws

@rpsilva-aws will you rebase?


Labels

module: cpu CPU specific problem (e.g., perf, algorithm) no-stale open source release notes: cuda release notes category Stale triaged This issue has been looked at by a team member, and triaged and prioritized into an appropriate module


6 participants