Non-deterministic parallel scatter reduction algorithms for scatter operations for CPU (sum, subtract, divide, multiply). #33389

@v0dro

🚀 Feature

Implement non-deterministic parallel scatter algorithms on CPU for the reduction operations sum, subtract, multiply and divide. This issue is a follow-up to #33065 and a prelude to #22378.

Motivation

See the linked comment on #22378 for more details.

Pitch

A scatter reduction API for addition, subtraction, multiplication and division that will have the following form:

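import torch

device = torch.device("cpu")  # this issue targets the CPU path
input = torch.zeros(4, 4, device=device)
src = torch.ones(2, 2, device=device)
index = torch.tensor([[1], [2]], device=device, dtype=torch.long)
input.scatter_(0, index, src, reduce="sum")  # reduce= is the proposed argument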
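To make the intended semantics concrete, here is a minimal pure-Python sketch of a sequential scatter with a reduction along dim 0. It is not PyTorch code: the helper `scatter_reduce_dim0` and the `REDUCERS` table are hypothetical names introduced only for illustration, and the update rule `target[index[i][j]][j] = op(target[index[i][j]][j], src[i][j])` is assumed from the existing `scatter_` semantics. The second half shows why a parallel version can be non-deterministic: when several source elements land in the same slot, the floating-point result depends on the order in which they are accumulated.

```python
import operator

# Hypothetical reference helper, not part of PyTorch: maps the reduction
# names from this issue to binary operators.
REDUCERS = {
    "sum": operator.add,
    "subtract": operator.sub,
    "multiply": operator.mul,
    "divide": operator.truediv,
}


def scatter_reduce_dim0(target, index, src, reduce):
    """Sequentially apply target[index[i][j]][j] = op(target[...], src[i][j])."""
    op = REDUCERS[reduce]
    for i, index_row in enumerate(index):
        for j, dst_row in enumerate(index_row):
            target[dst_row][j] = op(target[dst_row][j], src[i][j])
    return target


# Same shapes as the example in the pitch: a 4x4 target, a 2x2 source,
# and a 2x1 index, so only column 0 of src is read.
target = [[0.0] * 4 for _ in range(4)]
src = [[1.0, 1.0], [1.0, 1.0]]
index = [[1], [2]]
result = scatter_reduce_dim0(target, index, src, "sum")

# Order dependence: three values scattered into the same slot.
# Floating-point addition is not associative, so the two orders differ.
same_slot = [[0], [0], [0]]
order_a = scatter_reduce_dim0([[0.0]], same_slot, [[1e16], [1.0], [-1e16]], "sum")
order_b = scatter_reduce_dim0([[0.0]], same_slot, [[1e16], [-1e16], [1.0]], "sum")
```

A sequential loop always visits the source elements in the same order, so it always gives the same answer. A parallel implementation that lets threads update colliding slots in whatever order they arrive can return either result, which is the trade-off the title refers to.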

Labels

enhancement, triaged
