Fix CUDA division by a scalar on large arrays. by colesbury · Pull Request #12023 · pytorch/pytorch

colesbury · 2018-09-24T19:55:22Z

The gpu_unary_kernel function was not handling arrays that
cannot use 32-bit indexing. This functions was only called directly
by CUDA division by a scalar. Other arithmetic operations go through
gpu_binary_kernel, which already properly handled large arrays.

This bug sometimes manifested as a crash and sometimes as an incorrect
answer.

Fixes #11788

The gpu_unary_kernel function was not handling arrays that cannot use 32-bit indexing. This functions was only called directly by CUDA division by a scalar. Other arithmetic operations go through gpu_binary_kernel, which already properly handled large arrays. This bug sometimes manifested as a crash and sometimes as an incorrect answer. Fixes pytorch#11788

facebook-github-bot

colesbury has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

Summary: The gpu_unary_kernel function was not handling arrays that cannot use 32-bit indexing. This functions was only called directly by CUDA division by a scalar. Other arithmetic operations go through gpu_binary_kernel, which already properly handled large arrays. This bug sometimes manifested as a crash and sometimes as an incorrect answer. Fixes #11788 Pull Request resolved: pytorch/pytorch#12023 Differential Revision: D10034017 Pulled By: colesbury fbshipit-source-id: b17300f327de54035746bf02f576766007c9b144

colesbury requested review from apaszke, ezyang, gchanan, soumith and zdevito as code owners September 24, 2018 19:55

soumith approved these changes Sep 24, 2018

View reviewed changes

colesbury added 2 commits September 24, 2018 13:15

Don't call get_device_properties if CUDA is not available

deef0ce

Increase large tensor threshold

484d865

facebook-github-bot reviewed Sep 25, 2018

View reviewed changes

facebook-github-bot closed this in b263078 Sep 25, 2018

ezyang added the merged label Jun 26, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix CUDA division by a scalar on large arrays.#12023

Fix CUDA division by a scalar on large arrays.#12023
colesbury wants to merge 3 commits intopytorch:masterfrom
colesbury:cuda_div_scalar

colesbury commented Sep 24, 2018

Uh oh!

facebook-github-bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

colesbury commented Sep 24, 2018

Uh oh!

facebook-github-bot left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants