
Fix many type mismatches in the CUDA version of calc_digamma and calc_trigamma #25791

Closed
xuhdev wants to merge 6 commits into pytorch:master from xuhdev:cuda-gamma

Conversation

xuhdev (Collaborator) commented Sep 6, 2019

- There are some missing casts. (A sketch of the kind of fix appears after this list.)
- Functions like ::log and ::sin may always invoke the double version on the host. For
  example, compiling the following code:

  ```c++
  #include <cmath>

  float log_float(float f) {
      return ::logf(f);
  }

  double log_double(double f) {
      return ::log(f);
  }

  float log_float2(float f) {
      return ::log(f);
  }

  float log_float3(float f) {
      return std::log(f);
  }
  ```

  with `g++ -c -O3` produces:

  ```asm
  log_float(float):
          jmp     logf
  log_double(double):
          jmp     log
  log_float2(float):
          subq    $8, %rsp
          cvtss2sd        %xmm0, %xmm0
          call    log
          addq    $8, %rsp
          cvtsd2ss        %xmm0, %xmm0
          ret
  log_float3(float):
          jmp     logf
  ```

  Note that log_float2 delegates the call to the double version of log
  (surrounded by casts), while log_float3 correctly delegates the call to
  logf. See https://godbolt.org/z/KsRWwW
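
  For readers skimming the diff, here is a minimal, hedged sketch of the kind of change described above; the helper name poly_log_example, the use of the Euler-Mascheroni constant, and main are illustrative only and are not the PyTorch code being patched:

  ```c++
  // Hedged sketch, not the PR's code: illustrates both classes of fixes,
  // i.e. using std:: math overloads and casting double literals to scalar_t.
  #include <cmath>

  template <typename scalar_t>
  scalar_t poly_log_example(scalar_t x) {
    // Before (problematic when scalar_t is float):
    //   return ::log(x) + 0.5772156649015329 * x;
    // ::log may resolve to the double-only C function, and the bare double
    // literal promotes the whole expression to double.
    //
    // After: std::log selects the float overload for float arguments, and
    // the literal is cast to scalar_t, so the arithmetic stays in scalar_t.
    return std::log(x) + static_cast<scalar_t>(0.5772156649015329) * x;
  }

  int main() {
    float  f = poly_log_example(2.0f);  // resolves to logf on typical hosts
    double d = poly_log_example(2.0);   // resolves to log
    (void)f;
    (void)d;
    return 0;
  }
  ```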

@xuhdev xuhdev requested a review from ifedan September 6, 2019 18:52
@pytorchbot pytorchbot added module: cuda Related to torch.cuda, and CUDA support in general module: operators labels Sep 6, 2019
@xuhdev xuhdev changed the title Fix many type mismatches in the CUDA version of calc_digamma and calc… Fix many type mismatches in the CUDA version of calc_digamma and calc_trigamma Sep 6, 2019
xuhdev (Collaborator, Author) commented Sep 8, 2019

@pytorchbot rebase this please

xuhdev (Collaborator, Author) commented Sep 9, 2019

@pytorchbot rebase this please

xuhdev (Collaborator, Author) commented Sep 13, 2019

@pytorchbot rebase this please

```diff
 }

-bool x_is_integer = x == ::floor(x);
+bool x_is_integer = x == std::floor(x);
```

Contributor

I'm a little confused. Does gcc compile this, or does nvcc do it? The code ultimately gets run on device, right?

xuhdev (Collaborator, Author) replied Sep 13, 2019

The function is defined as __host__ __device__, which means it is compiled twice: once for the host and once for the device.

xuhdev (Collaborator, Author) replied Sep 13, 2019

@zou3519 The main point of this kind of change in this PR is that the CUDA documentation is silent on this issue (given that std::floor exists), and that assuming ::floor is the same as std::floor is inconsistent with standard C++. I think it would be best to be more compliant here to avoid a potential issue; at the very least, it doesn't hurt.

Contributor

Okay, to make sure I understand what's going on: where is floor in ::floor coming from?

xuhdev (Collaborator, Author)

It can come from a C definition (as declared in a C header file, which is solely defined for double). The standard C++ library is silent on ::floor, while it defines std::floor. Does this answer your question?

Contributor

That does, thanks for the clarification
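
As a standalone illustration of the behavior discussed in this thread (a hedged sketch, not PyTorch code): whether ::floor offers a float overload is implementation-dependent, while std::floor is guaranteed by <cmath> to provide one.

```c++
// Hedged illustration: if ::floor resolves to the C function, it takes and
// returns double, so a float argument is promoted; std::floor always has a
// float overload and stays in float.
#include <cmath>
#include <cstdio>

int main() {
  float x = 2.7f;
  double via_global = ::floor(x);     // may go through double
  float  via_std    = std::floor(x);  // float overload guaranteed by <cmath>
  std::printf("%f %f\n", via_global, static_cast<double>(via_std));
  return 0;
}
```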

xuhdev (Collaborator, Author) commented Sep 16, 2019

@pytorchbot rebase this please

xuhdev (Collaborator, Author) commented Sep 17, 2019

@pytorchbot rebase this please

xuhdev (Collaborator, Author) commented Sep 17, 2019

@pytorchbot merge this please

The CI failure is likely unrelated, because this PR does not change any GPU code, and the change to the CPU code is very local.

@pytorchbot pytorchbot added the merge-this-please Was marked for merge with @pytorchbot merge this please label Sep 17, 2019
facebook-github-bot (Contributor) left a comment

@izdeby has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

facebook-github-bot (Contributor)

@izdeby merged this pull request in 13b5446.

zdevito pushed a commit to zdevito/ATen that referenced this pull request Sep 18, 2019
Fix many type mismatches in the CUDA version of calc_digamma and calc_trigamma (#25791)

Pull Request resolved: pytorch/pytorch#25791

Differential Revision: D17452312

Pulled By: izdeby

fbshipit-source-id: 6276a011a373cd7cb144f9ecd84116aa206e7d1b
mruberry (Collaborator) commented:

Unlanding this with #26444.

The affected code does run on the GPU and likely broke the ROCm builds.

std:: vs. :: is an important distinction on GPUs. See some comments in https://github.com/pytorch/pytorch/blob/master/aten/src/ATen/native/Distributions.h#L7.
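
For context, here is a hedged sketch of one common way to keep the host and device compilation passes consistent; this is illustrative only and not necessarily how the referenced Distributions.h code is written. __CUDA_ARCH__ (and __HIP_DEVICE_COMPILE__ for ROCm) is only defined during the device pass, and compat_floor is a hypothetical name.

```c++
// Hedged sketch: dispatch between device math and std:: math per pass.
// Requires nvcc or hipcc to compile because of __host__ __device__.
#include <cmath>

template <typename scalar_t>
__host__ __device__ scalar_t compat_floor(scalar_t x) {
#if defined(__CUDA_ARCH__) || defined(__HIP_DEVICE_COMPILE__)
  // Device pass: the CUDA/HIP headers provide overloaded ::floor for
  // float and double in device code.
  return ::floor(x);
#else
  // Host pass: std::floor has the proper float/double overloads.
  return std::floor(x);
#endif
}
```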

xuhdev (Collaborator, Author) commented Sep 19, 2019

Oops, sorry about this; it looks like I made the merge comment on the wrong thread. Yes, it is causing the ROCm failure. I'll look into this, since ROCm works with some std:: functions but not all of them.

laurentdupin pushed a commit to laurentdupin/pytorch that referenced this pull request Apr 24, 2026

Labels

merge-this-please, Merged, module: cuda, open source
