Port roi_align to actually use dispatcher #2366
Conversation
This is still registering everything as catchalls, so we're really just moving deck chairs around, but payoff is coming soon. Signed-off-by: Edward Z. Yang <ezyang@fb.com>
Force-pushed 3ce660f to a0e6f91
CUDA is not done yet, needs similar treatment.
Force-pushed a0e6f91 to ad105f1
Is that out of date? The CUDA ops look ok/symmetric with the CPU ops. The diffs make sense given what we discussed on Slack. I think it will be straightforward to add an autocast layer as well, which will cast inputs and then redispatch to one of the dispatch nexuses. If this is mergeable, I can wait for merge, then pull it to add an autocast layer and flesh out other ops similarly.
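For reference, the autocast layer being described would presumably look roughly like this. This is a hedged sketch, not code from the PR: the `roi_align` signature mirrors the op registered in this diff, while the Autocast registration idiom and `roi_align_autocast` name are assumptions about what the follow-up would do.

```cpp
#include <torch/library.h>
#include <ATen/autocast_mode.h>

// Declared elsewhere in torchvision: the dispatcher entry point from this PR.
at::Tensor roi_align(
    const at::Tensor& input, const at::Tensor& rois, double spatial_scale,
    int64_t pooled_height, int64_t pooled_width, int64_t sampling_ratio,
    bool aligned);

// Hypothetical autocast wrapper: cast the tensor inputs, then redispatch so
// the call falls through to whichever backend kernel (CPU/CUDA) is registered.
at::Tensor roi_align_autocast(
    const at::Tensor& input, const at::Tensor& rois, double spatial_scale,
    int64_t pooled_height, int64_t pooled_width, int64_t sampling_ratio,
    bool aligned) {
  // Exclude the Autocast key so the redispatch below doesn't loop back here.
  c10::impl::ExcludeDispatchKeyGuard no_autocast(c10::DispatchKey::Autocast);
  // cached_cast reuses casts within an autocast region; fp32 chosen here as
  // a conservative default for a pooling-style op.
  return roi_align(
             at::autocast::cached_cast(at::kFloat, input),
             at::autocast::cached_cast(at::kFloat, rois),
             spatial_scale, pooled_height, pooled_width, sampling_ratio,
             aligned)
      .to(input.scalar_type());
}

TORCH_LIBRARY_IMPL(torchvision, Autocast, m) {
  m.impl("roi_align", roi_align_autocast);
}
```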
```cpp
ctx->saved_data["input_shape"] = input.sizes();
ctx->save_for_backward({rois});
auto result = ROIAlign_forward(
at::AutoNonVariableTypeMode g;
```
Why an explicit autograd-disabling guard here? Does torch::autograd::Function not disable autograd automatically around forward?
The pre-PR forward doesn't use an explicit guard, and afaik the Python-side torch.autograd.Function does disable autograd around its forward method. Both of these lead me to expect torch::autograd::Function also disables autograd around forward. (If it doesn't, I think it should. Aligning its behavior with the Python version makes sense to me. But that would be a PyTorch-side change.)
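For context, the guard under discussion sits inside the custom Function's forward, roughly as below. This is a sketch assuming libtorch; the surrounding names come from the diff above, the elided arguments do not. The apparent intent is that excluding the variable/autograd dispatch key makes the inner `ROIAlign_forward` redispatch go straight to the CPU/CUDA kernels rather than re-entering the autograd layer.

```cpp
static torch::autograd::variable_list forward(
    torch::autograd::AutogradContext* ctx,
    const torch::autograd::Variable& input,
    const torch::autograd::Variable& rois
    /* ...remaining op arguments elided in this sketch... */) {
  ctx->saved_data["input_shape"] = input.sizes();
  ctx->save_for_backward({rois});
  // The autograd-disabling guard in question: while alive, dispatch skips
  // the variable type (autograd) key for calls made in this scope.
  at::AutoNonVariableTypeMode g;
  auto result = ROIAlign_forward(/* ...elided... */);
  return {result};
}
```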
fmassa left a comment
Thanks a lot for the port @ezyang !
Lint is failing though, can you run clang-format?
I'm ok merging this PR right now, just have a question about having to implement dummy gradgrad kernels, as we will probably be using this PR as a template for all other functions.
```cpp
};
at::Tensor roi_align(
// TODO: There should be an easier way to do this
```
Can't we register a fallback kernel that raises an error on double-backward? So that we don't have to implement a dummy double-backwards kernel for all the ops.
This would indeed be the right thing to do in the core library. There will be some BC consequences, though.
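A boxed fallback along the lines fmassa suggests could plausibly look like this. This is a sketch of the idea only, assuming libtorch; the function name is illustrative, and where exactly such a fallback would be registered in core (and for which dispatch key) is precisely the open question above.

```cpp
#include <torch/library.h>

// Hypothetical core-library fallback: any op that reaches the autograd key
// without an explicit derivative raises a clear error, instead of every
// downstream library writing its own dummy double-backward kernel.
void autograd_not_implemented_fallback(
    const c10::OperatorHandle& op, torch::jit::Stack* stack) {
  TORCH_CHECK(
      false,
      "derivative (e.g. double backward) for ", op.schema().name(),
      " is not implemented");
}

// `_` registers the fallback for all namespaces on the given key.
TORCH_LIBRARY_IMPL(_, Autograd, m) {
  m.fallback(torch::CppFunction::makeFromBoxedFunction<
             &autograd_not_implemented_fallback>());
}
```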
You should probably put the casting utilities in a header file. We'll need to talk about what a good external-facing API for it is.
Signed-off-by: Edward Z. Yang <ezyang@fb.com>
ad105f1 to
6155d0c
Compare
Aye, right now I don't have a clear idea. Trying it for torchvision will help solidify what such an API needs. The fact that, right now, autocast only wraps forward ops and relies on autograd to reverse the casts in backward makes life easier.
Codecov Report
```
@@            Coverage Diff             @@
##           master    #2366      +/-   ##
==========================================
+ Coverage   68.49%   69.86%   +1.36%
==========================================
  Files          93       93
  Lines        7655     8361     +706
  Branches     1177     1414     +237
==========================================
+ Hits         5243     5841     +598
- Misses       2075     2121      +46
- Partials      337      399      +62
```
Continue to review full report at Codecov.
* Switch torchvision registrations to new operator registration API. This is still registering everything as catchalls, so we're really just moving deck chairs around, but payoff is coming soon. Signed-off-by: Edward Z. Yang <ezyang@fb.com>
* Port roi_align to actually use dispatcher. Signed-off-by: Edward Z. Yang <ezyang@fb.com>

Co-authored-by: Edward Z. Yang <ezyang@fb.com>