Fix None grad problem during training TOOD by adding SigmoidGeometricMean#7090
Merged
ZwwWayne merged 2 commits into open-mmlab:dev on Jan 29, 2022
Conversation
ZwwWayne approved these changes on Jan 28, 2022
ZwwWayne reviewed on Jan 28, 2022
from torch.nn import functional as F


class SigmoidGeometricMean(Function):
Collaborator
How about we implement an interface named sigmoid_geometric_mean = SigmoidGeometricMean.apply here, so that in tood_head we can simply use sigmoid_geometric_mean(xxx)?
RangiLyu approved these changes on Jan 28, 2022
Codecov Report
@@             Coverage Diff             @@
##              dev    #7090      +/-   ##
==========================================
+ Coverage   62.41%   62.46%   +0.04%
==========================================
  Files         330      330
  Lines       26199    26216      +17
  Branches     4436     4437       +1
==========================================
+ Hits        16353    16375      +22
+ Misses       8976     8966      -10
- Partials      870      875       +5
ZwwWayne approved these changes on Jan 29, 2022
chhluo pushed a commit to chhluo/mmdetection that referenced this pull request on Feb 21, 2022
ZwwWayne pushed a commit that referenced this pull request on Jul 18, 2022
ZwwWayne pushed a commit to ZwwWayne/mmdetection that referenced this pull request on Jul 19, 2022
Motivation
The training of TOOD often encounters a None gradient during backpropagation, which in turn causes None tensors in the next training step. Some issues in the original repo (fcjian/TOOD#11) may also stem from this error. The problem is caused by the naive implementation of the sigmoid geometric mean,
cls_score = (cls_logits.sigmoid() * cls_prob.sigmoid()).sqrt(). This output can be 0 if cls_logits or cls_prob is a very negative value, which causes either an inf grad or a None grad during backpropagation.

Modification
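The failure mode described above can be reproduced in a minimal sketch (assuming PyTorch; the variable names here are illustrative, not taken from the PR):

```python
import torch

# Naive sigmoid geometric mean: sqrt(sigmoid(x) * sigmoid(y)).
# For very negative logits, the sigmoid product underflows to 0 in
# float32; the derivative of sqrt at 0 is 0.5 / sqrt(0) = inf, so
# backpropagation yields non-finite gradients.
x = torch.tensor([-100.0], requires_grad=True)
y = torch.tensor([-100.0], requires_grad=True)

score = (x.sigmoid() * y.sigmoid()).sqrt()
score.backward()

print(score.item())   # 0.0 -- the product underflowed
print(x.grad.item())  # non-finite (inf or NaN)
```

With moderately negative logits (e.g. -40) the product stays representable and the gradient is merely huge; only for deeply negative logits does it become non-finite, which matches the intermittent nature of the reported failures.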
A reimplementation of the SigmoidGeometricMean class as a subclass of torch.autograd.Function is proposed. The backward function is derived analytically, which avoids inf or None grads during backpropagation.
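The fix can be sketched as follows. This is a sketch reconstructed from the PR description and the review comment (the exact merged code may differ); the key point is that the analytic gradient d/dx sqrt(s(x)s(y)) = z * (1 - s(x)) / 2, where z is the forward output, contains no division by z and therefore stays finite even when z underflows to 0:

```python
import torch
from torch.autograd import Function


class SigmoidGeometricMean(Function):
    """sqrt(sigmoid(x) * sigmoid(y)) with an analytically derived backward.

    The naive autograd graph differentiates sqrt at 0, producing inf/NaN
    grads when the sigmoid product underflows; this backward never divides
    by the (possibly zero) output.
    """

    @staticmethod
    def forward(ctx, x, y):
        x_sigmoid = x.sigmoid()
        y_sigmoid = y.sigmoid()
        z = (x_sigmoid * y_sigmoid).sqrt()
        ctx.save_for_backward(x_sigmoid, y_sigmoid, z)
        return z

    @staticmethod
    def backward(ctx, grad_output):
        x_sigmoid, y_sigmoid, z = ctx.saved_tensors
        # From ln z = (ln s(x) + ln s(y)) / 2:
        #   dz/dx = z * s'(x) / (2 * s(x)) = z * (1 - s(x)) / 2
        grad_x = grad_output * z * (1 - x_sigmoid) / 2
        grad_y = grad_output * z * (1 - y_sigmoid) / 2
        return grad_x, grad_y


# Functional interface, as suggested in the review above.
sigmoid_geometric_mean = SigmoidGeometricMean.apply
```

For logits where the naive version is stable, the forward output is identical; for deeply negative logits the output is still 0, but the gradient is now 0 rather than inf/NaN, so training can continue.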