[WIP]Refactor vit2 by HIT-cwh · Pull Request #295 · open-mmlab/mmpretrain

HIT-cwh · 2021-06-08T04:46:23Z

Hi! This PR refactors the vision transformer to use the FFN and MSA modules from transformer.py in mmcv. Because the refactor_tramsformer_base branch created by shilong has not been merged yet, all the unit tests currently fail.
Please take a look.

# Conflicts: # mmcls/models/backbones/vision_transformer.py

codecov · 2021-06-15T14:38:29Z

Codecov Report

Merging #295 (062d65f) into master (1a7cebe) will increase coverage by 0.12%.
The diff coverage is 86.88%.

❗ Current head 062d65f differs from pull request most recent head 33c196a. Consider uploading reports for the commit 33c196a to get more accurate results

@@            Coverage Diff             @@
##           master     #295      +/-   ##
==========================================
+ Coverage   76.02%   76.14%   +0.12%     
==========================================
  Files          96       96              
  Lines        5109     5069      -40     
  Branches      849      842       -7     
==========================================
- Hits         3884     3860      -24     
+ Misses       1106     1096      -10     
+ Partials      119      113       -6

| Flag | Coverage Δ |
| --- | --- |
| unittests | `76.14% <86.88%> (+0.12%)` ⬆️ |

Flags with carried forward coverage won't be shown.

| Impacted Files | Coverage Δ |
| --- | --- |
| mmcls/models/backbones/vision_transformer.py | `80.41% <84.00%> (+3.82%)` ⬆️ |
| mmcls/models/heads/vision_transformer_head.py | `93.61% <100.00%> (+0.59%)` ⬆️ |
| mmcls/models/necks/gap.py | `80.00% <100.00%> (+0.83%)` ⬆️ |

Legend:
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 1a7cebe...33c196a. Read the comment docs.
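
The headline numbers in the report can be cross-checked from the raw counts in the coverage diff. The sketch below is only an illustration: the helper name `coverage_bp` is made up here, and the truncation to two decimals is an observation that happens to match the reported figures, not a statement about how Codecov computes them.

```python
def coverage_bp(hits: int, lines: int) -> int:
    """Coverage in hundredths of a percent, truncated (integer arithmetic)."""
    return hits * 10_000 // lines


# Raw counts copied from the coverage diff above.
master = {"lines": 5109, "hits": 3884, "misses": 1106, "partials": 119}
head = {"lines": 5069, "hits": 3860, "misses": 1096, "partials": 113}

for report in (master, head):
    # Every tracked line is a hit, a miss, or a partial.
    assert report["hits"] + report["misses"] + report["partials"] == report["lines"]

master_bp = coverage_bp(master["hits"], master["lines"])
head_bp = coverage_bp(head["hits"], head["lines"])

print(
    f"master {master_bp / 100:.2f}%  "
    f"head {head_bp / 100:.2f}%  "
    f"delta {(head_bp - master_bp) / 100:+.2f}%"
)
```

Note that hits drop by 24 while coverage still rises, because the total line count drops by 40.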

Junjun2016 · 2021-06-15T17:43:05Z

mmcls/models/backbones/vision_transformer.py

+                 attn_drop_rate=0.,
+                 drop_path_rate=0.,
+                 num_fcs=2,
+                 qkv_bias=True,


It seems that the default is `qkv_bias=False` in most repos (e.g. TIMM), so maybe we can follow the common practice.

clownrat6 · 2021-06-16

In fact, the default value of `qkv_bias` is True (e.g. in TIMM).
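
For context on what this flag changes, here is a minimal sketch that counts the parameters of a fused query/key/value projection with and without the bias. It assumes the projection is a single linear layer mapping `embed_dim` to `3 * embed_dim`; the helper `qkv_param_count` is hypothetical and is not part of mmcls, mmcv, or TIMM.

```python
def qkv_param_count(embed_dim: int, qkv_bias: bool) -> int:
    """Parameters of one fused QKV linear layer (embed_dim -> 3 * embed_dim).

    Hypothetical helper for illustration only.
    """
    weights = embed_dim * 3 * embed_dim
    # The bias adds one value per output channel when enabled.
    biases = 3 * embed_dim if qkv_bias else 0
    return weights + biases


embed_dim = 768  # embedding width of ViT-Base
without_bias = qkv_param_count(embed_dim, qkv_bias=False)
with_bias = qkv_param_count(embed_dim, qkv_bias=True)
print(without_bias, with_bias, with_bias - without_bias)
```

The difference in size is small, so the choice of default mostly matters for matching pretrained checkpoints, whose state dicts either contain the bias tensors or do not.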

Junjun2016 · 2021-06-17T06:47:31Z

mmcls/models/backbones/vision_transformer.py

+class HybridEmbed(BaseModule):
    """CNN Feature Map Embedding.

    Extract feature map from CNN, flatten, project to embedding dim.


Please add an `Args` section to the docstring.

Junjun2016 · 2021-06-17T06:49:41Z

mmcls/models/backbones/vision_transformer.py

-        super(PatchEmbed, self).__init__()
+                 norm_cfg=None,
+                 conv_cfg=None,
+                 init_cfg=None):


Add `kernel_size`, `stride` and `padding` args to support overlapping patch embedding.
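
To illustrate the effect of these three arguments, the sketch below computes how many patch tokens a convolutional patch embedding produces. It assumes a square input, a square kernel and dilation 1, and uses the standard convolution output-size formula. The function names and the overlapping setting (kernel 7, stride 4, padding 3) are illustrative choices, not values taken from this PR.

```python
def conv_output_size(size: int, kernel_size: int, stride: int, padding: int = 0) -> int:
    """Spatial output size of a convolution with dilation 1."""
    return (size + 2 * padding - kernel_size) // stride + 1


def num_patches(img_size: int, kernel_size: int, stride: int, padding: int = 0) -> int:
    """Number of tokens produced by a square patch-embedding convolution."""
    side = conv_output_size(img_size, kernel_size, stride, padding)
    return side * side


# Non-overlapping patches: stride equals kernel size.
print(num_patches(224, kernel_size=16, stride=16))

# Overlapping patches: stride smaller than kernel size (illustrative values).
print(num_patches(224, kernel_size=7, stride=4, padding=3))
```

Patches overlap whenever `stride` is smaller than `kernel_size`, and the token count (and therefore the length of the position embedding) grows accordingly.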

# Conflicts: # tests/test_classifiers.py

mzr1996 · 2021-08-05T11:36:31Z

Closed, continue with #395

* [Squash] Refator ViT (from #295)
* Use base variable to simplify auto_aug setting
* Use common PatchEmbed, remove HybridEmbed and refactor ViT init structure.
* Add `output_cls_token` option and change the output format of ViT and input format of ViT head.
* Update unit tests and add test for `output_cls_token`.
* Support out_indices.
* Standardize config files
* Support resize position embedding.
* Add readme file of vit
* Rename config file
* Improve docs about ViT.
* Update docstring
* Use local version `MultiheadAttention` instead of mmcv version.
* Fix MultiheadAttention
* Support `qk_scale` argument in `MultiheadAttention`
* Improve docs and change `layer_cfg` to `layer_cfgs` and support sequence.
* Use init_cfg to init Linear layer in VisionTransformerHead
* update metafile
* Update checkpoints and configs
* Imporve docstring.
* Update README
* Revert GAP modification.

HIT-cwh added 30 commits April 16, 2021 16:13

add builder.py to mmcls/utils

d9d15b1

add vision_transformer.py to backbone

921f7fa

modify vision_transformer.py

08cca3f

Merge branch 'master' into refactor_vit

1fdd3d5

# Conflicts: # mmcls/models/backbones/vision_transformer.py

refactor vit using transformer in mmcv

6c5e9bd

test vit and vit_hybrid

46ae5ca

Merge branch 'master' into refactor_vit

e00efa8

# Conflicts: # mmcls/models/backbones/vision_transformer.py

inherit from mmcv BaseModule

81e6a3e

inherit from BaseModule

1b7ee3e

refactor init_weights

86eb44a

sync with model

28b8a46

delete init_weights in BaseBackbone

6b38927

refactor vit

8cb434a

sync with model

263256d

change ckpt save interval to 10

f8b23dc

sync with vit backbone

ce8d959

fix backbone.init_weights() unfinished

9282387

refactor vit backbone

ee50c47

delete norm_cfg

0477047

add mytrain for test

21a4711

add mytrain.py for test

ef072c9

test cls_token

8740287

test before layers

f4d9392

test first layer

52a2f1a

test attr in layers

215647f

permute to [bs, n_query, embed_dim]

cfdfafe

test backbone

bf92913

test classifier

1db7173

test classifier

8c4ba54

delete mytrain.py

68e0a96

HIT-cwh added 2 commits June 15, 2021 22:29

Merge branch 'master' into refactor_vit2

6c07bf2

# Conflicts: # mmcls/models/backbones/vision_transformer.py

add drop_path_rate comments

8a9fe92

Junjun2016 reviewed Jun 15, 2021


Junjun2016 reviewed Jun 15, 2021


Make clearer description about batch_first

28b0878

Junjun2016 reviewed Jun 17, 2021


HIT-cwh added 8 commits June 17, 2021 19:56

add comments on HybridEmbed

2563291

lecun init patchembed

e840fc6

ffn xavier_normal_ to xavier_uniform_

81c8ff9

zero init pre_logits.bias

cbb013a

LN eps=1e-6

83a17e4

set RandomResizedCrop backend=pillow

9775f65

sync with classyvision

5cde34c

Merge branch 'master' into refactor_vit2

33c196a

# Conflicts: # tests/test_classifiers.py

mzr1996 added a commit to mzr1996/mmpretrain that referenced this pull request Aug 5, 2021

[Squash] Refator ViT (from open-mmlab#295)

09ac36a

mzr1996 mentioned this pull request Aug 5, 2021

[Refactor] Refator ViT (Continue #295) #395

Merged

6 tasks

mzr1996 closed this Aug 5, 2021

mzr1996 added a commit to mzr1996/mmpretrain that referenced this pull request Aug 5, 2021

[Squash] Refator ViT (from open-mmlab#295)

18f62d5

mzr1996 added a commit to mzr1996/mmpretrain that referenced this pull request Aug 10, 2021

[Squash] Refator ViT (from open-mmlab#295)

96d701a

mzr1996 added a commit to mzr1996/mmpretrain that referenced this pull request Aug 20, 2021

[Squash] Refator ViT (from open-mmlab#295)

b0b4229

mzr1996 added a commit to mzr1996/mmpretrain that referenced this pull request Aug 24, 2021

[Squash] Refator ViT (from open-mmlab#295)

a168d7a

mzr1996 added a commit to mzr1996/mmpretrain that referenced this pull request Aug 24, 2021

[Squash] Refator ViT (from open-mmlab#295)

4a7833a

mzr1996 added a commit to mzr1996/mmpretrain that referenced this pull request Sep 28, 2021

[cherry-pick] Refator ViT (from open-mmlab#295)

5c7d77d

mzr1996 added a commit to mzr1996/mmpretrain that referenced this pull request Sep 28, 2021

[Squash] Refator ViT (from open-mmlab#295)

5a87816

mzr1996 added a commit to mzr1996/mmpretrain that referenced this pull request Oct 13, 2021

[cherry-pick] Refator ViT (from open-mmlab#295)

ffd93db

HIT-cwh wants to merge 119 commits into open-mmlab:master from HIT-cwh:refactor_vit2


4 participants
