[Feature] Add EvalHook which will be used in downstream projects#739
ZwwWayne merged 18 commits into open-mmlab:master from
Conversation
It cannot be merged until #695 is merged since it uses
A related discussion: open-mmlab/mmaction2#395 (comment). Maybe we need to create a copy rather than a symlink.
How about adding an option for such a case? By default, we create a symlink, but in some cases we can cp a new ckpt. We should also ensure that only one symlink is created over the whole training run.
Now I think open-mmlab/mmaction2#395 is a good proposal, which copies the ckpt. Since the mmcv hook has
I am OK with that.
Please enrich the PR message for this PR, as this is a big refactoring for downstream tasks, to allow more discussion and for future reference, including its intention, modification, and consequences.
LGTM if the comments can all be resolved.
Done
See if @hellock has any comments.
Validation has been done in mmcls and the performance remains the same as before. Thank you.
Validation has been successfully done in mmdet3d with mmdet's PR: open-mmlab/mmdetection#4806 |
Validated in MMSeg. Thx! |
This PR is merged as many downstream repos have verified its correctness. Thanks for everyone's efforts.
Overview

This PR adds a `mmcv/runner/hooks/eval.py` file with `EvalHook` and `DistEvalHook` to MMCV, together with the related unit tests. Since these two hooks have been widely used in many MM repos, this PR implements and unifies the common functions used in the evaluation part.

Design
Model evaluation in OpenMMLab uses `EvalHook` and `DistEvalHook` during training. In detail, these hooks are commonly registered in `apis/train.py` by `runner.register_hook(eval_hook(val_dataloader, **eval_cfg))`. Users can set up evaluation with the following steps:

1. Build the validation dataset from `cfg.data.val`.
2. Build `val_dataloader` from the validation dataset.
3. Register `EvalHook` using `runner.register_hook`.

Once the hook is registered, the runner periodically calls the evaluation function in a fixed mode (by epoch or by iteration, etc.). The high-level workflow of `EvalHook` in OpenMMLab is:

register `EvalHook` -> define evaluation settings (`start`, `interval`, `by_epoch`, `save_best`, comparison rules, etc.) -> perform evaluation in `after_train_iter` or `after_train_epoch` -> (save the best checkpoint) -> loop back to `after_train_xxx`
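The registration flow above can be sketched with minimal stand-ins; the `Runner` and `EvalHook` classes below are simplified stubs for illustration, not the real mmcv implementations:

```python
# Simplified stand-ins for the real mmcv Runner and EvalHook, showing only
# the registration flow described above.
class EvalHook:
    def __init__(self, dataloader, start=None, interval=1, by_epoch=True,
                 save_best=None, **eval_kwargs):
        self.dataloader = dataloader
        self.start = start
        self.interval = interval
        self.by_epoch = by_epoch
        self.save_best = save_best
        self.eval_kwargs = eval_kwargs

class Runner:
    def __init__(self):
        self.hooks = []

    def register_hook(self, hook):
        self.hooks.append(hook)

# In a downstream repo this would be a real DataLoader built from cfg.data.val.
val_dataloader = ['batch_0', 'batch_1']
eval_cfg = dict(interval=1, by_epoch=True, save_best='auto')  # cfg.evaluation
runner = Runner()
runner.register_hook(EvalHook(val_dataloader, **eval_cfg))
```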
APIs

`EvalHook` is the base module, and `DistEvalHook` is a child class of it. They both inherit `Hook`. Here is the API explanation for `EvalHook`.

Initialization
- `dataloader`: The PyTorch dataloader for evaluation. It can be built with `build_dataloader` from the provided dataloader settings.
- `start`: Evaluation starting epoch. It enables evaluation before training starts if `start` <= the resuming epoch. If set to `None`, whether to evaluate is decided only by `interval`.
- `interval`: Evaluation interval.
- `by_epoch`: Determines whether to perform evaluation by epoch or by iteration. If set to `True`, evaluation is performed by epoch; otherwise, by iteration.
- `save_best`: If a metric is specified, it tracks the best checkpoint during evaluation, and the information about the best checkpoint is saved in `runner.meta['hook_msgs']`. Options are the evaluation metrics of the validation dataset, e.g., `bbox_mAP` and `segm_mAP` for bbox detection and instance segmentation, or `AR@100` for proposal recall. If `save_best` is `auto`, the first key of the returned `OrderedDict` result is used. The interval of `CheckpointHook` should be divisible by that of `EvalHook`.
- `rule`: Comparison rule for the best score. If set to `None`, a reasonable rule is inferred: keys such as `acc`, `top`, etc. follow the `greater` rule, while keys containing `loss` follow the `less` rule. Options are `greater`, `less`, and `None`. Note: since the rules differ across downstream repos, they may need to override `self.greater_keys` and `self.less_keys`.
- `eval_kwargs`: Keyword arguments for the dataset's evaluate function (`def evaluate`), which are fed into the dataset's evaluation.

Other hardcoded inner variables: `rule_map`, `init_value_map`, `greater_keys`, and `less_keys`. Note: since the rules differ across downstream repos, these variables may need to be overridden for the specific task.

- `rule_map`: A dict of comparison functions, defaulting to `{'greater': lambda x, y: x > y, 'less': lambda x, y: x < y}`.
- `init_value_map`: The initial values for comparison, defaulting to `{'greater': -inf, 'less': inf}`.
- `greater_keys`: A list of rule keys that use the `greater` function, i.e., for these keys `EvalHook` regards a greater number as better. Note that matching is by substring: if one of these keys is a substring of a metric name, the `greater` rule applies, e.g., for `acc`: `top1_acc`, `top5_acc`, and `mean_acc` all follow the `greater` rule.
- `less_keys`: Similar to `greater_keys` but for `less` rules, e.g., for `loss`: `bce_loss` and `bmn_loss` both follow the `less` rule.
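The substring-based rule inference described above can be sketched as follows; the key lists mirror the defaults mentioned here and may be overridden by downstream repos:

```python
# Sketch of EvalHook's rule inference; keys are matched by substring against
# the metric name. The lists below are illustrative defaults.
GREATER_KEYS = ['acc', 'top', 'AR@', 'auc', 'precision', 'mAP']
LESS_KEYS = ['loss']

def infer_rule(key_indicator):
    """Infer 'greater' or 'less' from the metric name by substring match."""
    if any(key in key_indicator for key in GREATER_KEYS):
        return 'greater'
    if any(key in key_indicator for key in LESS_KEYS):
        return 'less'
    raise ValueError(f'Cannot infer the rule for {key_indicator!r}; '
                     'please set the rule explicitly.')
```

For example, `infer_rule('top1_acc')` yields `greater`, while `infer_rule('bmn_loss')` yields `less`.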
before_run

This part initializes `runner.meta['hook_msgs']` if users decide to save the best checkpoint.

before_train_xxx
This covers the `before_train_iter` and `before_train_epoch` parts. It mainly determines whether it is the right time to perform evaluation by examining `by_epoch` and whether training has reached `start`. `self.initial_flag` indicates whether `EvalHook` has entered the normal evaluation loop; after entering the normal evaluation loop, `before_train_xxx` is skipped.
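A simplified sketch of this timing check, assuming epoch-based training (the real hook also handles iteration-based mode and the resuming case):

```python
# Sketch of the evaluation timing check for epoch-based training.
# `epoch` is the number of finished epochs (0-based).
def should_evaluate(epoch, start=None, interval=1):
    if start is None:
        # Decided by interval alone.
        return (epoch + 1) % interval == 0
    if epoch + 1 < start:
        # Training has not reached the starting epoch yet.
        return False
    # Evaluate every `interval` epochs once `start` is reached.
    return (epoch + 1 - start) % interval == 0
```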
after_train_xxx

This covers the `after_train_iter` and `after_train_epoch` parts. It mainly runs inference with the model and performs evaluation, as well as saving the best checkpoint if `save_best` is specified. In detail, it calls `_do_evaluate()` and `_save_best()`:

- `_do_evaluate()`: runs inference by calling `single_gpu_test()`, performs evaluation, and calls `_save_best()`
- `_save_best()`: compares the score according to the rule, writes info into `runner.meta['hook_msgs']`, and saves the best checkpoint in the work_dir

For `DistEvalHook`, besides the variables and functions mentioned above, it runs inference by calling `multi_gpu_test()` and passes `tmpdir` and `gpu_collect` to `multi_gpu_test()`.
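The `_save_best()` bookkeeping can be sketched as below; a plain dict stands in for `runner.meta`, and the comparison tables mirror the `rule_map` / `init_value_map` defaults described earlier:

```python
import math

# Comparison tables mirroring the defaults described in the APIs section.
RULE_MAP = {'greater': lambda x, y: x > y, 'less': lambda x, y: x < y}
INIT_VALUE_MAP = {'greater': -math.inf, 'less': math.inf}

def save_best(meta, score, ckpt_path, rule='greater'):
    """Record the best score and checkpoint path in meta['hook_msgs']."""
    msgs = meta.setdefault('hook_msgs', {})
    best = msgs.get('best_score', INIT_VALUE_MAP[rule])
    if RULE_MAP[rule](score, best):
        msgs['best_score'] = score
        msgs['best_ckpt'] = ckpt_path  # the real hook also writes the file
    return msgs
```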
Usages

Users adopt `EvalHook` by adding `--validate` to the training command, which enables `EvalHook` during training.

Config
To use `EvalHook` with specific settings, users need to modify the `evaluation` variable in config files; the key-value pairs in the dict correspond to the `EvalHook` API.
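As a hypothetical example (the metric names depend on the downstream dataset's `evaluate()` implementation):

```python
# Hypothetical `evaluation` field of a config file; each key maps to an
# EvalHook argument described in the APIs section.
evaluation = dict(
    start=1,               # evaluate from the 1st epoch on
    interval=1,            # evaluate every epoch
    by_epoch=True,         # epoch-based rather than iteration-based
    save_best='bbox_mAP',  # keep the checkpoint with the best bbox mAP
    rule='greater')        # higher bbox_mAP is better
```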
Migration

Since the keys for determining `greater` and `less` depend on the downstream task, downstream repos may need to override `self.greater_keys` and `self.less_keys`:
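A hedged sketch of such an override; the subclass name and the extra metric keys are illustrative, and the `EvalHook` stub stands in for `mmcv.runner.EvalHook`:

```python
class EvalHook:  # stand-in for mmcv.runner.EvalHook
    greater_keys = ['acc', 'top', 'AR@', 'auc', 'precision', 'mAP']
    less_keys = ['loss']

class SegEvalHook(EvalHook):
    """Hypothetical downstream hook tailoring rule inference to its task."""
    # Extend the inherited defaults with task-specific metric keys.
    greater_keys = EvalHook.greater_keys + ['mIoU', 'mDice']
    less_keys = EvalHook.less_keys + ['error']
```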