Add qunatized resnet50 model by mengniwang95 · Pull Request #460 · onnx/models

mengniwang95 · 2021-08-20T02:52:55Z

ResNet

Use cases

ResNet models perform image classification - they take images as input and classify the major object in the image into a set of pre-defined classes. They are trained on ImageNet dataset which contains images from 1000 classes. ResNet models provide very high accuracies with affordable model sizes. They are ideal for cases when high accuracy of classification is required.

Description

Deeper neural networks are more difficult to train. Residual learning framework ease the training of networks that are substantially deeper. The research explicitly reformulate the layers as learning residual functions with reference to the layer inputs, instead of learning unreferenced functions. It also provide comprehensive empirical evidence showing that these residual networks are easier to optimize, and can gain accuracy from considerably increased depth. On the ImageNet dataset the residual nets were evaluated with a depth of up to 152 layers — 8× deeper than VGG nets but still having lower complexity.

MXNet ResNet-v1 ==> ONNX ResNet-v1 [18, 34, 50, 101, 152]

MXNet ResNet-v2 ==> ONNX ResNet-v2 [18, 34, 50, 101, 152]

Caffe2 ResNet-50 ==> ONNX ResNet [50-caffe2]

ONNX ResNet50-v1 ==> Quantized ONNX ResNet50-v1

Model

The model below are ResNet v1 and v2. ResNet models consists of residual blocks and came up to counter the effect of deteriorating accuracies with more layers due to network not learning the initial layers.
ResNet v2 uses pre-activation function whereas ResNet v1 uses post-activation for the residual blocks. The models below have 18, 34, 50, 101 and 152 layers for with ResNetv1 and ResNetv2 architecture.

Run ResNet-50 in browser - implemented by ONNX.js with ResNet50-Caffe2 release 1.2

Details of Resnet-50 quantization & evaluation - implemented by Intel® Low Precision Optimization Tool (LPOT)

ResNet V1:

Model	Download	Download (with sample test data)	ONNX version	Opset version	Top-1 accuracy (%)	Top-5 accuracy (%)
ResNet18	44.7 MB	42.9 MB	1.2.1	7	69.93	89.29
ResNet34	83.3 MB	78.6 MB	1.2.1	7	73.73	91.40
ResNet50	97.8 MB	92.2 MB	1.2.1	7	74.93	92.38
ResNet101	170.6 MB	159.8 MB	1.2.1	7	76.48	93.20
ResNet152	230.6 MB	217.2 MB	1.2.1	7	77.11	93.61

Model	Download	Download (with sample test data)	ONNX version	Opset version
ResNet50-caffe2	32 MB	95 MB	1.1	3
ResNet50-caffe2	32 MB	96 MB	1.1.2	6
ResNet50-caffe2	32 MB	100 MB	1.2	7
ResNet50-caffe2	32 MB	100 MB	1.3	8
ResNet50-caffe2	32 MB	100 MB	1.4	9

Model	Download	Download (with sample test data)	ONNX version	Opset version	Top-1 accuracy (%)	latency (ms)
ResNet50_fp32	97.8 MB	92.0 MB	1.7.0	12	74.97	8.32
ResNet50_int8	24.6 MB	22.3 MB	1.7.0	12	74.77	4.30

Note the latency depends on the test hardware and the displayed results are tested by Intel® Xeon® Platinum 8280 Processor.
Compared with the fp32 model, we get an accuracy drop of 0.2% and a performance improvement of 1.9X after quantization.

ResNet V2:

Model	Download	Download (with sample test data)	ONNX version	Opset version	Top-1 accuracy (%)	Top-5 accuracy (%)
ResNet18	44.6 MB	42.9 MB	1.2.1	7	69.70	89.49
ResNet34	83.2 MB	78.6 MB	1.2.1	7	73.36	91.43
ResNet50	97.7 MB	92.0 MB	1.2.1	7	75.81	92.82
ResNet101	170.4 MB	159.4 MB	1.2.1	7	77.42	93.61
ResNet152	230.3 MB	216.0 MB	1.2.1	7	78.20	94.21

Inference

We used MXNet as framework with gluon APIs to perform inference. View the notebook imagenet_inference to understand how to use above models for doing inference. Make sure to specify the appropriate model name in the notebook.

Input

All pre-trained models expect input images normalized in the same way, i.e. mini-batches of 3-channel RGB images of shape (N x 3 x H x W), where N is the batch size, and H and W are expected to be at least 224.
The inference was done using jpeg image.

Preprocessing

The images have to be loaded in to a range of [0, 1] and then normalized using mean = [0.485, 0.456, 0.406] and std = [0.229, 0.224, 0.225]. The transformation should preferably happen at preprocessing.

The following code shows how to preprocess a NCHW tensor:

import numpy

def preprocess(img_data):
    mean_vec = np.array([0.485, 0.456, 0.406])
    stddev_vec = np.array([0.229, 0.224, 0.225])
    norm_img_data = np.zeros(img_data.shape).astype('float32')
    for i in range(img_data.shape[0]):
         # for each pixel in each channel, divide the value by 255 to get value between [0, 1] and then normalize
        norm_img_data[i,:,:] = (img_data[i,:,:]/255 - mean_vec[i]) / stddev_vec[i]
    return norm_img_data

Check imagenet_preprocess.py for additional sample code.

Output

The model outputs image scores for each of the 1000 classes of ImageNet.

Postprocessing

The post-processing involves calculating the softmax probability scores for each class. You can also sort them to report the most probable classes. Check imagenet_postprocess.py for code.

To do quick inference with the model, check out Model Server.

Dataset

Dataset used for train and validation: ImageNet (ILSVRC2012). Check imagenet_prep for guidelines on preparing the dataset.

Caffe2 Version of ResNet50 uses the ImageNet dataset from 2015 -- ILSVRC2015.

Validation accuracy

The accuracies obtained by the models on the validation set are mentioned above. The validation was done using center cropping of images unlike the paper which uses ten-cropping. We expect an increase of 1-2% in accuracies using ten cropping and that would lead to accuracies similar to the paper.

Training

We used MXNet as framework with gluon APIs to perform training. View the training notebook to understand details for parameters and network for each of the above variants of ResNet.

Validation

We used MXNet as framework with gluon APIs to perform validation. Use the notebook imagenet_validation to verify the accuracy of the model on the validation set. Make sure to specify the appropriate model name in the notebook.

References

ResNetv1
Deep residual learning for image recognition
He, Kaiming, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 770-778. 2016.
ResNetv2
Identity mappings in deep residual networks
He, Kaiming, Xiangyu Zhang, Shaoqing Ren, and Jian Sun.
In European Conference on Computer Vision, pp. 630-645. Springer, Cham, 2016.
MXNet, Gluon model zoo, GluonCV

Contributors

ankkhedia (Amazon AI)
abhinavs95 (Amazon AI)

License

Apache 2.0

Signed-off-by: Mengni Wang <mengni.wang@intel.com>

mengniwang95 · 2021-08-20T03:00:37Z

@wenbingl Hi Wenbing, we proposed this PR previously but reverted it due to some issues. Currently, I have fixed the issue and proposed it again.
One more question: I add a new table about model statistics into the original README.md, do you have any suggestions about the organization of this table? Should I combine it with the existing table (but the new table has a 'latency' column)?

wenbingl · 2021-08-20T21:10:09Z

Thanks a lot for the update.

I prefer to merge the table together. just leave some 'latency' blank if there is no value for the legacy model.

Even, can we also merge the 'quantization.md' into README.md to be a section?

mengniwang95 · 2021-08-24T01:45:23Z

Sure, I will merge the table together.
For "merge the 'quantization.md' into README.md to be a section", I need to confirm with our legal team.

Signed-off-by: Mengni Wang <mengni.wang@intel.com>

mengniwang95 · 2021-08-30T11:46:09Z

@wenbingl Hi Wenbing, I have confirmed with our legal team and fixed it. Please review it when you are available and I will update asap. Thx

wenbingl

Thanks a lot for this contribution.
It's the first quantized model for the zoo and it shows the great performance improvement.

* Add int8 resnet50 Signed-off-by: Mengni Wang <mengni.wang@intel.com> * update valid model Signed-off-by: Mengni Wang <mengni.wang@intel.com> * update test data name Signed-off-by: Mengni Wang <mengni.wang@intel.com> * update data Signed-off-by: Mengni Wang <mengni.wang@intel.com> * fix readme Signed-off-by: Mengni Wang <mengni.wang@intel.com> * add model info Signed-off-by: Mengni Wang <mengni.wang@intel.com> * add model Signed-off-by: Mengni Wang <mengni.wang@intel.com> * merge table Signed-off-by: Mengni Wang <mengni.wang@intel.com> * remove unused link Signed-off-by: Mengni Wang <mengni.wang@intel.com> * update model link Signed-off-by: Mengni Wang <mengni.wang@intel.com> * update config Signed-off-by: Mengni Wang <mengni.wang@intel.com>

mengniwang95 added 7 commits August 20, 2021 09:16

Add int8 resnet50

938e076

Signed-off-by: Mengni Wang <mengni.wang@intel.com>

update valid model

f984479

Signed-off-by: Mengni Wang <mengni.wang@intel.com>

update test data name

184f88d

Signed-off-by: Mengni Wang <mengni.wang@intel.com>

update data

9bb20a8

Signed-off-by: Mengni Wang <mengni.wang@intel.com>

fix readme

d6989db

Signed-off-by: Mengni Wang <mengni.wang@intel.com>

add model info

e4fe125

Signed-off-by: Mengni Wang <mengni.wang@intel.com>

add model

2e99d13

Signed-off-by: Mengni Wang <mengni.wang@intel.com>

jebastin-nadar mentioned this pull request Aug 26, 2021

dnn : int8 quantized layers support in onnx importer opencv/opencv#20535

Merged

11 tasks

merge table

5f2b72c

Signed-off-by: Mengni Wang <mengni.wang@intel.com>

mengniwang95 force-pushed the int8_rn50 branch from cfe2bda to 5f2b72c Compare August 27, 2021 02:21

mengniwang95 added 4 commits August 27, 2021 10:23

remove unused link

6045dc5

Signed-off-by: Mengni Wang <mengni.wang@intel.com>

update model link

5e41557

Signed-off-by: Mengni Wang <mengni.wang@intel.com>

update config

2553c04

Signed-off-by: Mengni Wang <mengni.wang@intel.com>

Merge branch 'master' into int8_rn50

6d9b86b

wenbingl approved these changes Aug 31, 2021

View reviewed changes

wenbingl merged commit 7711852 into onnx:master Aug 31, 2021

messerb5467 mentioned this pull request Nov 11, 2021

Resnet50-v1-12* models and shufflenet-v1-12* models aren't appropriately included for LFS handling #482

Open

jcwchen added the quantization-model label Jun 9, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add qunatized resnet50 model#460

Add qunatized resnet50 model#460
wenbingl merged 12 commits intoonnx:masterfrom
mengniwang95:int8_rn50

mengniwang95 commented Aug 20, 2021

Uh oh!

mengniwang95 commented Aug 20, 2021

Uh oh!

wenbingl commented Aug 20, 2021

Uh oh!

mengniwang95 commented Aug 24, 2021

Uh oh!

mengniwang95 commented Aug 30, 2021

Uh oh!

wenbingl left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

mengniwang95 commented Aug 20, 2021

ResNet

Use cases

Description

Model

Inference

Input

Preprocessing

Output

Postprocessing

Dataset

Validation accuracy

Training

Validation

References

Contributors

License

Uh oh!

mengniwang95 commented Aug 20, 2021

Uh oh!

wenbingl commented Aug 20, 2021

Uh oh!

mengniwang95 commented Aug 24, 2021

Uh oh!

mengniwang95 commented Aug 30, 2021

Uh oh!

wenbingl left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants