Bug fixes for universal intrinsics of RISC-V back-end: v_reduce_sum. by hanliutong · Pull Request #20598 · opencv/opencv

hanliutong · 2021-08-24T03:54:22Z

Fixed reduce_sum operations.

See #20278 for previous state

Pull Request Readiness Checklist

See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request

I agree to contribute to the project under Apache 2 License.
To the best of my knowledge, the proposed patch is not based on a code under GPL or other license that is incompatible with OpenCV
The PR is proposed to proper branch
There is reference to original bug report and related work
There is accuracy test, performance test and test data in opencv_extra repository, if applicable
Patch to opencv_extra has the same branch name.
The feature is well documented and sample code can be built with the project CMake

hanliutong · 2021-08-24T03:56:09Z

cc @joy2myself and @asmorkalov

asmorkalov · 2021-08-24T15:33:10Z

Remaining failed test on RISC-V:

[  FAILED  ] 13 tests, listed below:
[  FAILED  ] Reproducibility_FCN.Accuracy
[  FAILED  ] Layer_GRU_Test_Accuracy_with_.Pytorch
[  FAILED  ] Test_Int8_nets.FasterRCNN_inceptionv2/0, where GetParam() = OCV/CPU
[  FAILED  ] Test_Int8_nets.FasterRCNN_vgg16/0, where GetParam() = OCV/CPU
[  FAILED  ] Test_Int8_nets.YOLOv3/0, where GetParam() = OCV/CPU
[  FAILED  ] Test_ONNX_layers.LSTM_Activations/0, where GetParam() = OCV/CPU
[  FAILED  ] Test_ONNX_layers.GRU/0, where GetParam() = OCV/CPU
[  FAILED  ] Test_ONNX_layers.GRU_bidirectional/0, where GetParam() = OCV/CPU
[  FAILED  ] Test_ONNX_layers.CumSum/0, where GetParam() = OCV/CPU
[  FAILED  ] Test_TensorFlow_layers.reduce_max/0, where GetParam() = OCV/CPU
[  FAILED  ] Test_TensorFlow_layers.reduce_max_channel/0, where GetParam() = OCV/CPU
[  FAILED  ] Test_TensorFlow_layers.reduce_max_channel_keep_dims/0, where GetParam() = OCV/CPU
[  FAILED  ] Test_TensorFlow_layers.pooling_reduce_max/0, where GetParam() = OCV/CPU

hanliutong · 2021-08-25T06:12:55Z

@asmorkalov Thanks for your information! I can reproduce 4/13 failed test cases as below.

[  FAILED  ] Reproducibility_FCN.Accuracy
[  FAILED  ] Test_Int8_nets.FasterRCNN_inceptionv2/0, where GetParam() = OCV/CPU
[  FAILED  ] Test_Int8_nets.FasterRCNN_vgg16/0, where GetParam() = OCV/CPU
[  FAILED  ] Test_Int8_nets.YOLOv3/0, where GetParam() = OCV/CPU

And then, I test the 4 failed cases on master, both of them failed too. So I think it may not be caused by this PR.

Others 9 test cases are pass on my server but I notice that all of those test cases are added to opencv_extra in the last several weeks. So if I guess right, update the opencv_extra repository to the current master can resolve the problem.

asmorkalov · 2021-08-25T06:53:43Z

Tests Test_Int8_nets_XXX are the new thing introduced in OpenCV week ago or so. Please ignore it for now. It looks like I have outdated extra or some inconsistency with my Docker containers.

asmorkalov · 2021-08-25T06:56:01Z

Could you add small test for the fixed universal intrinsics to https://github.com/opencv/opencv/blob/master/modules/core/test/test_intrin128.simd.hpp or https://github.com/opencv/opencv/blob/master/modules/core/test/test_intrin.cpp.

hanliutong · 2021-08-25T08:27:25Z

Oh, I found that there is already a function to check the result of reduce_sum in "test_intrin_utils.hpp". And the reason of why we miss it before is we usually run the test with VLEN=128 on QEMU.

We can just run qemu-riscv64 -cpu rv64,x-v=true,vlen=256 ./bin/opencv_test_core --gtest_filter="hal_intrin128.float32x4_BASELINE" to reproduces the issue.

The google test report the Failure on line 1467 as below

/root/opencv/modules/core/test/test_intrin_utils.hpp:231: Failure
Expected equality of these values:
  a
    Which is: 10
  b
    Which is: 14.4375
Google Test trace:
/root/opencv/modules/core/test/test_intrin_utils.hpp:1467: i=0

Do you think we need new test cases? Or we just need a new way (with VLEN>128, VLEN=256 as an example) to run the existing tests?

hanliutong · 2021-08-25T08:36:29Z

Unfortunately, I also found that this PR does not fix all the failures in hal_intrin128.float32x4_BASELINE. It does fix the v_reduce_sum4 but looks like there are other errors on v_rotate_right as the report by google test.

asmorkalov · 2021-08-25T10:32:17Z

No new test case is not required then. I'll add vlen=256 case to CI.

jebastin-nadar · 2021-08-26T06:32:10Z

Hi @asmorkalov @hanliutong. Can you please specify what is the exact error in the failed Test_Int8_Nets.XXX? Is it related to stricter thresholds for scoreDiff and IoUDiff?

fix v_reduce_sum

2f31763

asmorkalov self-requested a review August 24, 2021 07:00

asmorkalov added the pr: needs test New functionality requires minimal tests set label Aug 25, 2021

asmorkalov removed the pr: needs test New functionality requires minimal tests set label Aug 26, 2021

asmorkalov approved these changes Aug 26, 2021

View reviewed changes

alalek assigned asmorkalov Aug 26, 2021

opencv-pushbot merged commit 56d0d59 into opencv:master Aug 26, 2021

alalek mentioned this pull request Oct 15, 2021

(5.x) Merge 4.x #20886

Merged

hanliutong deleted the rvv-fix branch November 9, 2021 07:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Bug fixes for universal intrinsics of RISC-V back-end: v_reduce_sum.#20598

Bug fixes for universal intrinsics of RISC-V back-end: v_reduce_sum.#20598
opencv-pushbot merged 1 commit intoopencv:masterfrom
hanliutong:rvv-fix

hanliutong commented Aug 24, 2021 •

edited

Loading

Uh oh!

hanliutong commented Aug 24, 2021

Uh oh!

asmorkalov commented Aug 24, 2021

Uh oh!

hanliutong commented Aug 25, 2021

Uh oh!

asmorkalov commented Aug 25, 2021

Uh oh!

asmorkalov commented Aug 25, 2021

Uh oh!

hanliutong commented Aug 25, 2021

Uh oh!

hanliutong commented Aug 25, 2021

Uh oh!

asmorkalov commented Aug 25, 2021

Uh oh!

jebastin-nadar commented Aug 26, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Uh oh!

Conversation

hanliutong commented Aug 24, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Pull Request Readiness Checklist

Uh oh!

hanliutong commented Aug 24, 2021

Uh oh!

asmorkalov commented Aug 24, 2021

Uh oh!

hanliutong commented Aug 25, 2021

Uh oh!

asmorkalov commented Aug 25, 2021

Uh oh!

asmorkalov commented Aug 25, 2021

Uh oh!

hanliutong commented Aug 25, 2021

Uh oh!

hanliutong commented Aug 25, 2021

Uh oh!

asmorkalov commented Aug 25, 2021

Uh oh!

jebastin-nadar commented Aug 26, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

hanliutong commented Aug 24, 2021 •

edited

Loading