Skip to main content
Log in

A survey on deep learning-based camouflaged object detection

  • Regular Paper
  • Published:
Multimedia Systems Aims and scope Submit manuscript

Abstract

Camouflaged object detection (COD) is an emerging visual detection task that aims to identify objects that conceal themselves in the surrounding environment. The high intrinsic similarities between the camouflaged objects and their backgrounds make COD far more challenging than traditional object detection. Recently, COD has attracted increasing research interest in the computer vision community, and numerous deep learning-based methods have been proposed, showing great potential. However, most of the existing work focuses on analyzing the structure of COD models, with few overview works summarizing deep learning-based models. To address this gap, we provide a comprehensive analysis and summary of deep learning-based COD models. Specifically, we first classify 48 deep learning-based COD models and analyze their advantages and disadvantages. Second, we introduce widely available datasets for COD and performance evaluation metrics. Then, we evaluate the performance of existing deep learning-based COD models on these four datasets. Finally, we indicate relevant applications and discuss challenges and future research directions for the COD task.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+
from €37.37 /Month
  • Starting from 10 chapters or articles per month
  • Access and download chapters and articles from more than 300k books and 2,500 journals
  • Cancel anytime
View plans

Buy Now

Price includes VAT (Netherlands)

Instant access to the full article PDF.

Fig. 1
The alternative text for this image may have been generated using AI.
Fig. 2
The alternative text for this image may have been generated using AI.
Fig. 3
The alternative text for this image may have been generated using AI.
Fig. 4
The alternative text for this image may have been generated using AI.
Fig. 5
The alternative text for this image may have been generated using AI.

Similar content being viewed by others

Data availability

Data will be made available on request.

References

  1. Bi, H., Zhang, C., Wang, K., et al.: Rethinking camouflaged object detection: models and datasets. IEEE Trans. Circuits Syst. Video Technol. 32(9), 5708–5724 (2022). https://doi.org/10.1109/TCSVT.2021.3124952

    Article  Google Scholar 

  2. Borji, A., Cheng, M.M., Jiang, H., et al.: Salient object detection: a benchmark. IEEE Trans. Image Process. 24(12), 5706–5722 (2015). https://doi.org/10.1109/TIP.2015.2487833

    Article  MathSciNet  Google Scholar 

  3. Brock, A., Donahue, J., Simonyan, K.: Large scale GAN training for high fidelity natural image synthesis. In: International Conference on Learning Representations (2019). https://openreview.net/forum?id=B1xsqj09Fm

  4. Carion, N., Massa, F., Synnaeve, G., et al.: End-to-end object detection with transformers. In: Vedaldi, A., Bischof, H., Brox, T., et al. (eds.) Computer Vision - ECCV 2020, pp. 213–229. Springer International Publishing, Cham (2020). https://doi.org/10.1007/978-3-030-58452-8_13

    Chapter  Google Scholar 

  5. Chen, Z., Gao, R., Xiang, T.Z., et al.: Diffusion model for camouflaged object detection. In: 26th European Conference on Artificial Intelligence ECAI 2023, pp 445–452 (2023).https://doi.org/10.3233/FAIA230302

  6. Cheng, X., Xiong, H., Fan, D.P., et al.: Implicit motion handling for video camouflaged object detection. In: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp 13854–13863 (2022). https://doi.org/10.1109/CVPR52688.2022.01349

  7. Cong, R., Sun, M., Zhang, S., et al.: Frequency perception network for camouflaged object detection. In: Proceedings of the 31st ACM International Conference on Multimedia. Association for Computing Machinery, New York, NY, USA, MM ’23, p 1179-1189 (2023). https://doi.org/10.1145/3581783.3612083

  8. Dosovitskiy, A., Beyer, L., Kolesnikov, A., et al.: An image is worth 16x16 words: Transformers for image recognition at scale. In: International Conference on Learning Representations (2021). https://openreview.net/forum?id=YicbFdNTTy

  9. Fan, D., Cheng, M., Liu, Y., et al.: Structure-measure: A new way to evaluate foreground maps. In: 2017 IEEE International Conference on Computer Vision (ICCV). IEEE Computer Society, Los Alamitos, CA, USA, pp 4558–4567 (2017). https://doi.org/10.1109/ICCV.2017.487. https://doi.ieeecomputersociety.org/10.1109/ICCV.2017.487

  10. Fan, D.P., Ji, G.P., Sun, G., et al.: Camouflaged object detection. In: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp 2774–2784 (2020). https://doi.org/10.1109/CVPR42600.2020.00285

  11. Fan, D.P., Ji, G.P., Zhou, T., et al.: Pranet: parallel reverse attention network for polyp segmentation. In: Martel, A.L., Abolmaesumi, P., Stoyanov, D., et al. (eds.) Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, pp. 263–273. Springer International Publishing, Cham (2020). https://doi.org/10.1007/978-3-030-59725-2_26

    Chapter  Google Scholar 

  12. Fan, D.P., Zhou, T., Ji, G.P., et al.: Inf-net: automatic covid-19 lung infection segmentation from ct images. IEEE Trans. Med. Imaging 39(8), 2626–2637 (2020). https://doi.org/10.1109/TMI.2020.2996645

    Article  Google Scholar 

  13. Fan, D.P., Ji, G.P., Qin, X., et al.: Cognitive vision inspired object segmentation metric and loss function. SCIENTIA SINICA Informationis 51(9), 1475 (2021). https://doi.org/10.1360/SSI-2020-0370. http://www.sciengine.com/publisher/ScienceChinaPress/journal/SCIENTIASINICAInformationis/51/9/10.1360/SSI-2020-0370

  14. Fan, D.P., Ji, G.P., Cheng, M.M., et al.: Concealed object detection. IEEE Trans. Pattern Anal. Mach. Intell. 44(10), 6024–6042 (2022). https://doi.org/10.1109/TPAMI.2021.3085766

    Article  Google Scholar 

  15. Fan, D.P., Ji, G.P., Xu, P., et al.: Advances in deep concealed scene understanding. Visual Intell 1(1), 16 (2023)

    Article  Google Scholar 

  16. Galun, Sharon, Basri, et al.: Texture segmentation by multiscale aggregation of filter responses and shape elements. In: Proceedings Ninth IEEE International Conference on Computer Vision, pp 716–723 vol.1 (2003). https://doi.org/10.1109/ICCV.2003.1238418

  17. Gao, S.H., Cheng, M.M., Zhao, K., et al.: Res2net: a new multi-scale backbone architecture. IEEE Trans. Pattern Anal. Mach. Intell. 43(2), 652–662 (2021). https://doi.org/10.1109/TPAMI.2019.2938758

    Article  Google Scholar 

  18. Goodfellow, I.J., Pouget-Abadie, J., Mirza, M., et al.: Generative adversarial nets. In: Ghahramani, Z., Welling, M., Cortes, C., et al. (eds.) Advances in Neural Information Processing Systems, vol. 27. Curran Associates, Inc. (2014)

    Google Scholar 

  19. He, C., Li, K., Zhang, Y., et al.: Camouflaged object detection with feature decomposition and edge reconstruction. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp 22046–22055 (2023)

  20. He, K., Zhang, X., Ren, S., et al.: Deep residual learning for image recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 770–778 (2016). https://doi.org/10.1109/CVPR.2016.90

  21. He, K., Gkioxari, G., Dollár, P., et al.: Mask r-cnn. In: 2017 IEEE International Conference on Computer Vision (ICCV), pp 2980–2988 (2017). https://doi.org/10.1109/ICCV.2017.322

  22. He, R., Dong, Q., Lin, J., et al.: Weakly-supervised camouflaged object detection with scribble annotations. Proceedings of the AAAI Conference on Artificial Intelligence 37(1), 781–789 (2023). https://doi.org/10.1609/aaai.v37i1.25156. https://ojs.aaai.org/index.php/AAAI/article/view/25156

  23. Hu, X., Wang, S., Qin, X., et al.: High-resolution iterative feedback network for camouflaged object detection. Proceedings of the AAAI Conference on Artificial Intelligence 37(1), 881–889 (2023). https://doi.org/10.1609/aaai.v37i1.25167. https://ojs.aaai.org/index.php/AAAI/article/view/25167

  24. Huang, Z., Dai, H., Xiang, T.Z., et al.: Feature shrinkage pyramid for camouflaged object detection with transformers. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp 5557–5566 (2023)

  25. Ji, G., Fan, D., Chou, Y., et al.: Deep gradient learning for efficient camouflaged object detection. Mach Intell Res 20(1), 92–108 (2023). https://doi.org/10.1007/s11633-022-1365-9

    Article  MathSciNet  Google Scholar 

  26. Ji, G.P., Zhu, L., Zhuge, M., et al.: Fast camouflaged object detection via edge-based reversible re-calibration network. Pattern Recogn 123, 108414 (2022). https://doi.org/10.1016/j.patcog.2021.108414

    Article  Google Scholar 

  27. Jia, Q., Yao, S., Liu, Y., et al.: Segment, magnify and reiterate: Detecting camouflaged objects the hard way. In: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp 4703–4712 (2022). https://doi.org/10.1109/CVPR52688.2022.00467

  28. Kendall, A., Gal, Y.: What uncertainties do we need in bayesian deep learning for computer vision? In: Guyon, I., Luxburg, U.V., Bengio, S., et al. (eds) Advances in Neural Information Processing Systems, vol 30. Curran Associates, Inc. (2017). https://proceedings.neurips.cc/paper_files/paper/2017/file/2650d6089a6d640c5e85b2b88265dc2b-Paper.pdf

  29. Khan, A., Khan, M., Gueaieb, W., et al.: Camofocus: Enhancing camouflage object detection with split-feature focal modulation and context refinement. In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), pp 1434–1443 (2024)

  30. Khan, S., Naseer, M., Hayat, M., et al.: Transformers in vision: a survey. ACM Comput Surv. (2022). https://doi.org/10.1145/3505244

    Article  Google Scholar 

  31. Kipf, T.N., Welling, M.: Semi-supervised classification with graph convolutional networks. In: International Conference on Learning Representations (2017). https://openreview.net/forum?id=SJU4ayYgl

  32. Lakshminarayanan, B., Pritzel, A., Blundell, C.: Simple and scalable predictive uncertainty estimation using deep ensembles. In: Guyon, I., Luxburg, U.V., Bengio, S., et al. (eds) Advances in Neural Information Processing Systems, vol 30. Curran Associates, Inc. (2017). https://proceedings.neurips.cc/paper_files/paper/2017/file/9ef2ed4b7fd2c810847ffa5fa85bce38-Paper.pdf

  33. Le, T.N., Nguyen, T.V., Nie, Z., et al.: Anabranch network for camouflaged object segmentation. Comput Vision Image Underst 184, 45–56 (2019). https://doi.org/10.1016/j.cviu.2019.04.006

    Article  Google Scholar 

  34. Le, X., Mei, J., Zhang, H., et al.: A learning-based approach for surface defect detection using small image datasets. Neurocomputing 408, 112–120 (2020). https://doi.org/10.1016/j.neucom.2019.09.107

    Article  Google Scholar 

  35. Li, A., Zhang, J., Lv, Y., et al.: Uncertainty-aware joint salient object and camouflaged object detection. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp 10066–10076 (2021). https://doi.org/10.1109/CVPR46437.2021.00994

  36. Li, S., Florencio, D., Li, W., et al.: A fusion framework for camouflaged moving foreground detection in the wavelet domain. IEEE Trans. Image Process. 27(8), 3918–3930 (2018). https://doi.org/10.1109/TIP.2018.2828329

    Article  MathSciNet  Google Scholar 

  37. Li, X., Yang, J., Li, S., et al.: Locate, refine and restore: A progressive enhancement network for camouflaged object detection. In: Elkind, E. (ed) Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, IJCAI-23. International Joint Conferences on Artificial Intelligence Organization, pp 1116–1124 (2023). https://doi.org/10.24963/ijcai.2023/124, main Track

  38. Liang, Y., Qin, G., Sun, M., et al.: A systematic review of image-level camouflaged object detection with deep learning. Neurocomputing 566, 127050 (2024). https://doi.org/10.1016/j.neucom.2023.127050

    Article  Google Scholar 

  39. Lin, T.Y., Goyal, P., Girshick, R., et al.: Focal loss for dense object detection. In: 2017 IEEE International Conference on Computer Vision (ICCV), pp 2999–3007 (2017). https://doi.org/10.1109/ICCV.2017.324

  40. Liu, J., Zhang, J., Barnes, N.: Confidence-aware learning for camouflaged object detection. arXiv preprint arXiv:2106.11641 (2021)

  41. . Liu, J., Zhang, J., Barnes, N.: Modeling aleatoric uncertainty for camouflaged object detection. In: 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), pp 2613–2622 (2022), https://doi.org/10.1109/WACV51458.2022.00267

  42. Liu, L., Wang, R., Xie, C., et al.: Pestnet: an end-to-end deep learning approach for large-scale multi-class pest detection and classification. IEEE Access 7, 45301–45312 (2019). https://doi.org/10.1109/ACCESS.2019.2909522

    Article  Google Scholar 

  43. Liu, Z., Huang, K., Tan, T.: Foreground object detection using top-down information based on em framework. IEEE Trans. Image Process. 21(9), 4204–4217 (2012). https://doi.org/10.1109/TIP.2012.2200492

    Article  MathSciNet  Google Scholar 

  44. Liu, Z., Lin, Y., Cao, Y., et al.: Swin transformer: Hierarchical vision transformer using shifted windows. In: 2021 IEEE/CVF International Conference on Computer Vision (ICCV), pp 9992–10002 (2021). https://doi.org/10.1109/ICCV48922.2021.00986

  45. Liu, Z., Zhang, Z., Tan, Y., et al.: Boosting camouflaged object detection with dual-task interactive transformer. In: 2022 26th International Conference on Pattern Recognition (ICPR), pp 140–146 (2022). https://doi.org/10.1109/ICPR56361.2022.9956724

  46. Luo, Z., Liu, N., Zhao, W., et al.: Vscode: General visual salient and camouflaged object detection with 2d prompt learning. arXiv preprint arXiv:2311.15011 (2023)

  47. Lv, Y., Zhang, J., Dai, Y., et al.: Simultaneously localize, segment and rank the camouflaged objects. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp 11586–11596 (2021). https://doi.org/10.1109/CVPR46437.2021.01142

  48. Lv, Y., Zhang, J., Dai, Y., et al.: Toward deeper understanding of camouflaged object detection. IEEE Trans. Circuits Syst. Video Technol. 33(7), 3462–3476 (2023). https://doi.org/10.1109/TCSVT.2023.3234578

    Article  Google Scholar 

  49. Lyu, Y., Zhang, H., Li, Y., et al.: Uedg:uncertainty-edge dual guided camouflage object detection. IEEE Trans. Multimed 26, 4050–4060 (2024). https://doi.org/10.1109/TMM.2023.3295095

    Article  Google Scholar 

  50. Mao, Y., Zhang, J., Wan, Z., et al.: Transformer transforms salient object detection and camouflaged object detection. arXiv preprint arXiv:2104.101271(2), 5 (2021)

  51. Margolin, R., Zelnik-Manor, L., Tal, A.: How to evaluate foreground maps. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition, pp 248–255 (2014). https://doi.org/10.1109/CVPR.2014.39

  52. Mei, H., Ji, G.P., Wei, Z., et al.: Camouflaged object segmentation with distraction mining. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp 8768–8777 (2021). https://doi.org/10.1109/CVPR46437.2021.00866

  53. Mnih, V., Heess, N., Graves, A., et al.: Recurrent models of visual attention. In: Ghahramani, Z., Welling, M., Cortes, C., et al. (eds) Advances in Neural Information Processing Systems, vol 27. Curran Associates, Inc. (2014). https://proceedings.neurips.cc/paper_files/paper/2014/file/09c6c3783b4a70054da74f2538ed47c6-Paper.pdf

  54. Pang, Y., Zhao, X., Zhang, L., et al.: Multi-scale interactive network for salient object detection. In: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp 9410–9419 (2020). https://doi.org/10.1109/CVPR42600.2020.00943

  55. Pang, Y., Zhao, X., Xiang, T.Z., et al.: Zoom in and out: A mixed-scale triplet network for camouflaged object detection. In: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp 2150–2160 (2022). https://doi.org/10.1109/CVPR52688.2022.00220

  56. Paragios, N., Mittal, A.: Motion-based background subtraction using adaptive kernel density estimation. In: Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol 2. IEEE Computer Society, Los Alamitos, CA, USA, pp 302–309 (2004). https://doi.org/10.1109/CVPR.2004.164. https://doi.ieeecomputersociety.org/10.1109/CVPR.2004.164

  57. Park, K.B., Lee, J.Y.: Tcu-net: transformer and convolutional neural network-based advanced u-net for concealed object detection. IEEE Access 10, 122347–122360 (2022). https://doi.org/10.1109/ACCESS.2022.3223424

    Article  Google Scholar 

  58. Pei, J., Cheng, T., Fan, D.P., et al.: Osformer: one-stage camouflaged instance segmentation with transformers. In: Avidan, S., Brostow, G., Cissé, M., et al. (eds.) Computer Vision - ECCV 2022, pp. 19–37. Cham, Springer Nature (2022). https://doi.org/10.1007/978-3-031-19797-0_2

    Chapter  Google Scholar 

  59. Perazzi, F., Krähenbühl, P., Pritch, Y., et al.: Saliency filters: Contrast based filtering for salient region detection. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition, pp 733–740 (2012).https://doi.org/10.1109/CVPR.2012.6247743

  60. Qin, X., Fan, D.P., Huang, C., et al.: Boundary-aware segmentation network for mobile and web applications. arXiv preprint arXiv:2101.04704 (2021)

  61. Ranftl, R., Lasinger, K., Hafner, D., et al.: Towards robust monocular depth estimation: mixing datasets for zero-shot cross-dataset transfer. IEEE Trans. Pattern Anal. Mach. Intell. 44(3), 1623–1637 (2022). https://doi.org/10.1109/TPAMI.2020.3019967

    Article  Google Scholar 

  62. Ren, J., Hu, X., Zhu, L., et al.: Deep texture-aware features for camouflaged object detection. IEEE Trans. Circuits Syst. Video Technol. 33(3), 1157–1167 (2023). https://doi.org/10.1109/TCSVT.2021.3126591

    Article  Google Scholar 

  63. Ren, S., He, K., Girshick, R., et al.: Faster r-cnn: towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 39(6), 1137–1149 (2017). https://doi.org/10.1109/TPAMI.2016.2577031

    Article  Google Scholar 

  64. Ronneberger, O., Fischer, P., Brox, T.: U-net: convolutional networks for biomedical image segmentation. In: Navab, N., Hornegger, J., Wells, W.M., et al. (eds.) Medical image computing and computer-assisted intervention—MICCAI 2015, pp. 234–241. Springer International Publishing, Cham (2015)

    Google Scholar 

  65. Shen, Y., Zhang, Z., Sabuncu, M.R., et al.: Real-time uncertainty estimation in computer vision via uncertainty-aware distribution distillation. In: 2021 IEEE Winter Conference on Applications of Computer Vision (WACV), pp 707–716 (2021). https://doi.org/10.1109/WACV48630.2021.00075

  66. Shi, Z., XU, X., Liu, X., et al.: Video frame interpolation transformer. In: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp 17461–17470 (2022). https://doi.org/10.1109/CVPR52688.2022.01696

  67. Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: International Conference on Learning Representations (2021).

  68. Skurowski, P., Abdulameer, H., Błaszczyk, J., et al.: Animal camouflage analysis: Chameleon database. Unpublished Manuscript (2018)

  69. Song, Y., Li, X., Qi, L.: Camouflaged object detection with feature grafting and distractor aware. In: 2023 IEEE International Conference on Multimedia and Expo (ICME), pp 2459–2464 (2023). https://doi.org/10.1109/ICME55011.2023.00419

  70. Song, Z., Kang, X., Wei, X., et al.: Fsnet: Focus scanning network for camouflaged object detection. IEEE Trans. Image Process. 32, 2267–2278 (2023). https://doi.org/10.1109/TIP.2023.3266659

    Article  Google Scholar 

  71. Sun, D., Jiang, S., Qi, L.: Edge-aware mirror network for camouflaged object detection. In: 2023 IEEE International Conference on Multimedia and Expo (ICME), pp 2465–2470 (2023). https://doi.org/10.1109/ICME55011.2023.00420

  72. Sun, Y., Chen, G., Zhou, T., et al.: Context-aware cross-level fusion network for camouflaged object detection. In: Zhou, Z.H. (ed) Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, IJCAI-21. International Joint Conferences on Artificial Intelligence Organization, pp 1025–1031 (2021). https://doi.org/10.24963/ijcai.2021/142, main Track

  73. Sun, Y., Wang, S., Chen, C., et al.: Boundary-guided camouflaged object detection. In: Raedt, L.D. (ed) Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, IJCAI-22. International Joint Conferences on Artificial Intelligence Organization, pp 1335–1341 (2022). https://doi.org/10.24963/ijcai.2022/186, main Track

  74. Sungho, Kim: Unsupervised spectral-spatial feature selection-based camouflaged object detection using vnir hyperspectral camera. The Scientific World Journal p 834635 (2015). https://doi.org/10.1155/2015/834635

  75. Tankus, A., Yeshurun, Y.: Detection of regions of interest and camouflage breaking by direct convexity estimation. In: Proceedings 1998 IEEE Workshop on Visual Surveillance, pp 42–48 (1998). https://doi.org/10.1109/WVS.1998.646019

  76. Tankus, A., Yeshurun, Y.: Convexity-based visual camouflage breaking. Computer Vision and Image Understanding 82(3), 208–237 (2001). https://doi.org/10.1006/cviu.2001.0912. https://www.sciencedirect.com/science/article/pii/S1077314201909129

  77. Vaswani, A., Shazeer, N., Parmar, N., et al.: Attention is all you need. In: Guyon, I., Luxburg, U.V., Bengio, S., et al. (eds) Advances in Neural Information Processing Systems, vol 30. Curran Associates, Inc. (2017). https://proceedings.neurips.cc/paper_files/paper/2017/file/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf

  78. Wang, K., Bi, H., Zhang, Y., et al.: \(\text{ D}^{2}\)c-net: a dual-branch, dual-guidance and cross-refine network for camouflaged object detection. IEEE Trans. Industr. Electron. 69(5), 5364–5374 (2022). https://doi.org/10.1109/TIE.2021.3078379

    Article  Google Scholar 

  79. Wang, W., Xie, E., Li, X., et al.: Pyramid vision transformer: A versatile backbone for dense prediction without convolutions. In: 2021 IEEE/CVF International Conference on Computer Vision (ICCV), pp 548–558 (2021). https://doi.org/10.1109/ICCV48922.2021.00061

  80. Wang, W., Xie, E., Li, X., et al.: Pvt v2: improved baselines with pyramid vision transformer. Comput Visual Med 8(3), 415–424 (2022)

    Article  Google Scholar 

  81. Hou, JYYH Wendi, Li, Jinping: Detection of the mobile object with camouflage color under dynamic background based on optical flow. Proc Eng. 15, 2201–2205 (2011). https://doi.org/10.1016/j.proeng.2011.08.412.

  82. Woo, S., Park, J., Lee, J.Y., et al.: CBAM: convolutional block attention module. In: Ferrari, V., Hebert, M., Sminchisescu, C., et al. (eds.) Computer vision—ECCV 2018. Lecture notes in computer science, vol. 11211, pp. 3–19. Springer International Publishing, Cham (2018). https://doi.org/10.1007/978-3-030-01234-2_1

    Chapter  Google Scholar 

  83. Wu, H., Xiao, B., Codella, N., et al.: Cvt: Introducing convolutions to vision transformers. In: 2021 IEEE/CVF International Conference on Computer Vision (ICCV), pp 22–31 (2021).https://doi.org/10.1109/ICCV48922.2021.00009

  84. Wu, Y.H., Liu, Y., Zhang, L., et al.: Edn: salient object detection via extremely-downsampled network. IEEE Trans. Image Process. 31, 3125–3136 (2022). https://doi.org/10.1109/TIP.2022.3164550

    Article  Google Scholar 

  85. Xiang, M., Zhang, J., Lv, Y., et al.: Exploring depth contribution for camouflaged object detection. arXiv preprint arXiv:2106.13217 (2021)

  86. Xing, H., Gao, S., Wang, Y., et al.: Go closer to see better: camouflaged object detection via object area amplification and figure-ground conversion. IEEE Trans. Circuits Syst. Video Technol. 33(10), 5444–5457 (2023). https://doi.org/10.1109/TCSVT.2023.3255304

    Article  Google Scholar 

  87. Xue, F., Guoying, C., Hong, R., et al.: Camouflage texture evaluation using a saliency map. Multim Syst 21(2), 169–175 (2015). https://doi.org/10.1007/s00530-014-0368-y

    Article  Google Scholar 

  88. Yan, J., Le, T.N., Nguyen, K.D., et al.: Mirrornet: bio-inspired camouflaged object segmentation. IEEE Access 9, 43290–43300 (2021). https://doi.org/10.1109/ACCESS.2021.3064443

    Article  Google Scholar 

  89. Yang, F., Zhai, Q., Li, X., et al.: Uncertainty-guided transformer reasoning for camouflaged object detection. In: 2021 IEEE/CVF International Conference on Computer Vision (ICCV), pp 4126–4135 (2021). https://doi.org/10.1109/ICCV48922.2021.00411

  90. Yin, B., Zhang, X., Hou, Q., et al.: Camoformer: Masked separable attention for camouflaged object detection. arXiv preprint arXiv:2212.06570 (2022)

  91. Yuan, L., Chen, Y., Wang, T., et al.: Tokens-to-token vit: Training vision transformers from scratch on imagenet. In: 2021 IEEE/CVF International Conference on Computer Vision (ICCV), pp 538–547 (2021). https://doi.org/10.1109/ICCV48922.2021.00060

  92. Zhai, Q., Li, X., Yang, F., et al.: Mutual graph learning for camouflaged object detection. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp 12992–13002 (2021). https://doi.org/10.1109/CVPR46437.2021.01280

  93. Zhan, C., Wang, A., Wang, M.: Camouflage object segmentation method based on channel attention and edge fusion. Journal of Computer Applications 43(07), 2166–2172 (2023)

    Google Scholar 

  94. Zhang, Q., Sun, X., Chen, Y., et al.: Attention-induced semantic and boundary interaction network for camouflaged object detection. Comput Vision aImage Underst 233, 103719 (2023). https://doi.org/10.1016/j.cviu.2023.103719

    Article  Google Scholar 

  95. Zhang, X., Zhu, C., Wang, S., et al.: A Bayesian approach to camouflaged moving object detection. IEEE Trans. Circuits Syst. Video Technol. 27(9), 2001–2013 (2017). https://doi.org/10.1109/TCSVT.2016.2555719

    Article  Google Scholar 

  96. Zhang, Y., Zhang, J., Hamidouche, W., et al.: Predictive uncertainty estimation for camouflaged object detection. IEEE Trans. Image Process. 32, 3580–3591 (2023). https://doi.org/10.1109/TIP.2023.3287137

    Article  Google Scholar 

  97. Zhao, J., Liu, J.J., Fan, D.P., et al.: Egnet: Edge guidance network for salient object detection. In: 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pp 8778–8787 (2019). https://doi.org/10.1109/ICCV.2019.00887

  98. Zheng, D., Zheng, X., Yang, L.T., et al.: Mffn: Multi-view feature fusion network for camouflaged object detection. In: 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), pp 6221–6231 (2023). https://doi.org/10.1109/WACV56688.2023.00617

  99. Zheng, S., Lu, J., Zhao, H., et al.: Rethinking semantic segmentation from a sequence-to-sequence perspective with transformers. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp 6877–6886 (2021). https://doi.org/10.1109/CVPR46437.2021.00681

  100. Zhong, Y., Li, B., Tang, L., et al.: Detecting camouflaged object in frequency domain. In: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp 4494–4503 (2022). https://doi.org/10.1109/CVPR52688.2022.00446

  101. Zhou, T., Zhou, Y., Gong, C., et al.: Feature aggregation and propagation network for camouflaged object detection. IEEE Trans. Image Process. 31, 7036–7047 (2022). https://doi.org/10.1109/TIP.2022.3217695

    Article  Google Scholar 

  102. Zhu, H., Li, P., Xie, H., et al.: I can find you! boundary-guided separated attention network for camouflaged object detection. In: Thirty-Sixth AAAI Conference on Artificial Intelligence, pp 3608–3616 (2022). https://doi.org/10.1609/aaai.v36i3.20273

  103. Zhu, J., Zhang, X., Zhang, S., et al.: Inferring camouflaged objects by texture-aware interactive guidance network. Proceedings of the AAAI Conference on Artificial Intelligence 35(4), 3599–3607 (2021). https://doi.org/10.1609/aaai.v35i4.16475

    Article  Google Scholar 

  104. Zhuge, M., Lu, X., Guo, Y., et al.: Cubenet: X-shape connection for camouflaged object detection. Pattern Recogn 127, 108644 (2022). https://doi.org/10.1016/j.patcog.2022.108644

    Article  Google Scholar 

Download references

Acknowledgements

This research was partially supported by the National Natural Science Foundation of China under Grant(62162013), Guizhou Normal University New Academic Talent Fund Project (Qian Shi Xin Miao [2022] No. 30), and Deep Learning-Based Camouflaged Object Detection–the National Entrepreneurship Training Program(202210663045).

Author information

Authors and Affiliations

Authors

Contributions

J.M.: conceptualization, methodology, validation, visualization, data curation, writing-original draft, writing-review. A.Z.: conceptualization, formal analysis, resources, supervision, writing-review. C.H.: formal analysis, data curation, supervision, writing-review. J.T.: data curation, supervision, writing-review. All authors reviewed the manuscript.

Corresponding author

Correspondence to Anzhi Wang.

Ethics declarations

Conflict of interest

The authors declare that they have no Conflict of interest.

Additional information

Communicated by Junyu Gao.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhong, J., Wang, A., Ren, C. et al. A survey on deep learning-based camouflaged object detection. Multimedia Systems 30, 268 (2024). https://doi.org/10.1007/s00530-024-01478-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • Version of record:

  • DOI: https://doi.org/10.1007/s00530-024-01478-7

Keywords

Profiles

  1. Anzhi Wang