Abstract
As a common building decoration material, ceramic tiles have been widely used in modern society, and deep learning inspection methods are increasingly employed for tile quality inspection. However, current methods face issues such as slow detection velocity and diminished precision in ceramic tiles detection. To resolve these issues, this study presents a dense detection algorithm for ceramic tile defects with an improved YOLOv8. The model redesigns the CSPLayer (Cross Stage Partial Layer) structure by incorporating the BiFormer architecture, and the SCConv (Spatial and Channel Reconstruction Convolution) is employed to replace the ordinary convolution in the Neck and Head. Furthermore, the MPDIoU + DFL (Distribution Focal Loss) is adopted as the bounding box regression loss function, and the EMA (Efficient Multi-Scale Attention mechanism) attention module is introduced to improve the significance and precision of the defective feature information detection. Experimental results indicate that the final improved model has a size of 58.6 MB, the mAP@0.5 reaches 95.62%, and the FPS is 145.4.









Similar content being viewed by others
Data availability
The data that support the findings of this article are available from the corresponding author upon request.
References
Ahmad, H. M., & Rahimi, A. (2022). Deep learning methods for object detection in smart manufacturing: A survey. Journal of Manufacturing Systems, 64, 181–196. https://doi.org/10.1016/j.jmsy.2022.06.011
Aliyun Alibaba. (2023). Ceramic tile defect detection data set. https://tianchi.aliyun.com/dataset/110088
Cao, Y., Wang, Y., Feng, H., Wang, T. (2022). Method for detecting surface defects of ceramic tile based on improved Cascade RCNN. In: 2022 4th International Conference on Frontiers Technology of Information and Computer (ICFTIC) (pp. 41–45). IEEE. https://doi.org/10.1109/ICFTIC57696.2022.10075095
Chen, H., Wang, Y., Guo, T., Xu, C., Deng, Y., Liu, Z., Ma, S., Xu, C., Gao, W. (2021). Pre-trained image processing transformer, In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (pp. 12299–12310). https://doi.org/10.48550/arXiv.2012.00364
Chen, Y., & Pock, T. (2016). Trainable nonlinear reaction diffusion: A flexible framework for fast and effective image restoration. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39(6), 1256–1272. https://doi.org/10.1109/TPAMI.2016.2596743
Dabov, K., Foi, A., Katkovnik, V., Egiazarian, K. (2007). Color image denoising via sparse 3D collaborative filtering with grouping constraint in luminance-chrominance space. In: 2007 IEEE International Conference on Image Processing (ICIP), I-313-I-316. https://doi.org/10.1109/ICIP.2007.4378954
Du, S., Xu, R., & Li, L. (2016). Modeling and analysis of multiproduct multistage manufacturing system for quality improvement. IEEE Transaction on Systems, Man, and Cybernetics: Systems, 48(5), 801–820. https://doi.org/10.1109/TSMC.2016.2614766
Durga, B. K., & Rajesh, V. (2022). A ResNet deep learning based facial recognition design for future multimedia applications. Computers and Electrical Engineering, 104, 108384. https://doi.org/10.1016/j.compeleceng.2022.108384
Eren, E., Kurama, S., & Solodov, I. (2012). Characterization of porosity and defect imaging in ceramic tile using ultrasonic inspections. Ceramics International, 38(3), 2145–2151. https://doi.org/10.1016/j.ceramint.2011.10.056
García-Martín, J., Gómez-Gil, J., & Vázquez-Sánchez, E. (2011). Non-destructive techniques based on eddy current testing. Sensors, 11(3), 2525–2565. https://doi.org/10.3390/s110302525
Ge, Z., Liu, S., Wang, F., Li, Z., & Sun, J. (2021). Yolox: Exceeding yolo series in 2021. arXiv preprint arXiv: 2107. https://doi.org/10.48550/arXiv.2107.08430
Girshick R. (2015). Fast r-cnn. In: Proceedings of the IEEE international conference on computer vision (pp. 1440–1448). https://doi.org/10.1109/ICCV.2015.169
Gong H., Li H., Xu K., Zhang Y. (2019). Object detection based on improved YOLOv3-tiny. In: 2019 Chinese automation congress (CAC) (pp. 3240–3245). IEEE. https://doi.org/10.1109/CAC48633.2019.8996750
Hanzaei, S. H., Afshar, A., & Barazandeh, F. (2017). Automatic detection and classification of the ceramic tiles’ surface defects. Pattern Recognition, 66, 174–189. https://doi.org/10.1016/j.patcog.2016.11.021
He, K., Zhang, X., Ren, S., & Sun, J. (2015). Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 37(9), 1904–1916. https://doi.org/10.1109/TPAMI.2015.2389824
Howard, A., Sandler, M., Chu, G., Chen, LC., Chen, B., Tan, M., Wang, W., Zhu, Y., Pang, R., Vasudevan, V., Le, QV., Adam, H. (2019). Searching for mobilenetv3. In: Proceedings of the IEEE/CVF international conference on computer vision (pp. 1314–1324). https://doi.org/10.1109/ICCV.2019.00140
Huang, D., Du, S., Li, G., & Wu, Z. (2017). A systematic approach for online minimizing volume difference of multiple chambers in machining processes based on high-definition metrology. Journal of Manufacturing Science and Engineering, 139(8), 081003. https://doi.org/10.1115/1.4035897
Hurtik, P., Molek, V., & Hula, J. (2019). Data preprocessing technique for neural networks based on image represented by a fuzzy function. IEEE Transactions on Fuzzy Systems, 28(7), 1195–1204. https://doi.org/10.1109/TFUZZ.2019.2911494
Jiang, Z., Zhao, L., Li, S., & Jia, Y. (2020). Real-time object detection method based on improved YOLOv4-tiny. arXiv preprint arXiv. https://doi.org/10.48550/arXiv.2011.04244
Jocher, G. (2021). YOLOv5 by Ultralytics. https://github.com/ultralytics/yolov5
Jocher, G., Chaurasia, A., Qiu, J. (2023). YOLOv8 by Ultralytics. https://github.com/ultralytics/ultralytics
Kesharaju, M., Nagarajah, R., Zhang, T., & Crouch, I. (2014). Ultrasonic sensor based defect detection and characterisation of ceramics. Ultrasonics, 54(1), 312–317. https://doi.org/10.1016/j.ultras.2013.07.018
Kimori, Y. (2022). A morphological image preprocessing method based on the geometrical shape of lesions to improve the lesion recognition performance of convolutional neural networks. IEEE Access, 10, 70919–70936. https://doi.org/10.1109/ACCESS.2022.3187507
Kumari, S., Jain, R., Kumar, U., Yadav, I., Ranjan, N., Kumari, K., Kesharwani, R. K., Kumar, S., Pal, S., & Pal, S. K. (2019). Defect identification in friction stir welding using continuous wavelet transform. Journal of Intelligent Manufacturing, 30, 483–494. https://doi.org/10.1007/s10845-016-1259-1
Li, G., Du, S., Huang, D., Zhao, C., & Deng, Y. (2019a). Dynamics modeling-based optimization of process parameters in face milling of workpieces with discontinuous surfaces. Manufacturing Science and Engineering, 141(10), 101009. https://doi.org/10.1115/1.4044421
Li, G., Du, S., Huang, D., Zhao, C., & Deng, Y. (2019b). Elastic mechanics-based fixturing scheme optimization of variable stiffness structure workpieces for surface quality improvement. Precision Engineering, 56, 343–363. https://doi.org/10.1016/j.precisioneng.2019.01.004
Li, G., Du, S., Wang, B., Lv, J., & Deng, Y. (2021). High definition metrology-based quality improvement of surface texture in face milling of workpieces with discontinuous surfaces. Journal of Manufacturing Science and Engineering, 144(3), 031001. https://doi.org/10.1115/1.4051883
Li, J., Quan, X., & Wang, Y. (2020). Research on defect detection algorithm of ceramic tile surface with multi-feature fusion. Computer Engineering and Applications, 56(15), 191–198. https://doi.org/10.3778/j.issn.1002-8331.1907-0130
Li, J., Wen, Y., He, L. (2023). Scconv: spatial and channel reconstruction convolution for feature redundancy. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 6153–6162). https://doi.org/10.1109/CVPR52729.2023.00596
Lin, TY., Goyal, P., Girshick, R., He, K., Dollár, P. (2017). Focal loss for dense object detection, In: Proceedings of the IEEE international conference on computer vision (pp. 2980–2988). https://doi.org/10.48550/arXiv.1708.02002
Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, CY., Berg, AC. (2016). SSD: Single shot multibox detector. In: Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11–14, 2016, Proceedings, Part I 14. Springer International Publishing (pp. 21–37). https://doi.org/10.48550/arXiv.1512.02325
Lu, Q., Lin, J., Luo, L., Zhang, Y., & Zhu, W. (2022). A supervised approach for automated surface defect detection in ceramic tile quality control. Advanced Engineering Informatics, 53, 101692. https://doi.org/10.1016/j.aei.2022.101692
Siliang, M., & Yong, X. (2023). Mpdiou: A loss for efficient and accurate bounding box regression. arXiv preprint arXiv. https://doi.org/10.48550/arXiv.2307.07662
Ma, N., Zhang, X., Zheng, HT., Sun, J. (2018). Shufflenet v2: Practical guidelines for efficient cnn architecture design. In: Proceedings of the European conference on computer vision (ECCV) (pp. 116–131). https://doi.org/10.48550/arXiv.1807.11164
Monthly Report on 2023 Statistics, General Administration of Customs of the People’s Republic of China. (2023) http://www.customs.gov.cn/customs/302249/zfxxgk/2799825/302274/302277/3512606/index.html
Ouyang, D., He, S., Zhang, G., Luo, M., Zhan, J., Huang, Z. (2023). Efficient multi-scale attention module with cross-spatial learning. In: ICASSP 2023–2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 1–5). IEEE. https://doi.org/10.1109/ICASSP49357.2023.10096516
Qi, W. (2022). Object detection in high resolution optical image based on deep learning technique. Natural Hazards Research, 2(4), 384–392. https://doi.org/10.1016/j.nhres.2022.10.002
Redmon, J., & Farhadi, A. (2018). Yolov3: An incremental improvement. ArXiv. https://doi.org/10.48550/arXiv.1804.02767
Ren, S., He, K., Girshick, R., & Sun, J. (2015). Faster R-CNN: towards real-time object detection with region proposal networks. Advances in Neural Information Processing Systems. https://doi.org/10.48550/arXiv.1506.01497
Shao, Y., Yin, Y., Du, S., & Xi, L. (2019). A surface connectivity based approach for leakage channel prediction in static sealing interface. ASME Transaction on Journal of Tribology. https://doi.org/10.1115/1.4043123
Shao, Y., Yin, Y., Du, S., Xia, T., & Xi, L. (2018). Leakage monitoring in static sealing interface based on three dimensional surface topography indicator. ASME Transaction on Manufacturing Science and Engineering, 140(10), 101003. https://doi.org/10.1115/1.4040620
Tasci, B., & Tasci, I. (2022). Deep feature extraction based brain image classification model using preprocessed images: PDRNet. Biomedical Signal Processing and Control, 78, 103948. https://doi.org/10.1016/j.bspc.2022.103948
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, L., & Polosukhin, I. (2017). Attention is all you need. Advances in Neural Information Processing Systems. https://doi.org/10.48550/arXiv.1706.03762
Wan, G., Fang, H., Wang, D., Yan, J., & Xie, B. (2022). Ceramic tile surface defect detection based on deep learning. Ceramics International, 48(8), 11085–11093. https://doi.org/10.1016/j.ceramint.2022.02.080
Wang CY., Bochkovskiy A., Liao HYM. (2023). YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 7464–7475). https://doi.org/10.1109/CVPR52729.2023.00721
Wang, CY., Liao, HYM., Yeh, IH., Wu, YH., Chen, PY., Hsieh, JW. (2020). CSPNet: A new backbone that can enhance learning capability of CNN. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops (pp. 390–391). https://doi.org/10.1109/CVPRW50498.2020.00203
Wang, K., Li, G., Du, S., Xi, L., & Xia, T. (2021). State space modelling of variation propagation in multistage machining processes for variable stiffness structure workpieces. International Journal of Production Research, 59(13), 4033–4052. https://doi.org/10.1080/00207543.2020.1757173
Wang, K., Li, Z., & Wang, X. (2022a). Concatenated network fusion algorithm (CNFA) based on deep learning: Improving the detection accuracy of surface defects for ceramic tile. Applied Sciences, 12, 1249. https://doi.org/10.3390/app12031249
Wang, Y., Li, Y. Z., Lai, Q. Q., Li, S. T., & Huang, J. (2022b). RU-Net: An improved U-Net placenta segmentation network based on ResNet. Computer Methods and Programs in Biomedicine, 227, 107206. https://doi.org/10.1016/j.cmpb.2022.107206
Zhang, K., Zuo, W., Gu, S., Zhang, L. (2017). Learning deep CNN denoiser prior for image restoration, In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (pp. 3929–3938). https://doi.org/10.48550/arXiv.1704.03264
Zhang, Y., Tian, Y., Kong, Y., Zhong, B., Fu, Y. (2020). Residual dense network for image restoration, In: Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 2472–2481). https://doi.org/10.48550/arXiv.1812.10477
Zhang, K., Zuo, W., Chen, Y., Meng, D., & Zhang, L. (2017b). Beyond a gaussian denoiser: Residual learning of deep cnn for image denoising. IEEE Transactions on Image Processing, 26, 3142–3155. https://doi.org/10.1109/TIP.2017.2662206
Zhang, K., Zuo, W., & Zhang, L. (2018). FFDNet: Toward a fast and flexible solution for CNN-based image denoising. IEEE Transactions on Image Processing, 27(9), 4608–4622. https://doi.org/10.1109/TIP.2018.2839891
Zhou, X., Koltun, V., & Krähenbühl, P. (2021). Probabilistic two-stage detection. ArXiv. https://doi.org/10.48550/arXiv.2103.07461
Zhu, L., Wang, X., Ke, Z., Zhang, W., Lau, RWH. (2023). Biformer: Vision transformer with bi-level routing attention. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 10323–10333). https://doi.org/10.48550/arXiv.2303.08810
Acknowledgements
This work is supported by Guizhou Provincial Science and Technology Projects (ZK[2023]060), National Key Research and Development Program (No. 2016YFD0201305-07), Open Fund Project in Semiconductor Power Device Reliability Engineering Center of Ministry of Education (ERCMEKFJJ2019-06), Special Field Project of Guizhou Provincial Education Department (KY[2020]056).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing interest
The author has no conflicts of interest to declare related to the content of this article. The dataset used the tile defect dataset provided by the Alibaba Tianchi Competition, the website address is https://tianchi.aliyun.com/.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Yu, M., Li, Y., Li, Z. et al. Dense detection algorithm for ceramic tile defects based on improved YOLOv8. J Intell Manuf 36, 5613–5628 (2025). https://doi.org/10.1007/s10845-024-02523-y
Received:
Accepted:
Published:
Version of record:
Issue date:
DOI: https://doi.org/10.1007/s10845-024-02523-y

