Steel Surface Defect Detection Method Based on Improved YOLOv8
DOI:
https://doi.org/10.54097/ajst.v8i2.14944Keywords:
Steel surface defect detection; YOLOv8; DBB; MSCA; GDetect; Lightweight.Abstract
This research aims to improve the accuracy and efficiency of steel surface defect detection and address the current insufficient algorithm performance in steel defect detection, especially the challenges in multi-scale feature extraction and parameter efficiency. Based on the improved YOLOv8 network structure, three key improvement points are introduced: the DBB module optimizes multi-scale feature extraction, the MSCA attention mechanism enhances the model's defect detection accuracy, and the optimized GDetect module reduces the amount of parameters and calculations. Extensive experiments and comparative analysis using the NEU-DET data set verify the effectiveness of the improved method. Ablation experiments show that using DBB or MSCA alone has limited performance improvement, but when MSCA and GDetect are combined, the model has significant improvements in Precision, Recall, and mAP50 indicators. Comparative experiments show that the performance of this algorithm is close to or even surpasses some more complex models while remaining relatively lightweight. The average accuracy mean analysis for six defect categories also shows the superior performance of this algorithm for various types of defects. The steel surface defect detection method proposed in this study based on the improved YOLOv8 network has made significant progress in the field of steel defect detection. The improved method improves detection accuracy, solves common problems, and achieves high performance while maintaining lightweight, providing an efficient and feasible solution for steel surface defect detection.
Downloads
References
REN S, HE K, GIRSHICK R, et al. Faster r-cnn: Towards real-time object detection with region proposal networks [J]. Advances in neural information processing systems, 2015, 28.
Lin T Y, Dollár P, Girshick R, et al. Feature pyramid networks for object detection[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2017: 2117-2125.
Girshick R. Fast r-cnn[C]//Proceedings of the IEEE international conference on computer vision. 2015: 1440-1448.
Tian Z, Shen C, Chen H, et al. Fcos: Fully convolutional one-stage object detection[C]//Proceedings of the IEEE/CVF international conference on computer vision. 2019: 9627-9636.
Liu W, Anguelov D, Erhan D, et al. Ssd: Single shot multibox detector[C]//Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11–14, 2016, Proceedings, Part I 14. Springer International Publishing, 2016: 21-37.
Lin T Y, Goyal P, Girshick R, et al. Focal loss for dense object detection[C]//Proceedings of the IEEE international conference on computer vision. 2017: 2980-2988.
Redmon J, Farhadi A. Yolov3: An incremental improvement[J]. arXiv preprint arXiv:1804.02767, 2018.
Bochkovskiy A, Wang C Y, Liao H Y M. Yolov4: Optimal speed and accuracy of object detection[J]. arXiv preprint arXiv:2004.10934, 2020.
Ge Z, Liu S, Wang F, et al. Yolox: Exceeding yolo series in 2021[J]. arXiv preprint arXiv:2107.08430, 2021.
Li C, Li L, Jiang H, et al. YOLOv6: A single-stage object detection framework for industrial applications[J]. arXiv preprint arXiv:2209.02976, 2022.
Wang C Y, Bochkovskiy A, Liao H Y M. YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2023: 7464-7475.
Li M, Wang H, Wan Z. Surface defect detection of steel strips based on improved YOLOv4[J]. Computers and Electrical Engineering, 2022, 102: 108208.
Chen H, Du Y, Fu Y, et al. DCAM-Net: A Rapid Detection Network for Strip Steel Surface Defects Based on Deformable Convolution and Attention Mechanism[J]. IEEE Transactions on Instrumentation and Measurement, 2023, 72: 1-12.
Zhang Y, Liu X, Guo J, et al. Surface Defect Detection of Strip-Steel Based on an Improved PP-YOLOE-m Detection Network[J]. Electronics, 2022, 11(16): 2603.
Zhao C, Shu X, Yan X, et al. RDD-YOLO: A modified YOLO for detection of steel surface defects[J]. Measurement, 2023, 214: 112776.
Wang Y, Wang H, **n Z. Efficient detection model of steel strip surface defects based on YOLO-V7[J]. IEEE Access, 2022, 10: 133936-133944.
Ding X, Zhang X, Han J, et al. Diverse branch block: Building a convolution as an inception-like unit[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021: 10886-10895.
Guo M H, Lu C Z, Hou Q, et al. Segnext: Rethinking convolutional attention design for semantic segmentation[J]. Advances in Neural Information Processing Systems, 2022, 35: 1140-1156.
Ding X, Zhang X, Ma N, et al. Repvgg: Making vgg-style convnets great again[C]//Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2021: 13733-13742.
Marnerides D, Bashford‐Rogers T, Hatchett J, et al. Expandnet: A deep convolutional neural network for high dynamic range expansion from low dynamic range content[C]//Computer Graphics Forum. 2018, 37(2): 37-49.
Cao J, Li Y, Sun M, et al. Do-conv: Depthwise over-parameterized convolutional layer[J]. IEEE Transactions on Image Processing, 2022, 31: 3726-3736.
Tian C, Xu Y, Zuo W, et al. Asymmetric CNN for image superresolution[J]. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2021, 52(6): 3718-3730.
**e S, Girshick R, Dollár P, et al. Aggregated residual transformations for deep neural networks[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2017: 1492-1500.









