End Face Detection of Rebar Based on Improved PP YOLO

Authors

  • Kunning Lai
  • Sibo Huang
  • Han Cui
  • Yong Cheng
  • Wei Luo

DOI:

https://doi.org/10.54097/3nyqen08

Keywords:

Deep Learning; Object Detection; PP-YOLO; Rebar.

Abstract

 This paper introduces an improved PP-YOLO network method to enhance the accuracy of rebar end identification and counting in construction engineering. The network is optimized for the specific characteristics of rebar images, enhancing its recognition capabilities for rebars of various sizes and shapes. Notably, the network structure introduces new pathways in the 4th and 5th layers, enabling more effective learning and identification of rebar features from low-level characteristics, thus improving overall recognition and counting accuracy. Additionally, an optimized data augmentation strategy, tailored to the unique features of rebar images, replaces the traditional Mixup method. Specific algorithms are introduced to enhance the network's efficiency in learning rebar image characteristics. These improvements led to excellent performance in rebar end face recognition tests, achieving an average precision (AP) of 96.23%, a 1.07% increase compared to the original model. This significant performance improvement confirms the effectiveness of our proposed improvements in practical applications, offering new perspectives for the development of rebar detection technology in the construction industry.

Downloads

Download data is not yet available.

References

Zhao, W. Q., Kong, Z. X., Zhou, Z. D., et al. (2021). Enhanced feature detection of small targets in aerial remote sensing. Journal of China Image and Graphics, 26(3), 644-653.

Krizhevsky, A., Sutskever, I., & Hinton, G. (2012). ImageNet classification with deep convolutional neural networks. In Advances in Neural Information Processing Systems (pp. 1097-1105). New York, USA.

Girshick, R., Donahue, J., Darrell, T., et al. (2014). Rich feature hierarchies for accurate object detection and semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 580-587). Piscataway, USA.

Girshick, R. B. (2015). Fast R-CNN. In Proceedings of the IEEE International Conference on Computer Vision (pp. 1440-1448). Piscataway, USA.

Ren, S., He, K., Girshiek, R. B., et al. (2017). Faster R-CNN: Towards real-time object detection with region proposal networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39(6), 91-99.

He, K., Gkioxari, G., Dollar, P., et al. (2017). Mask R-CNN. In Proceedings of the IEEE International Conference on Computer Vision (pp. 2961-2969). Piscataway, USA.

Joseph, R., Santosh, D., Ross, G., et al. (2016). You only look once: Unified, real-time object detection. In IEEE Conference on Computer Vision and Pattern Recognition (pp. 779-788). Las Vegas, USA.

Wei, Y. H., & Wen, Y. L. (2021). Ratio-and-scale-aware YOLO for pedestrian detection. IEEE Transactions on Image Processing, 30, 934-947.

Venkat, A. A., Huan, C. C., Chi, C. H., et al. (2021). Applying deep learning to defect detection in printed circuit boards via a newest model of you-only-look-once. Mathematical Biosciences and Engineering, 18(4), 4411-4428.

Lin, T. Y., Dollar, P., Girshick, R., et al. (2017). Feature pyramid networks for object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 2117-2125). Piscataway, USA.

Law, H., & Deng, J. (2019). Cornernet: Detecting objects as paired keypoints [EB/OL]. Available at https://arxiv.org/abs/1808.01244. arXiv preprint arXiv:1808.01244.

Long, X., Deng, K., Wang, G., et al. (2020). PP-YOLO: An effective and efficient implementation of object detector [EB/OL]. Available at https://arxiv.org/abs/2007.12099. arXiv preprint arXiv:2007.12099.

Ma, J., Yao, Z., Xu, C. F., et al. (2021). A real-time tracking algorithm for multiple drones based on improved PP-YOLO and Deep SORT. Computer Applications. doi:10.11772/j.issn.1001-9081.2021071146.

Liu, R., Lehman, J., Molino, P., et al. (2018). An intriguing failing of convolutional neural networks and the coordconv solution [EB/OL]. Available at https://arxiv.org/abs/1807.03247. arXiv preprint arXiv:1807.03247.

Wang, Y., Zhen, P. B., Hou, J. H., et al. (2021). Convolutional neural networks with dynamic regularization. IEEE Transactions on Neural Networks and Learning Systems, 32(5), 2299-2304.

Liu, Z. H., Zhang, Y. D., Chen, Y. Z., et al. (2020). Detection of algorithmically generated domain names using the recurrent convolutional neural network with spatial pyramid pooling. Entropy, 22(9), 1-20.

Yun, S., Han, D., Oh, J. S., et al. (2019). CutMix: Regularization strategy to train strong classifiers with localizable features. In 2019 IEEE/CVF International Conference on Computer Vision (ICCV) (pp. 6022-6031). Piscataway, USA.

Li, K. Q., Chen, Y., Liu, J. C., et al. A review of object detection algorithms based on deep learning. Computer Engineering. doi: 10.19678/j.issn.1000-3428.0062.

Downloads

Published

12-03-2024

Issue

Section

Articles

How to Cite

Lai, K., Huang, S., Cui, H., Cheng, Y., & Luo, W. (2024). End Face Detection of Rebar Based on Improved PP YOLO. Academic Journal of Science and Technology, 9(3), 91-98. https://doi.org/10.54097/3nyqen08