Semantic segmentation of pavement cracks based on an improved U-Net
DOI:
https://doi.org/10.54097/jceim.v10i3.8672Keywords:
U-Net, Semantic segmentation, ECA, FCNheadAbstract
Cracks are one of the types of pavement defects that can affect the safety and quality of roads, so identifying such defects is an important part of road maintenance. In this paper, based on the U-Net coding-decoding structure, an ECA channel attention module is added to the coding stage, thus improving feature extraction without introducing too many extra parameters and computational effort. The insertion of the FCNhead decoding dock in the decoding stage can improve the performance of the model on semantic segmentation tasks while maintaining its efficiency and interpretability, thus better meeting the needs of practical applications. Experimental results on the CFD dataset and the crack500 dataset show that the algorithm improves the accuracy of crack detection and has better robustness.
References
W. Cao, Q. Liu and Z. He, "Review of Pavement Defect Detection Methods," in IEEE Access, vol. 8, pp. 14531-14544, 2020, doi: 10.1109/ACCESS.2020.2966881.
Szegedy C, Liu W, Jia Y, et al. Going deeper with convolutions[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2015: 1-9.
He K, Zhang X, Ren S, et al. Deep residual learning [J]. Image Recognition, 2015, 7.
Zhai Junzhi, Sun Chaoyun, Pei Lili et al. A multi-scale feature-enhanced pavement crack detection method[J]. Journal of Transportation Engineering, 2023, 23(01): 291-308. DOI: 10.19818 / j.cnki.1671-1637.2023.01.022.
Wu Xiangdong, Zhao Health, Liu Legend. Crack detection algorithm for bridges based on CNN and CRF [J]. Computer Engineering and Design, 2021, (42): 51-56.
Dung C V. Autonomous concrete crack detection using deep fully convolutional neural network [J]. Automation in Construction, 2019, 99: 52-58.
Mei Q, Gül M, Azim M R. Densely connected deep neural network considering connectivity of pixels for automatic crack detection [J]. Automation in Construction, 2020, 110: 103018.
Lu Yinju, Li Zuzhao, Dai Shuguang. A pavement crack image segmentation algorithm incorporating high-order multiscale features [J]. Small Microcomputer Systems, 2022, 43(06): 1197-1203.
Berjaro. Research on end-to-end road crack detection technology based on deep learning [D]. Wuhan: Huazhong University of Science and Technology, 2019.
Yang F, Zhang L, Yu S, et al. Feature pyramid and hierarchical boosting network for pavement crack detection [J]. IEEE Transactions on Intelligent Transportation Systems, 2019, 21(4): 1525-1535.
Shi Y, Cui L, Qi Z, et al. Automatic road crack detection using random structured forests[J]. IEEE Transactions on Intelligent Transportation Systems, 2016, 17(12): 3434-3445.
Wu Qiuyi. Pavement crack identification based on semantic segmentation network model[J]. Transportation Science Technology,2020, No.300(03):80-83+109.
Ronneberger O, Fischer P, Brox T. U-net: Convolutional networks for biomedical image segmentation [C]//Medical Image Computing and Computer- Assisted Intervention-MICCAI 2015: 18th International Conference, Munich, Germany, October 5-9, 2015, Proceedings, Part III 18. Springer International Publishing, 2015: 234-241.
Chen L C, Papandreou G, Kokkinos I, et al. Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs[J]. IEEE transactions on pattern analysis and machine intelligence, 2017, 40(4): 834-848.
Chen L C, Papandreou G, Schroff F, et al. Rethinking atrous convolution for semantic image segmentation [J]. arXiv preprint arXiv:1706.05587, 2017.
Zhao H, Shi J, Qi X, et al. Pyramid scene parsing network [C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2017. 2881-2890.
Long J, Shelhamer E, Darrell T. Fully convolutional networks for semantic segmentation [C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2015: 3431-3440.
Wang Q, Wu B, Zhu P, et al. ECA-Net: Efficient channel attention for deep convolutional neural networks[C] // Proceedings of the IEEE / CVF conference on computer vision and pattern recognition. 2020: 11534-11542.
Chen L C, Zhu Y, Papandreou G, et al. Encoder-decoder with atrous separable convolution for semantic image segmentation [C]//Proceedings of the European conference on computer vision (ECCV). 2018: 801-818.