Semantic Segmentation Based on Improved Robinson Operator
Keywords:Semantic segmentation, Residual structure, Pooling layer, Lightweigth model, Robinson Operator.
Classical pixel-level semantic segmentation methods are easily affected by factors such as environment and pixel values, resulting in low accuracy of their methods. Therefore, in order to improve the accuracy of pixel-level semantic segmentation under practical conditions, this paper designs a lightweight network based on the nonlinear characteristics of residual structure and the characteristics of optimized edge operators. The experimental results show that the designed lightweight residual network can effectively improve the accuracy by about 2% compared with the traditional residual network, and the model size is reduced from the original 40.2MB to 18.9MB.
Liu S, Qi L, Qin H, et al. Path aggregation network for instance segmentation[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2018: 8759-8768.
Li Y, Qi H, Dai J, et al. Fully convolutional instance-aware semantic segmentation[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2017: 2359-2367.
Xu W, Wang H, Qi F, et al. Explicit shape encoding for real-time instance segmentation[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision. 2019: 5168-5177.
Yuan Z W, Zhang J. Feature extraction and image retrieval based on AlexNet[C]//Eighth International Conference on Digital Image Processing (ICDIP 2016). SPIE, 2016, 10033: 65-69.
Iandola F N, Han S, Moskewicz M W, et al. SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and< 0.5 MB model size[J]. arXiv preprint arXiv:1602.07360, 2016.
Szegedy C , Vanhoucke V , Ioffe S , et al. Rethinking the Inception Architecture for Computer Vision[J]. IEEE, 2016:2818-2826.
Howard A G, Zhu M, Chen B, et al. Mobilenets: Efficient convolutional neural networks for mobile vision applications[J]. arXiv preprint arXiv:1704.04861, 2017.
Peng S, Jiang W, Pi H, et al. Deep snake for real-time instance segmentation[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2020: 8533-8542.
Zhao X, Vemulapalli R, Mansfield P A, et al. Contrastive learning for label efficient semantic segmentation[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision. 2021: 10623-10633.
Armato S G , Roberts R Y , Mcnitt-Gray M F , et al. The Lung Image Database Consortium (LIDC) and Image Database Resource Initiative (IDRI): a completed reference database of lung nodules on CT scans.[J]. Academic Radiology, 2007, 14( 12):1455-1463.
Jetley S, Sapienza M, Golodetz S, et al. Straight to shapes: real-time detection of encoded shapes[C]//Proceedings of the IEEE conference on Computer Vision and Pattern Recognition. 2017: 6550-6559.
Ze Yang, Yinghao Xu, Han Xue, Zheng Zhang, Raquel Urtasun, Liwei Wang, Stephen Lin, and Han Hu. Dense reppoints: Representing visual objects with dense point sets. arXiv preprint arXiv:1912.11473, 2019.
Xingyi Zhou, Dequan Wang, and Philipp Kr¨ahenb¨uhl. Objects as points. Arxiv preprint arxiv:1904.07850, 2019.
How to Cite
This work is licensed under a Creative Commons Attribution 4.0 International License.