A Survey of Image Semantic Segmentation Algorithm Based on Deep Learning

Authors

  • Jian Chen
  • Fen Luo

DOI:

https://doi.org/10.54097/ajst.v5i1.5248

Keywords:

Image semantic segmentation, Deep learning.

Abstract

Image semantic segmentation technology is one of the core research contents in the field of computer vision, and has a wide range of applications in production and life. With the improvement of computer performance and the continuous development of deep learning technology, researchers have increasingly high research enthusiasm for the performance of image semantic segmentation. This paper summarizes the research status of image semantic segmentation based on deep learning and introduces the common datasets used in the field of semantic segmentation. Finally, we point out the existing problems and future development trend of image semantic segmentation algorithms.

Downloads

Download data is not yet available.

References

Long, J., Shelhamer, E., Darrell, T.: ‘Fully convolutional networks for semantic segmentation’, Proceedings of the IEEE conference on computer vision and pattern recognition, 2015, pp. 3431–3440.

Ronneberger, O., Fischer, P., Brox, T.: ; Springer. ‘U-net: Convolutional networks for biomedical image segmentation’, International Conference on Medical image computing and computer-assisted intervention, 2015, pp. 234–241.

Peng, C., Zhang, X., Yu, G., Luo, G., Sun, J.: ‘Large kernel matters–improve semantic segmentation by global convolutional network’, Proceedings of the IEEE conference on computer vision and pattern recognition, 2017, pp. 4353–4361.

Badrinarayanan, V., Kendall, A., Cipolla, R.: ‘Segnet: A deep convolutional encoder-decoder architecture for image segmentation’, IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39, pp. 2481–2495.

Lin, G., Milan, A., Shen, C., Reid, I.: ‘Refinenet: Multi-path refinement networks for high-resolution semantic segmentation’, Proceedings of the IEEE conference on computer vision and pattern recognition, 2017, pp. 1925–1934.

Ghiasi, G., Fowlkes, C.C.: ; Springer. ‘Laplacian pyramid reconstruction and refinement for semantic segmentation’, European conference on computer vision,2016, pp. 519–534.

Yu, C., Wang, J., Peng, C., Gao, C., Yu, G., Sang, N.: ‘Learning a discriminative feature network for semantic segmentation’, Proceedings of the IEEE conference on computer vision and pattern recognition, 2018, pp. 1857–1866.

Wang, J., Sun, K., Cheng, T., Jiang, B., Deng, C., Zhao, Y., et al.: ‘Deep highresolution representation learning for visual recognition’, IEEE transactions on pattern analysis and machine intelligence, 2020, 43, (10), pp. 3349–3364.

16 Zhao, H., Shi, J., Qi, X., Wang, X., Jia, J.: ‘Pyramid scene parsing network’, Proceedings of the IEEE conference on computer vision and pattern recognition, 2017,pp. 2881–2890.

Chen, L.C., Papandreou, G., Kokkinos, I., Murphy, K., Yuille, A.L.: ‘Deeplab:Semantic image segmentation with deep convolutional nets, atrous convolution,and fully connected crfs’, IEEE transactions on pattern analysis and machine intelligence, 2017, 40, (4), pp. 834–848.

Florian, L.C., Adam, S.H.: ‘Rethinking atrous convolution for semantic image segmentation’, Conference on Computer Vision and Pattern Recognition (CVPR) IEEE/CVF, 2017.

Zhao, H., Qi, X., Shen, X., Shi, J., Jia, J.: ‘Icnet for real-time semantic segmentation on high-resolution images’, Proceedings of the European conference on computer vision (ECCV), 2018, pp. 405–420.

Romera, E., Alvarez, J.M., Bergasa, L.M., Arroyo, R.: ‘Erfnet: Efficient residual factorized convnet for real-time semantic segmentation’, IEEE Transactions on Intelligent Transportation Systems, 2017, 19, (1), pp. 263–272.

Sandler, M., Howard, A., Zhu, M., Zhmoginov, A., Chen, L.C.: ‘Mobilenetv2: Inverted residuals and linear bottlenecks’, Proceedings of the IEEE conference on computer vision and pattern recognition, 2018, pp. 4510–4520.

Yu, C., Wang, J., Peng, C., Gao, C., Yu, G., Sang, N.: ‘Bisenet: Bilateral segmentation network for real-time semantic segmentation’, Proceedings of the European c.onference on computer vision (ECCV), 2018, pp. 325–341

Yu, C., Gao, C., Wang, J., Yu, G., Shen, C., Sang, N.: ‘Bisenet v2: Bilateral network with guided aggregation for real-time semantic segmentation’, International Journal of Computer Vision, 2021, 129, (11), pp. 3051–3068.

Cordts, M., Omran, M., Ramos, S., Rehfeld, T., Enzweiler, M., Benenson, R.,et al.: ‘The cityscapes dataset for semantic urban scene understanding’, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016,pp. 3213–3223.

Brostow, G.J., Shotton, J., Fauqueur, J., Cipolla, R.: ‘R.: Segmentation and recognition using structure from motion point clouds’, In: ECCV, 2008.

Caesar, H., Uijlings, J., Ferrari, V.: ‘Coco-stuff: Thing and stuff classes in context’,Proceedings of the IEEE conference on computer vision and pattern recognition,2018, pp. 1209–1218.

Downloads

Published

28-02-2023

Issue

Section

Articles

How to Cite

Chen, J., & Luo, F. (2023). A Survey of Image Semantic Segmentation Algorithm Based on Deep Learning. Academic Journal of Science and Technology, 5(1), 13-14. https://doi.org/10.54097/ajst.v5i1.5248