Research on Image Classification And Semantic Segmentation Model Based on Convolutional Neural Network

Authors

  • Muqing Li
  • Ziyi Zhu
  • Ruilin Xu
  • Yinqiu Feng
  • Lingxi Xiao

DOI:

https://doi.org/10.54097/qg7hakzu

Keywords:

Image Classification, Semantic Segmentation, Convolutional neural network

Abstract

This paper investigates convolutional neural network (CNN)-based approaches for image classification and semantic segmentation, with a focus on addressing spatial detail loss and multi-scale feature integration issues prevalent in semantic segmentation. The introduced EDNET model tackles these challenges through the incorporation of spatial information branches and the design of efficient feature fusion mechanisms. It further enhances performance via the use of global pooling and boundary refinement modules. Evaluations on the PASCAL VOC 2012 dataset reveal an 11.67% increase in mean intersection-over-union (IoU) compared to standard fully convolutional networks, demonstrating substantial improvement over comparable techniques. These results confirm the efficacy and practicality of the EDNET framework.

References

Zhu, A., Li, J., & Lu, C. (2021). Pseudo view representation learning for monocular RGB-D human pose and shape estimation. IEEE Signal Processing Letters, 29, 712-716.

Lan, G., Liu, X. Y., Zhang, Y., & Wang, X. (2023). Communication-efficient federated learning for resource-constrained edge devices. IEEE Transactions on Machine Learning in Communications and Networking.

Zi, Y., Wang, Q., Gao, Z., Cheng, X., & Mei, T. (2024). Research on the Application of Deep Learning in Medical Image Segmentation and 3D Reconstruction. Academic Journal of Science and Technology, 10(2), 8-12.

Li, K., Zhu, A., Zhou, W., Zhao, P., Song, J., & Liu, J. (2024). Utilizing Deep Learning to Optimize Software Development Processes. arXiv preprint arXiv:2404.13630."

Lan, G., Wang, H., Anderson, J., Brinton, C., & Aggarwal, V. (2024). Improved Communication Efficiency in Federated Natural Policy Gradient via ADMM-based Gradient Updates. Advances in Neural Information Processing Systems, 36.

Wang, X. S., Turner, J. D., & Mann, B. P. (2021). Constrained attractor selection using deep reinforcement learning. Journal of Vibration and Control, 27(5-6), 502-514.

Li, K., Zhu, A., Zhou, W., Zhao, P., Song, J., & Liu, J. (2024). Utilizing Deep Learning to Optimize Software Development Processes. arXiv preprint arXiv:2404.13630.

Xiao, M., Li, Y., Yan, X., Gao, M., & Wang, W. (2024). Convolutional neural network classification of cancer cytopathology images: taking breast cancer as an example. doi:10.48550/ARXIV.2404.08279

Zhu, A., Li, K., Wu, T., Zhao, P., Zhou, W., & Hong, B. (2024). Cross-Task Multi-Branch Vision Transformer for Facial Expression and Mask Wearing Classification. arXiv preprint arXiv:2404.14606.

Ning, Q., Zheng, W., Xu, H., Zhu, A., Li, T., Cheng, Y., ... & Wang, K. (2022). Rapid segmentation and sensitive analysis of CRP with paper-based microfluidic device using machine learning. Analytical and Bioanalytical Chemistry, 414(13), 3959-3970.

Kim, M., Lee, H., & Cho, S. Attention-Guided Dual-Task Learning for Simultaneous Image Classification and Semantic Segmentation[J]. Computer Vision and Image Understanding, Vol. 214, Article 103041, February 2023.

Misra D, Nalamada T, Arasanipalai A U, et al. Rotate to attend: Convolutional triplet attention module[C]//Proceedings IEEE Winter Conference on Applications of Computer Vision, WACV 2021: 3139-3148.

Dai, W., Tao, J., Yan, X., Feng, Z., & Chen, J. (2023, November). Addressing Unintended Bias in Toxicity Detection: An LSTM and Attention-Based Approach. In 2023 5th International Conference on Artificial Intelligence and Computer Applications (ICAICA) (pp. 375-379). IEEE.

Xin Chen, Yuxiang Hu, Ting Xu, Haowei Yang, Tong Wu. (2024). Advancements in AI for Oncology: Developing an Enhanced YOLOv5-based Cancer Cell Detection System. International Journal of Innovative Research in Computer Science and Technology (IJIRCST), 12(2),75-80, doi:10.55524/ijircst.2024.12.2.13.

Yulu Gong, Haoxin Zhang, Ruilin Xu, Zhou Yu, Jingbo Zhang. (2024). Innovative Deep Learning Methods for Precancerous Lesion Detection. International Journal of Innovative Research in Computer Science and Technology (IJIRCST), 12(2),81-86, doi:10.55524/ijircst.2024.12.2.14.

Yan, C., Qiu, Y., Zhu, Y. (2021). Predict Oil Production with LSTM Neural Network. In: Liu, Q., Liu, X., Li, L., Zhou, H., Zhao, HH. (eds) Proceedings of the 9th International Conference on Computer Engineering and Networks. Advances in Intelligent Systems and Computing, vol 1143. Springer, Singapore. https://doi.org/10.1007/978-981-15-3753-0_34.

Wang, X. S., & Mann, B. P. (2020). Attractor Selection in Nonlinear Energy Harvesting Using Deep Reinforcement Learning. arXiv preprint arXiv:2010.01255.

C. Yan, "Predict Lightning Location and Movement with Atmospherical Electrical Field Instrument," 2019 IEEE 10th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON), Vancouver, BC, Canada, 2019, pp. 0535-0537, doi: 10.1109/IEMCON.2019.8936293.

Yan, X., Wang, W., Xiao, M., Li, Y., & Gao, M. (2024). Survival prediction across diverse cancer types using neural networks. doi:10.48550/ARXIV.2404.08713

Long J, Shelhamer E, Darrell T. Fully convolutional networks for semantic segmentation[C]. 2015 Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Boston, USA: IEEE Press, 2015: 3431-3440.

Badrinarayanan V, Kendall A, Cipolla R. Segnet: A deep convolutional encoder-decoder architecture for image segmentation [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(12): 2481-2495.

Ronneberger O, Fischer P, Brox T. U-net: Convolutional Networks for Biomedical Image Segmentation[C]. International Conference on Medical Image Computing and Computer-assisted Intervention, 2015: 234-241.

Chen L C, Papandreou G, Kokkinos I, et al. Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs[J]. arXiv preprint arXiv:1412.7062, 2014.

Lan, G., Han, D. J., Hashemi, A., Aggarwal, V., & Brinton, C. G. (2024). Asynchronous Federated Reinforcement Learning with Policy Gradient Updates: Algorithm Design and Convergence Analysis. arXiv preprint arXiv:2404.08003.

Zhao H, Shi J, Qi X, et al. Pyramid Scene Parsing Network[C]. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 2017: 2881-2890.

Papadomanolaki M, Vakalopoulou M,Karantzalos K.A Novel Object-Based Deep Learning Framework for Semantic Segmentation of Very High-Resolution Remote Sensing Data: Comparison with Convolutional and Fully Convolutional Networks [J]. Remote Sensing,2019,11(6).

Pereira M B,Santos J A D.An End-to-end Framework For Low-Resolution Remote SensingSemantic Segmentation[J].2020,ar Xiv/abs/2003.07955.

He K M, Zhang X Y, Ren S Q, et al. Deep Residual Learning for Image Recognition[C]. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016: 770- 778.

Li, Y., Yan, X., Xiao, M., Wang, W., & Zhang, F. (2024). Investigation of Creating Accessibility Linked Data Based on Publicly Available Accessibility Datasets. In Proceedings of the 2023 13th International Conference on Communication and Network Security (pp. 77–81). Association for Computing Machinery.

Vicente, S., Carreira, J., Agapito, L., & Batista, J. (2014). Reconstructing pascal voc. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 41-48).

Zhang Z L, Zhang X Y, Peng C, et al. Exfuse: Boosting Semantic Segmentation via Enhanced Feature Integration[C]. Proceedings of European Conference on Computer Vision. Berlin, Germany: Springer-Verlag, 2018: 273-288.

Shah, A., Kadam, E., Shah, H., Shinde, S., & Shingade, S. (2016, September). Deep residual networks with exponential linear unit. In Proceedings of the third international symposium on computer vision and the internet (pp. 59-65).

Ma Z H, Gao H J, Lei T. Algorithm for Semantic Segmentation employing an Augmented Feature Fusion Decoder[J]. Computer Engineering, 2020, 46(5): 254-258.

Downloads

Published

30-04-2024

Issue

Section

Articles

How to Cite

Li, M., Zhu, Z., Xu, R., Feng, Y., & Xiao, L. (2024). Research on Image Classification And Semantic Segmentation Model Based on Convolutional Neural Network. Journal of Computing and Electronic Information Management, 12(3), 94-100. https://doi.org/10.54097/qg7hakzu