The Application of Semantic Segmentation on 2D images
DOI:
https://doi.org/10.54097/hset.v31i.4818Keywords:
Semantic segmentation; Deep learning; Conditional Random Field; 2D images.Abstract
A fundamental problem in computer vision is semantic segmentation, which calls for the algorithm to categorize each pixel in the picture and provide the precise details of the category. Semantic segmentation is being employed extensively in a variety of applications, including autonomous vehicles and medical imaging. An overview of similar semantic segmentation approaches is given in this study. First, this paper gives a brief overview of the history and vocabulary of semantic segmentation. The key datasets for semantic segmentation, conventional segmentation models, and fundamental deep learning techniques for semantic segmentation will then be covered. In particular, traditional methods centered on context models and deep learning-centered methods are discussed in detail. Finally, we review several assessment techniques, including their benefits and drawbacks, and outline the key issues facing semantic segmentation today. In addition, this study seeks to provide an overview of the relevant literature and the difficulties in semantic segmentation. Finally, the paper summarizes the semantic segmentation and prospects the future.
Downloads
References
Carreira, Joao, and Cristian Sminchisescu. CPMC: Automatic object segmentation using constrained parametric min-cuts. IEEE transactions on pattern analysis and machine intelligence 2011, 34.7: 1312-1328.
Dalal, Navneet, and Bill Triggs. Histograms of oriented gradients for human detection. 2005 IEEE computer society conference on computer vision and pattern recognition (CVPR'05). 2005 1: 129-137.
Everingham, Mark, et al. The pascal visual object classes challenge: A retrospective." International journal of computer vision 2015,111.1: 98-136.
Ros, German, et al. The synthia dataset: A large collection of synthetic images for semantic segmentation of urban scenes. Proceedings of the IEEE conference on computer vision and pattern recognition. 2016 261:731-740.
Cordts, Marius, et al. The cityscapes dataset. CVPR Workshop on the Future of Datasets in Vision. 2015, 2:121-129.
Bell, Sean, et al. Material recognition in the wild with the materials in context database. Proceedings of the IEEE conference on computer vision and pattern recognition. 2015, 33:249-252.
Hariharan, Bharath, et al. Semantic contours from inverse detectors. 2011 international conference on computer vision. 2011, 23:762-771.
Ros, German, et al. Vision-based offline-online perception paradigm for autonomous driving. 2015 IEEE Winter Conference on Applications of Computer Vision. IEEE, 2015,13:53-61.
Zhang, Kunlei, Jun Deng, and Wenmiao Lu. Segmenting human knee cartilage automatically from multi-contrast MR images using support vector machines and discriminative random fields. 2011 18th IEEE International Conference on Image Processing. 2011, 392:14-23.
Kirkpatrick, Scott, C. Daniel Gelatt Jr, and Mario P. Vecchi. Optimization by simulated annealing. science 1983, 220.4598: 671-680.
Comaniciu, Dorin, and Peter Meer. Mean shift: A robust approach toward feature space analysis. IEEE Transactions on pattern analysis and machine intelligence 2002, 24.5: 603-619.
Levinshtein, Alex, et al. Turbopixels: Fast superpixels using geometric flows. IEEE transactions on pattern analysis and machine intelligence 2009, 31.12: 2290-2297.
Achanta, Radhakrishna, et al. "SLIC superpixels compared to state-of-the-art superpixel methods." IEEE transactions on pattern analysis and machine intelligence 2012: 2274-2282.
Noh, Hyeonwoo, Seunghoon Hong, and Bohyung Han. Learning deconvolution network for semantic segmentation. Proceedings of the IEEE international conference on computer vision. 2015, 26.5: 512-519.
Pinheiro, Pedro, and Ronan Collobert. Recurrent convolutional neural networks for scene labeling. International conference on machine learning. 2014, 244:529-535.
Luo, Yawei, et al. Macro-micro adversarial network for human parsing. Proceedings of the European conference on computer vision. 2018,121:1-11.
Downloads
Published
Issue
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







