Crowd Counting Based on Context-Aware and Multi Scale Feature Fusion
DOI:
https://doi.org/10.54097/fcis.v2i2.3736Keywords:
Crowd Counting, Deep Learning, Computer Vision, Attention Mechanism, Feature FusionAbstract
Crowd counting plays an important role in public security. Estimating the number of people in an image with congested crowd accurately is a challenging task. The crowd counting method based on fully convolutional network can perform well in crowd image with complex scene. In this paper, to address the counting problems of occlusion,background clutter and perspective effect, we proposed a simple but effective method called Context-aware Multi scale Fusion Network(CMF Net).The CMF Net applied VGG network as backbone to extract coarse features. Then, three context-aware multi-scale fusion modules (CMFM) are adopted. Each CMFM consist of multi-scale feature extraction module (MEM) and context-aware feature extraction module (CEM). In addition, we propose adaptively dense connection to promoted information transmission in the counting network. Experiments on four datasets demonstrate that our network achieves competitive and effective results.
Downloads
References
Dollar P, Wojek C, Schiele B and Perona P 2012 Pedestrian detection: An evaluation of the state of the art IEEE transactions on pattern analysis and machine intelligence 34(4) pp 743–761.
Zhang Y, Zhou D, Chen S, Gao S and Ma Y 2016 Single-image crowd counting via multi-column convolutional neural network. Proceedings of the IEEE conference on computer vision and pattern recognition pp 589–597.
Sindagi A and Patel M 2017 Generating high-quality crowd density maps using contextual pyramid cnns IEEE International Conference on Computer Vision (ICCV) pp 1879–1888.
[4] Li Y, Zhang X and Chen D 2018 Csrnet: Dilated convolutional neural networks for understanding the highly congested scenes Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition pp 1091-1100.
Davide M, Bing S, Rahul R V, Joseph T 2021 Understanding the Impact of Mistakes on Background Regions in Crowd Counting Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2021, pp. 1650-1659.
Karen S and Andrew Z 2014 Very Deep Convolutional Networks for Large-Scale Image Recognition Computer Vision and Pattern Recognition (cs.CV) arXiv:1409.1556.
Weizhe L, Mathieu S and Pascal F 2019 Context-Aware Crowd Counting Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) pp 5099-5108.
Liang-Chieh C, George P, Iasonas K, Kevin M and Alan L Y 2017 DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 40(4): 834-848.
Sanghyun W, Jongchan P, Joon-Young L and In S K 2018 CBAM: Convolutional Block Attention Module Proceedings of the European Conference on Computer Vision (ECCV), pp. 3-19.
Gao H, Zhuang L, Laurens van der M and Kilian Q. W 2017 Densely Connected Convolutional Networks Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) pp. 4700-4708.


