Expression Recognition Based on Multi-level Multi-model Fusion Deep Convolutional Neural Network
DOI:
https://doi.org/10.54097/hset.v34i.5477Keywords:
Image expression recognition, Convolutional neural network, Multi-level, Multiple model.Abstract
In the research of Facial Expression Recognition (FER), training networks using deep learning methods often relies on data sets with accurate label and balanced number of each classification. However, when training on low-quality data sets, the performance of traditional models is insufficient. In order to solve the problem, the paper proposes an expression recognition method based on Multi-level Multi-model Fusion Deep Convolutional Neural Network. The Deep Convolutional Neural Network is composed of three models, ResNet-50, ResNet-101 and InceptionNet-V3, which have similar performance on ImageNet. We use the weighted average fusion algorithm as our fusion method. According to the accuracy of the three models, weighted fusion is carried out, and the model with high accuracy is given higher weight. It increases the width of the network and the number of parameters. At the same time, it can solve the problem of accuracy degradation when the number of layers in the deep network increases. We choose FER2013, a widely recognized low-quality data set, for the experiment. The experimental results based on FER2013 data set show that the facial expression recognition accuracy of this method reach 71.78%, which is significantly better than ResNet-50, ResNet-101 and InceptionNet-V3. Compared with other models, it also verifies the effectiveness of the proposed model in identifying low-quality data sets.
Downloads
References
Ekman P, Friesen W V. Facial action coding system: a technique for the measurement of facial movement [J]. Rivista Di Psichiatria, 1978, 47(2):126-38.
Ekman P et al., Facial Action Coding System, Consulting Psychologists Press,1978
Kaiming H, et al. Deep Residual Learning for Image Recognition [R], arXiv: Computer Vision and Pattern Recognition. 2015
Christian S et al. Going Deeper with Convolutions [R], arXiv: Computer Vision and Pattern Recognition. 2014
Christian S et al. Rethinking the Inception Architecture for Computer Vision [R]. CoRR, 2015, ABS /1512.00567
Wang W et al. Image sentiment classification based on multi-level deep convolutional neural network [J]. Journal of south China university of technology (natural science edition), 2019,47(06):39-50.
Huan R and Pan Y. Decision fusion strategies for SAR image target recognition [J]. IET radar, sonar & navigation, 2011, 5(7): 747-755.
Christian S et al. Rethinking the Inception Architecture for Computer Vision [C], computer vision and pattern recognition. 2016
Gan Y, Facial Expression Recognition Using Convolutional Neural Network[C]// the 2nd International Conference. 2018.
Tan C, Facial expression recognition based on improved alexnet convolutional neural network (in Chinese) [J], Telecommunication technology, 2020, 60 (9): 8
Downloads
Published
Issue
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







