Facial Expression Recognition Based on Pruning Optimization Technology
DOI:
https://doi.org/10.54097/hset.v41i.6784Keywords:
Network Sliming; image recognition; VGGNet; ResNet; SENet; CBAM.Abstract
Facial expression recognition is based on deep learning, which has become one of the main topics in artificial intelligence. Nevertheless, the complex network structure causes numerous parameter redundancy, which leads to low recognition efficiency. Therefore, researching how to reduce parameter redundancy makes sense. In addition, today's machine expression recognition has some limitations and constraints, while the generalization ability is poor. Aiming at addressing the problems that excessive parameter redundancy brings, for example, low efficiency and massive limitations. This paper will put forward a facial expression recognition technology that is based on pruning optimization to improve existing models. Based on VGGNet, ResNet and SENet, these three convolutional neural networks modify the convolution layer of the network, then add the attention module and pruning optimization. Through face detection, face area feature representation, expression recognition, and other steps. Realize the recognition of multiple facial expressions. The CK+ dataset and JAFFE dataset are independently divided into 4/5 for training and 1/5 for testing. Experimental results show that after inserting CBAM, except ResNet18, the accuracy of the other two networks all improved, SENet18 improved the most significantly. After pruning, the accuracy and speed of VGG16 and ResNet18 improved, but the accuracy of SENet18 decreased. In summary, the optimized model has good applicability and generalization ability.
Downloads
References
P. Lucey, J. F. Cohn, T. Kanade, J. Saragih, Z. Ambadar, and I. Matthews, “The Extended Cohn-Kanade Dataset (CK+)_ A complete dataset for action unit and emotion- specified expression,” in Computer Vision and Pattern Recognition Workshops (CVPRW), 2010 IEEE Computer Society Conference on, 2010, pp. 94-101.
LYONS M J, AKAMATSU s, KAMACHI M G, et al. Coding facial expressions with Gabor wavelets [ C]//Third IEEE International Conference on Automatic Face and Gesture Recognition, 1998:200-205.
Ian Goodfellow, Yoshua Bengio, Aaron Courville Deep Learning [M] Boston: MIT Press http://www.deeplearningbook.org 2016.
Krizhevsky A, Sutskever I, Hinton G. lmageNet classification with deep convolutional neural networks [C]//Proceedings of the 25th International Conference on Neural Information Processing Systems, 2012:1097 - 1105.
A. Zisserman, 2014 “Very Deep Convolutional Networks for Large-Scale Image Recognition” [Submitted on 4 Sep 2014 (v1), last revised 10 Apr 2015 (this version, v6)]
Szegedy C, Liu W, Jia Y Q, et al. Going deeper with convolutions [C]//Proceedings of IEEE Conference on ComputerVision and Pattern Recognition. Piscataway, NJ: IEEE Press, 2015: 1-9.
He K, Zhang X, Ren S, et al. Deep residual learning for image recognition[c]//Proceeding of the IEEE Conference on Computer Vision and Pattern Recognition,2016:770-778.
LIN Jing-Dong, WU Xin-Yi, CHAI Yi, et al. Structure Optimization of Convolutional Neural Networks: A Survey. ACTA AUTOMATICA SINICA, 2020, 46(1): 24-37. doi: 10.16383/j.aas.c180275.
LI Jiang-Yun, ZHAO Yi-Kai, XUE Zhuo-Er, et al.. A survey of model compression for deep neural networks [J]. Chinese Journal of Engineering, 2019, 41(10): 1229-1239.doi: 10.13374/j.issn2095-9389.2019.03.27.002.
WANG Hong-xia, ZHOU Jia-qi, GU Cheng-hao, LIN Hong, Design of activation function in CNN for image classification [J] Journal of Zhejiang University (Engineering Science), 2019, 53(7): 1363-1373.
ZHOU lie, MA Ming-dong (School of Telecommunications & Information Engineering. Nanjing University of Posts and Telecommunications.Nanjing210003.ChinaSchool of Geographical and Biological Information, Nanjing University of Posts and Telecommunications. Nanjing 210003. China)
Hu J, Shen L, SunG. Squeeze-and-excitation networks [C]. IEEE/CVF Conference on ComputerVision and Pattern Recognition. Salt Lake City, 2018:7132-7141.
Zhuang. L, Jian. G. L, Zhi. Q. S, Gao. H, Shou. M. Y, Chang. S. Z, 2017, “Learning Efficient Networks through Network Slimming” IEEE 22-29 DOI: 10.1109/ICCV.2017.298
WOO, PARK J, LEE J Y, et al. CBAM: convolutional block attention module [C]//European Conference on Computer Vision, 2018: 3-19.
Downloads
Published
Issue
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







