Sichuan Cuisine Recognition Method based on Residual Neural Network

Authors

  • Weifu Li
  • Yichong Cai
  • Jinyu Huang

DOI:

https://doi.org/10.54097/fcis.v5i2.13148

Keywords:

Residual Neural Network, Attention Mechanism, Image Classification, Sichuan Cuisine Recognition

Abstract

To address issues such as the high number of parameters, significant variations among images of similar dishes, weak geometric invariance, and low recognition rates in Sichuan cuisine recognition methods, a lightweight Sichuan cuisine recognition model, RGBNet, based on residual neural network, is proposed. The model employs dilated convolutions to increase the receptive field of convolutional kernels while maintaining a consistent parameter count, thus obtaining more shallow-level features. An RGB module is constructed using asymmetric convolutions to enhance the model's geometric invariance, feature non-linear expression, and feature extraction capabilities. Finally, the DFC long-range attention mechanism is introduced to effectively capture long-range information, thereby improving adaptive learning capabilities. To validate the model's performance, the classic ChineseFoodNet benchmark dataset is utilized. A MiniChineseFood dataset is created by extracting 30 classes totaling 20,000 images for experimentation. The recognition accuracy is measured using the top1 method of image recognition performance, achieving a final image recognition accuracy of 96.62%. Compared to models such as EfficientNet, ShuffNet, FasterNet, and MobileNetV2, RGBNet demonstrates respective accuracy improvements of 16.57%, 18.52%, 17.12%, and 16.35%. This presents a novel approach for industrial food recognition.

Downloads

Download data is not yet available.

References

Min Weiqing, Liu Linhu, Liu Yuxin, et al. A Survey of Food Image Recognition Methods[J]. Journal of Computer Science. Vol.45 (2008) No. 03, p.542-566.

Wang Hang. Application of RFID Technology in University Cafeteria Practice[J]. China's Strategic Emerging Industries Vol. 172 (2008) No. 40, p. 256.

Chen Mei, Kapil Dhingra, Wu Wen, Yang Lei, et al. PFID: Pittsburgh fast-food image dataset[C]// Proceedings of the International Conference on Image Processing. Cairo, Egypt, 2009: 289-292.

Martinel N, Foresti G L, Micheloni C. Wide-slice residual networks for food recognition[C]// Proceedings of the International Conference on Applications of Computer Vision. Lake Tahoe, USA,2018:567-576.

Austin Meyers, Nick Johnston, Vivek Rathod, et al. Im2Calories: towards an automated mobile vision food diary [C]// Proceedings of the IEEE International Conference on Computer Vision. Santiago, Chile,2015:1233–124.

Deng L, Chen J., Sun Q, He, X. et al. Mixed-dish recognition with contextual relation networks[C]//Proceedings of the 27th ACM International Conference on Multimedia,2019:112-120.

Xiao G, Wu Q, Chen H, et al. A deep transfer learning solution for automating food material procurement using electronic scales[J]. IEEE Transactions on Industrial Informatics, 2019, 16 (4):2290-2300.

Lo F P W, Sun Y, Qiu J, et al. Image-based food classification and volume estimation for dietary assessment: A review[J]. IEEE Journal of Biomedical and Health Informatics, 2020, 24(7):1926-1939. DOI:10.1109/JBHI.2020.2987943.

Wang Haiyan, Zhang Miao, Liu Hulin, et al. Chinese Cuisine Image Recognition Method Based on Improved ResNet Network[J]. Journal of Shanxi University of Science and Technology. Vol. 40 (2022) No. 01, p. 154-160.

Deng Zhiliang, Li Lei. Chinese Dish Recognition Model Based on Improved Residual Networks[J]. Progress in Laser and Optoelectronics. Vol. 58 (2021) No. 06, p. 264-272.

Wu, Zhengdong. Research on Image Classification Algorithm of Chinese Cuisine (Master, University of Electronic Science and Technology of China, Chengdu, China, 2020). p. 25-33.

Liao Enhong, Li Huifang, Wang Hua, et al. Food Image Recognition Based on Convolutional Neural Networks. Journal of South China Normal University (Natural Science Edition). Vol. 51 (2019) No. 4, p. 113-119.

Tan M, Le Q. Efficientnet: Rethinking model scaling for convolutional neural networks[C]//International conference on machine learning. PMLR ,2019:6105-6114.

Zhang X, Zhou X, Lin M, et al. Shufflenet: An extremely efficient convolutional neural network for mobile devices [C]// Proceedings of the IEEE conference on computer vision and pattern recognition.2018:6848-6856.

Chen J, Kao S, He H, et al. Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks[J]. arXiv preprint arXiv: 2303. 03667, 2023.

Sandler M, Howard A, Zhu M, et al. Mobilenetv2: Inverted residuals and linear bottlenecks[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2018: 4510-4520.

Gai Rongli, Cai Jianrong, Wang Shiyu, et al. A Comprehensive Review on the Application of Convolutional Neural Networks in Image Recognition[J]. Small & Miniaturized Computer Systems. Vol. 42 (2021) No. 9, p. 1980-1984.

Lin Jingdong, Wu Xinyi, Chai Yi, et al. Overview of Optimizing Convolutional Neural Network Structures. Acta Automatica Sinica. 2020, 46(1): 24-37.

Wu Haohao, Wang Fangshi. Application of Multi-Scale Dilated Convolution in Image Classification. Computer Science. 2020, 47(6A): 166-171.

Li Y, Zhang X, Chen D. Csrnet: Dilated convolutional neural networks for understanding the highly congested scenes[C]// Proceedings of the IEEE conference on computer vision and pattern recognition.2018:1091-1100.

Ding X, Guo Y, Ding G, et al. Acnet: Strengthening the kernel skeletons for powerful cnn via asymmetric convolution blocks [C]// Proceedings of the IEEE/CVF international conference on computer vision.2019:1911-1920.

Tang Y, Han K, Guo J, et al. GhostNetV2: Enhance Cheap Operation with Long-Range Attention[J].arXiv preprint arXiv: 2211. 12905, 2022.

Han K, Wang Y, Tian Q, et al. Ghostnet: More features from cheap operations [C]// Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2020: 1580-1589.

Chen X, Zhu Y, Zhou H, et al. Chinesefoodnet: A large-scale image dataset for Chinese food recognition[J].arXiv preprint arXiv: 1705.02743, 2017.

Cubuk E D, Zoph B, Shlens J, et al. Randaugment: Practical automated data augmentation with a reduced search space[C]// Proceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops.2020:702-703.

Zhong Z, Zheng L, Kang G, et al. Random erasing data augmentation[C]//Proceedings of the AAAI conference on artificial intelligence.2020,34(07):13001-13008.

Downloads

Published

01-09-2023

Issue

Section

Articles

How to Cite

Li, W., Cai, Y., & Huang, J. (2023). Sichuan Cuisine Recognition Method based on Residual Neural Network. Frontiers in Computing and Intelligent Systems, 5(2), 148-153. https://doi.org/10.54097/fcis.v5i2.13148