Investigations on Convolutional Neural Network Based on Image Recognition
DOI:
https://doi.org/10.54097/g1pbt481Keywords:
LeNet-5, AlexNet, VGG-16, GoogleNet, Classification of Image.Abstract
Many convolutional neural networks have emerged over the years, varying in accuracy, speed, and architecture. From LeNet-5 to GoogleNet, the CNN model developed rapidly, and the architecture of the model became more complex. These models are known for their accuracy on ImageNet. Hence, the theme of this study is to explore and study classical CNN models, because these models are similar in basic structure, but they modify the model in different ways. In addition, this paper selects four classical CNN models for learning and implementation, uses FashionMNIST datasets on four models, and records the performance differences of image classification of specific models, with the intention of studying the comparison of results among different models. The four models are LeNet-5, AlexNet, VGG-16, and GoogleNet. The training was carried out on the same equipment and under the same conditions. The study finds that GoogleNet achieves the best prediction accuracy. LeNet-5 spends the least amount of time on training and forecasting. The GoogleNet and AlexNet models can be considered for practical applications, while the VGG-16 and LeNet-6 are either inefficient due to the long training time of the models, or the accuracy cannot.
Downloads
References
[1] Yann Lecun, Leon Bottou, Yoshua Bengio and Patrick Haffner. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 1998, 86(11): 2278-2324.
[2] Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li and Li Fei-Fei. ImageNet: A large-scale hierarchical image database. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2009: 248-255.
[3] Alex Krizhevsky, Ilya Sutskever and Geoffrey E Hinton. ImageNet Classification with Deep Convolutional Neural Networks. Proceedings of the Advances in Neural Information Processing Systems, 2012(25).
[4] Karen Simonyan and Andrew Zisserman. Very Deep Convolutional Networks for Large-scale Image Recognition. arXiv preprint arXiv:1409.1556, 2014.
[5] Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke and Andrew Rabinovich. Going deeper with convolutions. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015: 1-9.
[6] Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Deep Residual Learning for Image Recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016: 770-778.
[7] Han Xiao. Fashion-MNIST. Github, https://github.com/zalandoresearch/fashion-mnist, 2017.
[8] Han Xiao, Kashif Rasul and Roland Vollgraf. Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms. arXiv preprint arXiv:1708.07747, 2017.
[9] Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, Alban Desmaison, Andreas Köpf, Edward Yang, Zach DeVito, Martin Raison, Alykhan Tejani, Sasank Chilamkurthy, Benoit Steiner, Lu Fang, Junjie Bai and Soumith Chintala. PyTorch: An Imperative Style, High-Performance Deep Learning Library. Advances in neural information processing systems, 2019, 32.
[10] Max Ferguson, Ronay Ak, Yung-Tsun Tina Lee and Kincho H. Law. Automatic localization of casting defects with convolutional neural networks. Proceedings of the IEEE International Conference on Big Data (Big Data), 2017: 1726-1735.
Downloads
Published
Issue
Section
License
Copyright (c) 2024 Highlights in Science, Engineering and Technology

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







