Comparison of different Convolutional Neural Network models on Fruit 360 Dataset
DOI:
https://doi.org/10.54097/hset.v34i.5385Keywords:
Convolutional Neural Network, VGG-16, ResNet-50, MobileNet, Image Classification.Abstract
Numerous Convolutional Neural Networks emerged in the past decade, each varies in accuracy, speed, and architecture. From AlexNet to ResNet, CNN models have been developing rapidly, and the architecture of the models become more complicated. These models are known for their accuracy on ImageNet, so the topic of this research is to explore how CNN models can perform differently on the Fruit 360 dataset. A model constructed specifically in this research and three significant models developed in the past decade are applied to the Fruit 360 dataset for result comparison: VGG-16, ResNet-50, MobileNet, and SC-3. The models are selected to be different in architecture for comparison. The models are modified in similar ways and are trained on the same device under the same circumstances. The research found that ResNet-50 obtains the best prediction accuracy. MobileNet takes the least time to train and predict. The ResNet-50 and MobileNet models can be considered for real-world application, whereas VGG-16 and SC-3 are either inefficient or fails to generalize to complicated situations.
Downloads
References
Yann Lecun, Leon Bottou, Yoshua Bengio and Patrick Haffner. Gradient-based learning applied to document recognition. Proceedings of the IEEE, vol. 86, no. 11, pp. 2278-2324, Nov. 1998, doi: 10.1109/5.726791.
Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li and Li Fei-Fei. ImageNet: A large-scale hierarchical image database. 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009, pp. 248-255, doi: 10.1109/CVPR.2009.5206848.
Olga Russakovsky, Jia Deng, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael Bernstein, Alexander C. Berg and Li Fei-Fei. ImageNet Large Scale Visual Recognition Challenge. International Journal of Computer Vision (IJCV), vol. 115, no. 3, pp. 211–252, 2015, doi: 10.1007/s11263-015-0816-y.
Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. ImageNet Classification with Deep Convolutional Neural Networks. Advances in Neural Information Processing Systems, 2012, vol. 25. [Online].
Karen Simonyan and Andrew Zisserman. Very deep convolutional networks for large-scale image recognition. arXiv, 2014. doi: 10.48550/ARXIV.1409.1556.
Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, and Andrew Rabinovich. Going deeper with convolutions. 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015, pp. 1-9, doi: 10.1109/CVPR.2015.7298594.
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Deep Residual Learning for Image Recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016, pp. 770-778, doi: 10.1109/CVPR.2016.90.
Andrew G. Howard, Menglong Zhu, Bo Chen, Dmitry Kalenichenko, Weijun Wang, Tobias Weyand, Marco Andreetto, Hartwig Adam. MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications. arXiv, 2017. doi: 10.48550/ARXIV.1704.04861.
Kaggle. 2017. Fruit 360. https://www.kaggle.com/datasets/moltean/fruits.
Horea Muresan, Mihai Oltean. Fruit recognition from images using deep learning, Acta Univ. Sapientiae, Informatica Vol. 10, Issue 1, pp. 26-42, 2018.
Diederik P. Kingma and Jimmy Ba. Adam: A Method for Stochastic Optimization. arXiv, 2014. doi: 10.48550/ARXIV.1412.6980.
F. Chollet and others, Keras. 2015. Keras. https://keras.io.
Max Ferguson, Ronay Ak, Yung-Tsun Tina Lee, and Kincho H. Law. Automatic localization of casting defects with convolutional neural networks. 2017 IEEE International Conference on Big Data (Big Data), 2017, pp. 1726–1735. doi: 10.1109/BigData.2017.8258115.
Laurent SIfre and Stéphane Mallat. Rigid-Motion Scattering for Texture Classification. arXiv, 2014. doi: 10.48550/ARXIV.1403.1687.
Downloads
Published
Issue
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







