Artificial Neural Network Models for Image Recognition
DOI:
https://doi.org/10.54097/hset.v62i.10431Keywords:
Convolutional neural network, deep learning, computer vision.Abstract
Image recognition is an important application of artificial intelligence. Due to the continuous development of chips and algorithms, this field has made great progress in the past decade. At the same time, AI based image recognition has been widely used in the emerging fields like robotics, autonomous vehicles, and surveillance cameras, and their demand for AI has promoted the development of image recognition technology. Resent research has found that the convolutional neural network model is particularly effective for image classification and detection and smaller convolution kernels with deeper network structures are conducive to improving the accuracy. However, problems such as overfitting and activation function gradient descent need to be solved during the operation process. The latest convolutional neural network model ResNet applies residual units to reduce the redundant calculations and improve the efficiency of the model. In general, different variants of convolutional neural network structures have different effects on image recognition, but regional convolutional neural network structures are preferred in engineering applications for its balance between processing speed and recognition accuracy.
Downloads
References
Thomas Hofmann, Bernhard Schölkopf, Alexander J. Smola. Kernel methods in machine learning, 2008: 1171-1220.
Zhichen Wang, Hengyi Li, Xuebin Yue, et al. Briefly Analysis about CNN Accelerator based on FPGA. Procedia Computer Science, 2022, 202: 277-282.
Thomas Hofmann, Bernhard Schölkopf, Alexander J. Smola. Kernel methods in machine learning, 2008: 1171-1220.
Yann LeCun, Léon Bottou, Yoshua Bengio, Patrick Haffner. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 1998, 86(11): 2278-2324.
Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton. ImageNet classification with deep convolutional neural networks. Communications of the ACM, 2017, 60(6): 84-90.
Karen Simonyan, Andrew Zisserman. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv, 2014, 1409.1556.
Christian Szegedy, Wei Liu, Yangqing Jia, et al. Going deeper with convolutions. Proceedings of the IEEE conference on computer vision and pattern recognition, 2015: 1-9.
Min Lin, Qiang Chen, and Shuicheng Yan. Network in network. arXiv preprint arXiv, 2013, 1312.4400.
Sergey Ioffe, Christian Szegedy. Batch normalization: Accelerating deep network training by reducing internal covariate shift. International conference on machine learning, 2015: 448-456.
Christian Szegedy, Vincent Vanhoucke, Sergey Ioffe, et al. Rethinking the inception architecture for computer vision. Proceedings of the IEEE conference on computer vision and pattern recognition, 2016: 2818-2826.
Christian Szegedy, Sergey Ioffe, Vincent Vanhoucke, et al. Inception-v4, inception-resnet and the impact of residual connections on learning. Proceedings of the AAAI conference on artificial intelligence, 2017, 31(1).
Kaiming He, Xiangyu Zhang, Shaoqing Ren, et al. Deep residual learning for image recognition. Proceedings of the IEEE conference on computer vision and pattern recognition, 2016: 770-778.
Kaiming He, Xiangyu Zhang, Shaoqing Ren, et al. Identity mappings in deep residual networks. Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11–14, 2016, Proceedings, Part IV 14, pp. 630-645. Springer International Publishing, 2016.
Saining Xie, Ross Girshick, Piotr Dollár, et al. Aggregated residual transformations for deep neural networks. Proceedings of the IEEE conference on computer vision and pattern recognition, 2017: 1492-1500.
Ross Girshick, Jeff Donahue, Trevor Darrell, et al. Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the IEEE conference on computer vision and pattern recognition, 2014: 580-587.
Ross Girshick. Fast R-CNN. Proceedings of the IEEE international conference on computer vision, 2015: 1440-1448.
Shaoqing Ren, Kaiming He, Ross Girshick, et al. Faster R-CNN: Towards real-time object detection with region proposal networks. Advances in neural information processing, 2015, 28.
Joseph Redmon, Santosh Divvala, Ross Girshick, et al. You only look once: Unified, real-time object detection. Proceedings of the IEEE conference on computer vision and pattern recognition, 2016: 779-788.
Downloads
Published
Issue
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







