Implementation and Evaluation of a Simple Convolutional Neural Network for Object Classification in Visual Assistance Systems

Raymond Pu

doi:10.54097/2tw8na43

Authors

Raymond Pu

DOI:

https://doi.org/10.54097/2tw8na43

Keywords:

Machine learning; Image classification; Visual assistance systems; Convolutional neural network.

Abstract

Image object recognition and classification has become more widespread in use and capable in functionality due to advancement in machine learning. One application of it is visual assistance systems for visually impaired persons, and there has been many existing proposed implementations and solutions about such systems. This article implements a simple convolutional neural network (CNN) machine learning model based on a simpler version of the architectures found in existing object detection models, which is trained using select data categories of the ImageNet dataset. The model is evaluated using sparse categorical accuracies, measuring the proportion of correct classifications, and comparing the number of predicted and expected classifications for each data category. The accuracy of the model did not perform well as expected because of significant overfitting behavior noticed through the validation loss. Even if early stopping to reduce overfitting is implemented, the overall accuracies still cannot be fully remediated. However, the difference between the number of expected and actual predictions is comparable.

Downloads

Download data is not yet available.

References

[1] Radovic M, Adarkwa O, Wang Q. Object recognition in aerial images using convolutional neural networks. Journal of Imaging, 2017, 3(2): 21.

[2] Yadav S, Joshi R C, Dutta M K, et al. Fusion of object recognition and obstacle detection approach for assisting visually challenged person//2020 43rd International Conference on Telecommunications and Signal Processing (TSP). IEEE, 2020: 537-540.

[3] Salavati P, Mohammadi H M. Obstacle detection using GoogleNet//2018 8th international conference on computer and knowledge engineering (ICCKE). IEEE, 2018: 326-332.

[4] Trabelsi R, Jabri I, Melgani F, et al. Indoor object recognition in RGBD images with complex-valued neural networks for visually-impaired people. Neurocomputing, 2019, 330: 94-103.

[5] Redmon J, Farhadi A. YOLO9000: better, faster, stronger//Proceedings of the IEEE conference on computer vision and pattern recognition. 2017: 7263-7271.

[6] Redmon J. Yolov3: An incremental improvement. arxiv preprint arxiv:1804.02767, 2018.

[7] Sudharshan D P, Raj S. Object recognition in images using convolutional neural network//2018 2nd International Conference on Inventive Systems and Control (ICISC). IEEE, 2018: 718-722.

[8] Deng J, Dong W, Socher R, et al. Imagenet: A large-scale hierarchical image database//2009 IEEE conference on computer vision and pattern recognition. Ieee, 2009: 248-255.

[9] Chrabaszcz P, Loshchilov I, Hutter F. A downsampled variant of imagenet as an alternative to the cifar datasets. arxiv preprint arxiv:1707.08819, 2017.

[10] “Cross-Entropy Cost Functions Used in Classification.” GeeksforGeeks, 11 Oct. 2021, www.geeksforgeeks.org/cross-entropy-cost-functions-used-in-classification/.