A Study of The OCR Development History and Directions of Development

Authors

  • Junmiao Wang

DOI:

https://doi.org/10.54097/bm665j77

Keywords:

optical character recognition, deep learning, text extraction.

Abstract

Optical character recognition (OCR) is a well-established technology that enables the conversion of scanned images and documents into editable and searchable electronic text. This technology has numerous applications across a range of industries and has proven to be a crucial tool for digitizing books, documents, and records. One of the main benefits of OCR technology is its ability to automate data entry processes, saving time and reducing errors that can be introduced through manual data entry. Additionally, OCR technology is used to extract text from images, which can be used as input data for machine learning and artificial intelligence applications. To better understand the current state of OCR technology, this systematic literature review is conducted that collected various research articles on the topic. The results of this paper provide insights into the strengths and limitations of OCR technology and also offer directions for future research in the field. In terms of language support, the paper found that OCR technology is capable of supporting a wide range of languages, including English, Spanish, Chinese, and many others. However, the accuracy of OCR technology can vary greatly based on the language, with some languages being more challenging to recognize than others. With continued advancements in technology and the increasing need for digitization, OCR technology will continue to play a crucial role in the development of many industries.

Downloads

Download data is not yet available.

References

T. Sakai, M. Nagao, and Y. Shinmi, “A character recognition system,” in Proc. Annual Conj IECE Japan, 1963, p.,450.

M. H. Glauberman, “Character Recognition for business machines,” Electronics, pp. 132-136, Feb. 1956.

W. J. Hannan, “R. C. A. multifont reading machine,” in Optical Character Recognition. G. L. Ficher et al., Eds. McGregor & Wemer, 1962, ip.3-14.

ERA, “An electronic reading automaton,” Electronic Eng., pp. 189-190, Apr. 1957.

R. W. Weeks, “Rotating raster character recognition system,” AIEE Trans., vol. 80, pt. I, Communications and Electronics, pp. 353-359, Sept. 1961.

T. Sakai, M. Nagao, and Y. Shinmi, “A character recognition system,” in Proc. Annual Conj IECE Japan, 1963, p. ,450.

C. Y. Suen and S. Mori, “Standardization and automatic recognition of hand-printed characters,” in Computer Analysis and Perception, vol. 1, Visual Signals, C. Y. Suen and De Mori, Eds. Boca Raton, FL: CRC Press, 1982, pp.41-53.

A. Krzyzak, W. Dai, and C. Y. Suen, “Unconstrained handwritten classification using modified backpropagation model,” in Proc. Frontiers in Handwritting Recognition (CENPARMI Concordia University), 1990, pp. 155-164.

J. J. Hull et al., “A blackboard-based approach to handwritten Zip Code recognition,” in Proc. US. PostalServiceAdv. Techol. C&f, 1988, pp. 1018-1032.

S. Lavirott and L. Pottier, ‘‘Mathematical formula recognition using graph grammar,’’ Proc. SPIE, vol. 3305, pp. 44–52, Apr. 1998.

H. Zhang, J. Guo, G. Chen, and C. Li, ‘‘HCL2000—A large-scale handwritten Chinese character database for handwritten character recognition,’’ in Proc. 10th Int. Conf. Document Anal. Recognit. (ICDAR), 2009, pp. 286–290.

D. Lin, F. Lin, Y. Lv, F. Cai, and D. Cao, ‘‘Chinese character CAPTCHA recognition and performance estimation via deep neural network,’’ Neurocomputing, vol. 288, pp. 11–19, May 2018.

Chaudhury, Ayan, et al. "A Deep OCR for Degraded Bangla Documents." Transactions on Asian and Low-Resource Language Information Processing 21.5 (2022): 1-20.

C. Moseley, Ed., Atlas of the World’s Languages in Danger. Paris, France: UNESCO Publishing, 2010.

Naseer, Asma, and Kashif Zafar. "Meta features-based scale invariant OCR decision making using LSTM-RNN." Computational and Mathematical Organization Theory 25 (2019): 165-183.

Y. Akbari, M. J. Jalili, J. Sadri, K. Nouri, I. Siddiqi, and C. Djeddi, ‘‘A novel database for automatic processing of persian handwritten bank checks,’’ Pattern Recognit., vol. 74, pp. 253–265, Feb. 2018.

H. S. Baird, “Feature extraction for hybrid structural/statistical pattern classification,” CVGIP, vol. 42, pp. 318-333, 1988.

C. Nadal, R. Legault, and C. Y. Sum, “Complementary algorithms for the recognition of totally unconstrained handwritten numerals,” in Proc. IOth IJCPR, June 1990, pp. 443-449.

Bai, Zhen-Long, and Qiang Huo. "A study on the use of 8-directional features for online handwritten Chinese character recognition." Eighth International Conference on Document Analysis and Recognition (ICDAR'05). IEEE, 2005.

Hwang, Wonseok, et al. "post-OCR parsing: building simple and robust parser via BIO tagging." Workshop on Document Intelligence at NeurIPS 2019. 2019.

Downloads

Published

15-12-2023

How to Cite

Wang, J. (2023). A Study of The OCR Development History and Directions of Development. Highlights in Science, Engineering and Technology, 72, 409-415. https://doi.org/10.54097/bm665j77