Real-time Classifier of Multilingual Font Styles based on ResNet, SwordNet, Logistic Regression and Random Forest Algorithms

Authors

  • Yue Wu

DOI:

https://doi.org/10.54097/fcis.v4i3.10735

Keywords:

Font Style Recognition, Real-time Font Identification, Image Text Recognition, Language Identification, ResNet; Random Forest, Logistic Regression, Pytesseract

Abstract

Different languages have different characters. At the same time, each character has a lot of font styles. This makes it difficult for humans to recognize different font styles for different characters. However, being able to detect and identify these font styles quickly and accurately has many important application use cases in different fields. At the same time, a large number of Internet users use web pages to query font styles. Therefore, I choose to make this real-time multilingual font style recognition algorithm. In this paper, I propose an algorithm that recognizes the input text and pictures in real time to judge the language and style of the text. It includes ResNet, SwordNet, logistic regression and random forest algorithms. The whole algorithm also calls pytesseract and Google Tesseract to realize text recognition and text positioning. I used Font Datasets used in "Font and Calligraphy Style Recognition Using Complex Wavelet Transform" for training. At the same time, I also built an image text recognition algorithm and generated various font styles as a data source. Based on this data, we adjusted the parameters and finally achieved an accuracy rate higher than 90%.

Downloads

Download data is not yet available.

References

A. Bozkurt, (2013, Nov. 19). Font Datasets used in "Font and Calligraphy Style Recognition Using Complex Wavelet Transform" [Online]. Available: https:// github. com/ alicanb/ fonts.

ACRON (2018) Australian Cybercrime Online Reporting Network (ACORN) [Online]. Available: http://www. acorn. gov. au/learn-about-cybercrime.

A. Biswal, (2021, Sep. 14). Bagging in Machine Learning: Step to Perform And Its Advantages [Online]. Available: https:// www.simplilearn.com/tutorials/machine-learning-tutorial/ bagging-in-machine-learning.

A. Ze, (2021, Jan. 20). [Machine Learning] Logistic Regression (very detailed) [Online]. Available: https://zhuanlan. zhihu. com/ p/74874291.

S.H. Tsang, (2018, Sep. 16). Review: ResNet --Winner of ILSVRC 2015 (Image Classification, Localization, Detection) [Online]. Available: https://towardsdatascience.com/review-resnet-winner-of-ilsvrc-2015-image-classification-localization -detection-e39402bfa5d8.

X. Li, et.al, “SwordNet: Chinese Character Font Style Recognition Network” Tianjin, 2016, pp.1-11.

Wangcy, (2016, Dec. 14). Maxpool Layer [Online]. Available: https: // www.jianshu.com/p/6928203bf75b.

L. Han, et.al, “A New Method of Mixed Gas Identification Based on a Convolutional Neural Network for Time Series Classification” Beijing, 2019, pp.9-9.

yalesaleng, (2018, Jul. 13). Global average pooling (GAP) [Online]. Available: https://www.jianshu.com/p/04f7771f4da2.

Downloads

Published

31-07-2023

Issue

Section

Articles

How to Cite

Wu, Y. (2023). Real-time Classifier of Multilingual Font Styles based on ResNet, SwordNet, Logistic Regression and Random Forest Algorithms. Frontiers in Computing and Intelligent Systems, 4(3), 7-16. https://doi.org/10.54097/fcis.v4i3.10735