Researches advanced in Natural Scenes Text Detection Based on Deep Learning

Authors

  • Qingyang Zhao

DOI:

https://doi.org/10.54097/hset.v16i.2500

Keywords:

text detection; Natural Scenes; Deep learning.

Abstract

The research on text detection and recognition in natural scenes is of great significance for obtaining information from scenes. Thanks to the rapid development of convolutional neural networks and the continuous proposal of scene text detection methods based on deep learning, breakthroughs have been made in the recognition accuracy and speed of scene texts. This paper mainly sorts, analyzes and summarizes the scene text detection method based on deep learning and its development. Firstly, the related research background and significance of scene text detection are discussed. Then, the second part is the elaboration of some main technical research routes of scene text detection. According to the timeline of the detection methods, the specific contents of various text detection models are further introduced. Thirdly, this paper compares and analyzes the experimental results of different models. Furthermore, improvements of some models with relationship, effects, advantages and disadvantages and expectations are further introduced. Finally, the challenges and development trends of scene text detection technology based on deep learning are summarized.

Downloads

Download data is not yet available.

References

Zhi Tian, Weilin Huang, Tong He, Pan He, Yu Qiao. 2016. Detecting Text in Natural Image with Connectionist Text Proposal Network. ECCV, 2016. Computer Vision and Pattern Recognition (cs.CV). Address of the paper: https://arxiv.org/abs/1609.03605

Xinyu Zhou, Cong Yao, He Wen, Yuzhi Wang, Shuchang Zhou, Weiran He, Jiajun Liang. 2017. EAST: An Efficient and Accurate Scene Text Detector. CVPR 2017. Computer Vision and Pattern Recognition (cs.CV). Address of the paper: https://arxiv.org/abs/1704.03155

Baoguang Shi, Xiang Bai, Serge Belongie. 2017. Detecting Oriented Text in Natural Images by Linking Segments. CVPR 2017. Computer Vision and Pattern Recognition (cs.CV). Address of the paper: https://arxiv.org/abs/1703.06520

Minghui Liao, Baoguang Shi, Xiang Bai, Xinggang Wang, Wenyu Liu. 2017. TextBoxes: A Fast Text Detector with a Single Deep Neural Network. AAAI2017. Computer Vision and Pattern Recognition (cs.CV). Address of the paper: https://arxiv.org/abs/1611.06779

Minghui Liao, Baoguang Shi, Xiang Bai. 2018. TextBoxes++: A Single-Shot Oriented Scene Text Detector. IEEE Transactions on Image Processing 27 (2018). Computer Vision and Pattern Recognition (cs.CV). Address of the paper: https://arxiv.org/abs/1801.02765

Wenhai Wang, Enze Xie, Xiang Li, Wenbo Hou, Tong Lu, Gang Yu, Shuai Shao. 2019. Shape Robust Text Detection with Progressive Scale Expansion Network. CVPR 2019. Computer Vision and Pattern Recognition (cs.CV). Address of the paper: https://arxiv.org/abs/1903.12473

Chuhui Xue, Shijian Lu, Wei Zhang. 2019. MSR: Multi-Scale Shape Regression for Scene Text Detection. IJCAI19. Computer Vision and Pattern Recognition (cs.CV). Address of the paper: https://arxiv.org/abs/1901.02596

Wenhai Wang, Enze Xie, Xiaoge Song, Yuhang Zang, Wenjia Wang, Tong Lu, Gang Yu, Chunhua Shen. 2019. Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network. ICCV 2019. Computer Vision and Pattern Recognition (cs.CV). Address of the paper: https://arxiv.org/abs/1908.05900v1

Yuliang Liu, Hao Chen, Chunhua Shen, Tong He, Lianwen Jin, Liangwei Wang. 2020. ABCNet: Real-time Scene Text Spotting with Adaptive Bezier-Curve Network. CVPR 2020. Computer Vision and Pattern Recognition (cs.CV). Address of the paper: https://arxiv.org/abs/2002.10200v2

Yiqin Zhu, Jianyong Chen, Lingyu Liang, Zhanghui Kuang, Lianwen Jin, Wayne Zhang. 2021. Fourier Contour Embedding for Arbitrary-Shaped Text Detection. CVPR 2021. Computer Vision and Pattern Recognition (cs.CV). Address of the paper: https://arxiv.org/abs/2104.10442

Mingxin Huang, Yuliang Liu, Zhenghao Peng, Chongyu Liu, Dahua Lin, Shenggao Zhu, Nicholas Yuan, Kai Ding, Lianwen Jin. 2022. SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition. CVPR 2022. Computer Vision and Pattern Recognition (cs.CV). Address of the paper: https://arxiv.org/abs/2203.10209

Downloads

Published

10-11-2022

How to Cite

Zhao, Q. (2022). Researches advanced in Natural Scenes Text Detection Based on Deep Learning. Highlights in Science, Engineering and Technology, 16, 188-197. https://doi.org/10.54097/hset.v16i.2500