Researches advanced in Natural Scenes Text Detection Based on Deep Learning
DOI:
https://doi.org/10.54097/hset.v16i.2500
Keywords:
text detection; Natural Scenes; Deep learning.
Abstract
Detecting and recognizing text in natural scenes is important for extracting information from those scenes. Thanks to the rapid development of convolutional neural networks and the steady stream of scene text detection methods based on deep learning, breakthroughs have been made in both the accuracy and the speed of scene text recognition. This paper organizes, analyzes, and summarizes deep-learning-based scene text detection methods and their development. First, it discusses the research background and significance of scene text detection. Second, it describes the main technical research routes of scene text detection and, following the timeline of the detection methods, introduces the specifics of the various text detection models. Third, it compares and analyzes the experimental results of different models. It then discusses improvements to some of the models, covering how the models relate to one another, their effects, their advantages and disadvantages, and expectations for them. Finally, it summarizes the challenges and development trends of scene text detection technology based on deep learning.
License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







