Location of Scene Text Based on Yolov7

Authors

  • Yulong Chang
  • Youchan Zhu
  • Kaili Cui
  • Fujun Guan
  • Zheng Li

DOI:

https://doi.org/10.54097/fcis.v2i3.5509

Keywords:

Text detection, YOLOv7, Deep learning

Abstract

Text is everywhere in daily life, and it carries rich and accurate information. Natural scene text detection technology can be widely used in various fields of life. Chinese is an important tool to carry culture. Therefore, it is of great significance to study natural scene Chinese text detection. For text location in Chinese scene, we use target detection method to train based on YOLOv7 model, and obtain effective detection results.

Downloads

Download data is not yet available.

References

Judd T. Learning to predict where humans look IEEE international conference on computer vision[J]. Proc. ICCV, 2009, 2009.

Breuel T M, Yanikoglu B A, Berkner K. The OCRopus open source OCR system (Proceedings Paper) [J]. IEEE, 2002.

Dalal N, Triggs B. Histograms of Oriented Gradients for Human Detection[C]. IEEE Computer Society Conference on Computer Vision & Pattern Recognition, 2005.

Low D G. Distinctive Image Features from Scale-Invariant Keypoints[J]. International Journal of Computer Vision, 2004.

Ojala T, Pietikainen M, Maenpaa T. Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns[C]. IEEE, 2002: 971-987.

Ren S, He K, Girshick R, et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2017, 39(6): 1137-1149.

Liu W, Berg A C, Fu C Y, et al. SSD: Single Shot MultiBox Detector[C]. ECCV, 2016: 21-37.

Liao M, Shi B, Bai X, et al. TextBoxes: a fast text detector with a single deep neural network[C]. National Conference on Artificial Intelligence, 2017.

Shi B, Bai X, Belongie S. Detecting oriented text in natural images by linking segments[C]. Proceedings of the IEEE conference on computer vision and pattern recognition, 2017: 2550-2558.

Ma J, Shao W, Ye H, et al. Arbitrary-Oriented Scene Text Detection via Rotation Proposals[J]. IEEE Transactions on Multimedia, 2017: 3111-3122.

Wang C-Y, Bochkovskiy A, Liao H-Y M. YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors[J]. arXiv preprint arXiv:2207.02696, 2022.

Ding X, Zhang X, Ma N, et al. RepVGG: Making VGG-style ConvNets Great Again[C]. IEEE, 2021: 13728-13737.

Downloads

Published

27-02-2023

Issue

Section

Articles

How to Cite

Chang, Y., Zhu, Y., Cui, K., Guan, F., & Li, Z. (2023). Location of Scene Text Based on Yolov7. Frontiers in Computing and Intelligent Systems, 2(3), 119-122. https://doi.org/10.54097/fcis.v2i3.5509