Review of Small Target Detection based on Deep Learning

Authors

  • Heng Zhang
  • Wei Fu
  • Ke Wu

DOI:

https://doi.org/10.54097/fcis.v4i2.9900

Keywords:

Convolutional Neural Network, Residual Network, Image Classification

Abstract

With a large number of applications of target detection in daily life, the performance requirements of target detection are constantly improving. Many challenges about target detection have been put forward constantly, such as imbalanced samples, fewer pixels, and occlusion of the detected target, all of which bring difficulties for the model to correctly identify the target. Small target detection has always been a difficult point and research hotspot. In recent years, many algorithms for small target detection have been proposed, such as data enhancement, feature fusion, attention mechanism, and super-resolution network structure. According to the characteristics of different network structures, the training strategy can be appropriately adjusted and then applied to different environments, which can greatly improve the detection accuracy of small targets. This paper will introduce the data sets and related small target detection algorithms proposed in recent years, and classify, analyze, and compare the corresponding training strategies.

Downloads

Download data is not yet available.

References

CHEN C, LIU M Y, TUZEL O, et al. R CNN for small object detection[C]//Proceeding of Asian Conference on Computer Vision. Cham: Springer, 2016: 214 230.

LIN T Y, MAIRE M, BELONGIE S, et al. Microsoft COCO: Common objects in context[C]//Proceedings of European Conference on Computer Vision. Cham: Springer, 2014: 740 755.

YAEGER L, LYON R, WEBB B.Effective training of a neural network character classifier for word recognition[J].Advances in Neural Information Processing Systems,1996,9: 807 816.

Wei Wei, Pu Wei, Liu Yi. Improvement of YOLOv3 in aerial target detection [J]. Computer Engineering and Applications, 2020, 56(7): 17-23.

Wang Dongli, Liao Chunjiang, Mou Jinzhen, et al. Feature fusion-based small target detection for SSD vision [J]. Computer Engineering and Applications, 2020, 56(16): 31-36.

XING C, LIANG X, BAO Z.A small object detection solution by using super-resolution recovery[C]//2019 IEEE 7th International Conference on Computer Science and Network Technology(ICCSNT),2019:313-316.

Zhao P.F., Xie L.B., Peng L. A deep small target detection algorithm incorporating attention mechanism [J/OL]. Computer Science and Exploration [2021-10-18].

WOO S, HWANG S, KWEON I S. StairNet: top-down semantic aggregation for accurate one-shot detection [C]// Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, Lake Tahoe, Mar 12-15, 2018. Piscataway: IEEE, 2018:1093-1102.

Li W.T., Peng L. Small target detection algorithm for multi-scale channel attention fusion networks[J]. Computer Science and Exploration, 2021, 15(12): 2390-2400.

LIN T Y, MAIRE M, BELONGIE S, et al. Microsoft COCO: common objects in context [C]// European Conference on Computer Vision. Zurich: ECVA,2014: 740-755.

Pang Y, Cao J, Li Y, et al. TJU-DHD: A Diverse High-Resolution Dataset for Object Detection[J]. IEEE Transactions on Image Processing, 2021, 30:207-219.

XIA G S,BAI X,DING J,et al. DOTA: a large-scale dataset for object detection in aerial images[C] //IEEE Conference on Computer Vision and Pattern Recognition.Piscataway: IEEE, 2018: 3974-3983.

YU X,GHONG Y,JIANG N,et al. Scale match for tiny person detection [C]//Winter Conference on Applications of Computer Vision.Piscataway: IEEE,2020: 1246- 1254.

Tuggener L, Elezi I, Schmidhuber J. et al. DeepScores -- A Dataset for Segmentation, Detection and Classification of Tiny Objects: IEEE Computer Society, 10.1109/ ICPR. 2018. 8545 307 [P]. 2018.

YANG S, LUO P, LOY C C, et al. WIDER FACE: a face detection benchmark[C]//Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, Jun 27- 30, 2016. Washington: IEEE Computer Society, 2016: 5525-5533.

ZHANG S, BENENSON R, SCHIELE B. CityPersons: a diverse dataset for pedestrian detection[C]//Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, Jul 21-26, 2017. Washington: IEEE Computer Society, 2017:3213-3221.

WANG J W, YANG W, GUO H W, et al. Tiny object detection in aerial images[C]//Proceedings of the 2020 25th International Conference on Pattern Recognition, Milan, Jan 10-15, 2021. Piscataway: IEEE, 2021: 3791-3798.

ZAMIR S W, ARORA A, GUPTA A, et al. iSAID: a large-scale dataset for instance segmentation in aerial images [C]//Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, Long Beach, Jun 16- 20, 2019. Piscataway: IEEE, 2019: 28-37.

ZHANG S F, XIE Y L, WAN J, et al. WiderPerson: a diverse dataset for dense pedestrian detection in the wild [J]. IEEE Transactions on Multimedia, 2020, 22(2): 380-393.

SIMARD P Y, STEINKRAUS D, PLATT J C. Best practices for convolutional neural networks applied to visual document analysis [C]//Proceedings of ICDAR. [S.l.]: IEEE, 2003, 3 (2003).

KRIZHEVSKY A, SUTSKEVER I, HINTON G E. Imagenet classification with deep convolutional neural networks[J]. Advances in Neural Information Processing Systems,2012,25: 1097 1105.

WAN L, ZEILER M, ZHANG S, et al. Regularization of neural networks using dropconnect[C]//Proceedings of International Conference on Machine Learning. [S.l.]: PMLR, 2013: 1058 1066.

DEVRIES T, TAYLOR G W. Improved regularization of convolutional neural networks with cutout[EB/OL]. (2017 08 15)[2017 11 29].https://arxiv.org/abs/1708.04552.

ZHANG H, CISSE M, DAUPHIN Y N,et al.Mixup: Beyond empirical risk minimization [EB/OL]. (2017 10 25) [2018 04 27].

Yun S. Han D. Oh S J. et al. CutMix: Regularization Strategy to Train Strong Classifiers with Localizable Features[J]. 2019.

Bochkovskiy A, Wang C Y, Liao H . YOLOv4: Optimal Speed and Accuracy of Object Detection[J]. 2020.

KISANTAL M, WOJNA Z, MURAWSKI J, et al. Augmentation for small object detection[EB/OL]. (2019 02 19) [2019 02 19].

YUN S, HAN D, OH S J, et al. Cutmix: Regularization strategy to train strong classifiers with localizable features[C]// Proceedings of the IEEE/CVF International Conference on Computer Vision. New York: IEEE, 2019: 6023 6032.

BOCHKOVSKIY A, WANG C Y, LIAO H Y M. Yolov4: Optimal speed and accuracy of object detection[EB/OL]. (2020 04 23)[2020 04 23].https://arxiv.org/abs/2004.10934.

MÜLLER S G,HUTTER F.TrivialAugment:tuning- free yet state- of- the- art data augmentation[J].arXiv:2103. 10158, 2021.

Jie H, Li S, Gang S, et al. Squeeze-and-Excitation Networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, PP(99).

LI Y, LI S, DU H, et al. YOLO- ACN: focusing on small target and occluded object detection[J]. IEEE Access, 2020, 8: 227288-227303.

PAN H, CHEN G, JIANG J. Adaptively dense feature pyramid network for object detection[J]. IEEE Access, 2019, 7: 81132-81144.

ZHU G, WEI Z, LIN F. An object detection method combining multi-level feature fusion and region channel attention[J]. IEEE Access, 2021, 9: 25101-25109.

SHI W, BAO S, TAN D. FFESSD: an accurate and efficient single-shot detector for target detection[J]. Applied Sciences, 2019, 9(20): 4276.

WOO S, HWANG S, KWEON I S. StairNet: top-down segmantic aggregation for accurate one-shot detection[C]// Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, Lake Tahoe, Mar 12-15, 2018. Piscataway: IEEE, 2018:1093-1102.

TANG X,DU D K,HE Z,et al. PyramidBox: a context assisted single shot face detector [C]// European Conference on Computer Vision. Zurich: ECVA,2018: 812-828.

HU H,GU J,ZHANG Z,et al.Relation networks for object detection [C] // IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE,2018: 3588-3597.

Downloads

Published

25-06-2023

Issue

Section

Articles

How to Cite

Zhang, H., Fu, W., & Wu, K. (2023). Review of Small Target Detection based on Deep Learning. Frontiers in Computing and Intelligent Systems, 4(2), 40-45. https://doi.org/10.54097/fcis.v4i2.9900