Detection of Small Object based on Improved-YOLOv8

Authors

  • Yingying Tan
  • Jinpeng Song
  • Chen Chu

DOI:

https://doi.org/10.54097/n5rtnt71

Keywords:

Small Object Detection, YOLOv8, Wavelet Transform, Multiscale Separation, Feature Fusion

Abstract

An improved YOLOv8 model is proposed to address the issue of poor recognition performance caused by their low resolution and weak feature representation in small object detection task. Firstly, to extract a richer set of low-level features from images, a WT_Conv module is designed to fuse the feature components extracted by WT (Wavelet Transform) with those extracted by convolutional layer. Secondly, based on the idea that shallow and deep feature maps contain information at different scales, a MS (Multiscale Separation) module is designed to preserve the features of small objects separated from shallow layer and transfer the salient features of large objects to the deeper layers, effectively solving the problem of inconsistent feature expression caused by the direct fusion of shallow and deep feature maps. Finally, we introduce the DE (Detail Enhancement) module capable of fusing adjacent feature maps to process the small-objects features separated by the MS module, enhancing feature representation for small objects. Experiment results on UAVOD-10 and Small Object datasets show that our model achieves a mAP improvement of 9.5% and 2.3% respectively over the baseline, and it also shows a significant advantage over other comparative models, affirming the effectiveness of the proposed model for small object detection tasks.

Downloads

Download data is not yet available.

References

[1] BOCHKOVSKIY A, WANG C Y, LIAO H Y M.YOLOv4: Optimal Speed and Accuracy of Object Detection [EB/OL]. 2020. https://arxiv.org/ abs/2004.10934.

[2] KISANTAL M, WOJNA Z, MURAWSKI J, et al. Augmentation for Small Object Detection [EB/OL]. 2019. https: // arxiv.org/abs/1902.07296. DOI: https://doi.org/10.5121/csit.2019.91713

[3] LI Li-xia, WANG Xin, WANG Jun, et al. Small object detection algorithm in UAV image based on feature fusion and attention mechanism[J]. Journal of Graphics, 2023,44(04):658-666.

[4] LI Qingyuan, DENG Zhaohong, LUO Xiaoqing, et al. SSD Object Detection Algorithm with Attention and Cross-Scale Fusion[J]. Journal of Frontiers of Computer Science and Technology, 2022,16(11): 2575-2586.

[5] MA J, SHAO W, YE H, et al. Arbitrary-Oriented Scene Text Detection via Rotation Proposals[J]. IEEE Transactions on Multimedia, 2017,99: 1-1.

[6] LI Y, PANG Y, SHEN J, et al. NETNet: Neighbor Erasing and Transferring Network for Better Single Shot Object Detection [J]. IEEE, 2020. DOI: https://doi.org/10.1109/CVPR42600.2020.01336

[7] HUANG S, LIU Q. Addressing Scale Imbalance for Small Object Detection with Dense Detector[J]. Neurocomputing, 2022, 473:68-78. DOI: https://doi.org/10.1016/j.neucom.2021.11.107

[8] LIU S, QI L, QIN H, et al. Path Aggregation Network for Instance Segmentation[C]//Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition. 2018: 8759-8768. DOI: https://doi.org/10.1109/CVPR.2018.00913

[9] TAN M, PANG R, LE Q V. Efficientdet: Scalable and Efficient Object Detection[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2020: 10781-10790. DOI: https://doi.org/10.1109/CVPR42600.2020.01079

[10] GHIASI G, LIN T Y, LE Q V. NAS-FPN: Learning Scalable Feature Pyramid Architecture for Object Detection[C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2019: 7036-7045. DOI: https://doi.org/10.1109/CVPR.2019.00720

[11] REDMON J, FARHADI A, YOLO9000: Better, Faster, Stronger [C] //IEEE Conference on Computer Vision & Pattern Recognition, Honolulu: IEEE Press, 2017: 6517-6525. DOI: https://doi.org/10.1109/CVPR.2017.690

[12] B W, Y J, C Y J. Beyond Deep Residual Learning for Image Restoration: Persistent Homology-guided Manifold Simplification [C]// Proceedings of the IEEE conference on computer vision and pattern recognition workshops. 2017: 145-153.

[13] HUANG H, HE R, SUN Z, et al. Wavelet-SRNet: A Wavelet-Based CNN for Multi-scale Face Super Resolution[C]//2017 IEEE International Conference on Computer Vision (ICCV). IEEE, 2017. DOI: https://doi.org/10.1109/ICCV.2017.187

[14] LI Lin, JIN Zhixin, YU Xiaolei, et al. Road Vehicle and Pedestrian Detection Based on YOLOv9 for Haar Wavelet Downsampling [J/OL]. Computer Engineering and Applications, 2024:1-9.

[15] HE Guohuan, ZHU Jiangping. WT-U-Net++: Surface Defect Detection Network based on Wavelet Transform[J]. Journal of Computer Applications, 2023, 43(10): 3260-3266.

[16] HAN W, LI J, WANG S, et al. A Context-scale-aware Detector and a New Benchmark for Remote Sensing Small Weak Object Detection in Unmanned Aerial Vehicle Images[J]. International Journal of Applied Earth Observation and Geoinformation, 2022, 112:102966. DOI: https://doi.org/10.1016/j.jag.2022.102966

[17] ZHENG M, LEI Y, CHAN A B. Small Instance Detection by Integer Programming on Object Density Maps[C]// Proceedings of the IEEE conference on computer vision and pattern recognition. 2015: 3689-3697.

[18] ZHU X, LYU S, WANG X, et al. TPH-YOLOv5: Improved YOLOv5 based on Transformer Prediction Head for Object Detection on Drone-captured Scenarios[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision. 2021: 2778-2788. DOI: https://doi.org/10.1109/ICCVW54120.2021.00312

[19] ZHANG Z. Drone-YOLO: an Efficient Neural Network Method for Target Detection in Drone Images[J]. Drones, 2023, 7(8): 526. DOI: https://doi.org/10.3390/drones7080526

Downloads

Published

29-12-2024

Issue

Section

Articles

How to Cite

Tan, Y., Song, J., & Chu, C. (2024). Detection of Small Object based on Improved-YOLOv8. Frontiers in Computing and Intelligent Systems, 10(3), 79-85. https://doi.org/10.54097/n5rtnt71