Object Detection of UAV Aerial Image based on YOLOv8

Authors

  • Chen Liu
  • Fanrun Meng
  • Zhiren Zhu
  • Liming Zhou

DOI:

https://doi.org/10.54097/fcis.v5i3.13852

Keywords:

YOLOv8, UAV, WIoU

Abstract

With the development of technology, unmanned aerial vehicles (UAVs) have shed their military uses and gradually expanded to civilian and commercial fields. With the development of drone technology, object detection technology based on deep learning has become an important research topic in the field of drone applications. Apply object detection technology to unmanned aerial vehicles to achieve object detection and recognition of ground scenes from an aerial perspective. However, in aerial images taken by drones, the detection objects are mostly small targets, and the target scale changes greatly due to the influence of aerial perspective; The image background is complex, and the target object is easily occluded. It has brought many challenges to the target detection of unmanned aerial vehicles. Conventional object detection algorithms cannot guarantee detection accuracy when applied to drones, and optimizing the target detection performance of drones has become an important research topic in the field of drone applications. We improve the WIoUv3 loss function on the basis of YOLOv8s to reduce regression localization loss during training and improve the regression accuracy of the model. The experimental results indicate that the improved model mAP@0.5 It increased by 0.6 percentage points to 40.7%.

Downloads

Download data is not yet available.

References

Ren S,He K,Girshick R , et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2017, 39(6):1137-1149.

Berg A C , Fu C Y , Szegedy C , et al. SSD: Single Shot MultiBox Detector:,10.1007/978-3-319-46448-0_2[P]. 2015.

Redmon J, Divvala S ,Girshick R , et al. You Only Look Once: Unified, Real-Time Object Detection[J]. IEEE, 2016.

Zheng Z ,Wang P , Ren D , et al. Enhancing Geometric Factors in Model Learning and Inference for Object Detection and Instance Segmentation[J]. 2020.

TONG Z, CHEN Y, XU Z, et al. Wise-Io U: Bounding box regression loss with dynamic focusing mechanism[J]. ar Xiv preprint ar Xiv:2301.10051, 2023

Bochkovskiy A , Wang C Y , Liao H Y M .YOLOv4: Optimal Speed and Accuracy of Object Detection[J]. 2020. DOI: 10. 48550/arXiv.2004.10934. R. Girshick, "Fast r-cnn," Proceedings of the IEEE international conference on computer vision, vol. 12, pp. 1440-1448, 2015.

Kaiming,He,Xiangyu,et al.Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition[J].IEEE Transactions on Pattern Analysis & Machine Intelligence, 2015.DOI:10.1109/tpami.2015.2389824.

Lin T Y , Dollar P , Girshick R , etal. Feature Pyramid Networks for Object Detection[J]. IEEE Computer Society, 2017.

Wang W , Xie E , Song X , et al. Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network[J]. IEEE, 2019.

Zhang Y F , Ren W , Zhang Z , et al. Focal and Efficient IoU Loss for Accurate Bounding Box Regression[J].2021.

Gevorgyan Z. SIoU loss: More powerful learning for bounding box regression[J]. ar**v preprint ar**v:2205.12740, 2022.

Downloads

Published

14-11-2023

Issue

Section

Articles

How to Cite

Liu, C., Meng, F., Zhu, Z., & Zhou, L. (2023). Object Detection of UAV Aerial Image based on YOLOv8. Frontiers in Computing and Intelligent Systems, 5(3), 46-50. https://doi.org/10.54097/fcis.v5i3.13852