YOLOv10-based Model for Player and Football Detection
DOI:
https://doi.org/10.54097/2sx59328Keywords:
YOLOv10, Target detection, Football detection, Player detection, Deep learningAbstract
This study presents an advanced YOLOv10n-based method for the automatic detection of football players and balls directly from match videos. We enhance the YOLOv10 architecture with several significant improvements, including additional detection heads, the integration of C2f_faster and C3_faster modules for enhanced processing speed and accuracy, and the inclusion of BotNet modules with self-attention mechanisms for managing complex visual scenes. Further, we incorporate GhostConv modules to reduce computational overhead while maintaining effective feature extraction. These architectural modifications ensure robust detection capabilities in real-time sports environments, addressing challenges such as high-speed movements, frequent occlusions, and variable lighting conditions typical of both indoor and outdoor stadiums. Validation on internet-sourced images from football matches demonstrates the practicality and effectiveness of our model.
References
[1] Redmon, J., Divvala, S., Girshick, R., & Farhadi, A. (2016). You only look once: Unified, real-time object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 779-788).
[2] Wang, A., Chen, H., Liu, L., Chen, K., Lin, Z., Han, J., & Ding, G. (2024). Yolov10: Real-time end-to-end object detection. arxiv preprint arxiv:2405.14458.
[3] Maćkowiak, S., Kurc, M., Konieczny, J., & Maćkowiak, P. (2010, September). A complex system for football player detection in broadcasted video. In ICSES 2010 International Conference on Signals and Electronic Circuits (pp. 119-122). IEEE.
[4] Direkoglu, C., Sah, M., & O’Connor, N. E. (2018). Player detection in field sports. Machine Vision and Applications, 29, 187-206.
[5] Komorowski, J., Kurzejamski, G., & Sarwas, G. (2019). Footandball: Integrated player and ball detector. arxiv preprint arxiv:1912.05445.
[6] Cioppa, A., Deliege, A., Huda, N. U., Gade, R., Van Droogenbroeck, M., & Moeslund, T. B. (2020). Multimodal and multiview distillation for real-time player detection on a football field. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops (pp. 880-881).
[7] Wang, T., & Li, T. (2022). Deep Learning‐Based Football Player Detection in Videos. Computational Intelligence and Neuroscience, 2022(1), 3540642.
[8] Diwan, K., Bandi, R., Dicholkar, S., & Khadse, M. (2023, February). Football player and ball tracking system using deep learning. In Proceedings of International Conference on Data Science and Applications: ICDSA 2022, Volume 1 (pp. 757-769). Singapore: Springer Nature Singapore.
[9] Wang, C. Y., Liao, H. Y. M., Wu, Y. H., Chen, P. Y., Hsieh, J. W., & Yeh, I. H. (2020). CSPNet: A new backbone that can enhance learning capability of CNN. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops (pp. 390-391).
[10] Lin, T. Y., Dollár, P., Girshick, R., He, K., Hariharan, B., & Belongie, S. (2017). Feature pyramid networks for object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 2117-2125).
[11] Neubeck, A., & Van Gool, L. (2006, August). Efficient non-maximum suppression. In 18th international conference on pattern recognition (ICPR'06) (Vol. 3, pp. 850-855). IEEE.
[12] Chen, J., Kao, S. H., He, H., Zhuo, W., Wen, S., Lee, C. H., & Chan, S. H. G. (2023). Run, don't walk: chasing higher FLOPS for faster neural networks. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 12021-12031).
[13] Srinivas, A., Lin, T. Y., Parmar, N., Shlens, J., Abbeel, P., & Vaswani, A. (2021). Bottleneck transformers for visual recognition. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 16519-16529).
[14] Han, K., Wang, Y., Tian, Q., Guo, J., Xu, C., & Xu, C. (2020). Ghostnet: More features from cheap operations. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 1580-1589).
Downloads
Published
Issue
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
