Competitive Sports Research Based on Data Analytics and Machine Learning Models

Authors

  • Shuting Hou
  • Xile Lan
  • Huiyao Zhang
  • Xingying Luo

DOI:

https://doi.org/10.54097/jg7zv636

Keywords:

Machine Learning, LightGBM, Regression Modeling.

Abstract

In contemporary competitive sports, data analysis and modeling applications have become indispensable tools for optimizing training and improving athletic performance. To effectively handle the variety and changing nature of matches, match data is combined to create models that can manage intricate match situations. In this paper, the data are normalized, missing values are filled by interpolation, and the outlier detection based on box diagram is used to complete the preprocessing. Furthermore, to assess player scores, 16 metrics are implemented, conventional machine learning models underwent training, and the top-performing LightGBM model is chosen, indicating the impact of each metric on the score. The results show that the LightGBM model is highly robust, precise, and accurate.

Downloads

Download data is not yet available.

References

[1] Wei Mengli, Zhong Yaping, Gui Huixian, et al. A machine learning based sports injury warning model[J]. Chinese Journal of Tissue Engineering Research, 2025, 29 (02): 409-418.

[2] Kong Xiangshen, Chu Xiangtong, Gao Han, et al. Empowering Innovative Applications and Development of Sports with Digital Intelligence: A Review of the 14th International Symposium on Sports Computer Science[J]. Sports Research, 2023, 44 (06): 20-26.

[3] Gao Xuebo, Yang Jihong. Research on Swimming Training Method Based on Machine Learning[J]. Journal of Lanzhou University of Arts and Sciences (Natural Science Edition), 2024, 38 (03): 110-114.

[4] Aziz Rabia Musheer, Baluch Mohammed Farhan, Patel Sarthak, et al. LightGBM: a machine learning approach for Ethereum fraud detection[J]. International Journal of Information Technology, 2022, 14(7): 3321-3331.

[5] Sharma Sachin. Stroke Prediction Using XGB Classifier, Logistic Regression, GaussianNB and BernaulliNB Classifier [C]. 2023 International Conference on Circuit Power and Computing Technologies (ICCPCT). IEEE, 2023: 1577-1581.

[6] Mannino, Gaetano, et al. "Nonlinear and multivariate regression models of current and voltage at maximum power point of bifacial photovoltaic strings." Solar Energy 269 (2024): 112357.

[7] Song, Honglin, et al. "Using complex networks and multiple artificial intelligence algorithms for table tennis match action recognition and technical-tactical analysis." Chaos, Solitons & Fractals 178 (2024): 114343.

Downloads

Published

31-10-2024

Issue

Section

Articles

How to Cite

Hou, S., Lan, X., Zhang, H., & Luo, X. (2024). Competitive Sports Research Based on Data Analytics and Machine Learning Models. Academic Journal of Science and Technology, 13(1), 78-82. https://doi.org/10.54097/jg7zv636