Prediction for Olympic Medal Tables by Machine Learning

Authors

  • Jianzhang Li
  • Yueran Zhang

DOI:

https://doi.org/10.54097/a1e0zc18

Keywords:

ARIMA Model, XGBoost, SVM, Random Forest

Abstract

The goal of this article is to predict the gold and overall medal rankings for 2028. ARIMA is used to analyze cyclical fluctuations, followed by XGBoost optimization for time-series predictions. The model forecasts gold medal standings and visualizes the top 20 countries. Further, this paper pay attention to predicting the likelihood of countries without Olympic gold medals winning their first in 2028. Our study create binary classification labels and countries that have never won gold are identified and relevant features are extracted. SVM is employed for classification, with an AUC score close to 1, indicating high accuracy. Eventually, the relationship between event selection and medal counts is analyzed. The data is preprocessed and event types are converted into categorical variables. Random Forest regression is used, revealing that host country event selection affects medal performance but has minimal impact on overall rankings. The model’s performance is validated using MAE, MSE, and other metrics.

Downloads

Download data is not yet available.

References

[1] Bernard A B, Busse M R. Who Wins the Olympic Games: Economic Resources and Medal Totals[J]. Review of Economics Statistics,2006,86(1).

[2] Aimin Guo, Mingfa Zhao. “Prediction of the 2016 Summer Olympics Gold Medal Rankings Based on Grey Theory.” China Science and Technology Information, 2013(9).

[3] Shiyu Wang. “Olympic Medal Prediction Model Based on Nonlinear Regression and BP Neural Network.” Physical Observation and Sports Equipment Technology, 2017(24):3.

[4] Box, G. E., Jenkins, G. M., & Reinsel, G. C. (2008). Time Series Analysis: Forecasting and Control (4th ed.). Wiley.

[5] Hyndman, R. J., & Athanasopoulos, G. (2018). Forecasting: principles and practice (2nd ed.). OTexts.

[6] Chen, T., & Guestrin, C. (2016). XGBoost: A scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 785-794.

[7] Liu, H., & Yang, X. (2022). Research on the Prediction of Housing Prices Based on Random Forest Regression Model. Journal of Data Science and Analytics, 4(3), 45-56.

[8] Zhang, Y., & Li, Z. (2021). Avocado Price Prediction Using Random Forest Algorithm. International Journal of Data Science, 3(2), 89-102.

[9] Wang, L., & Zhang, J. (2023). Performance Prediction Model of Cryogenic Cooling Machine Based on Random Forest Regression. Optics and Precision Engineering, 31(5), 126-134.

[10] Chen, J., & Zhang, W. (2023). Spatiotemporal Evolution and Driving Mechanisms of Urban Construction Land Structure in the Yellow River Basin: A Random Forest Approach. Progress in Geography, 42(7), 1105-1118.

Downloads

Published

20-05-2025

Issue

Section

Articles

How to Cite

Li, J., & Zhang, Y. (2025). Prediction for Olympic Medal Tables by Machine Learning. International Journal of Biology and Life Sciences, 10(2), 64-73. https://doi.org/10.54097/a1e0zc18