Determinants of Global Life Expectancy: A Machine Learning Analysis Using Random Forest Models
DOI:
https://doi.org/10.54097/swfw4290Keywords:
Life Expectancy, Socioeconomic Determinants, Health Expenditure, Machine Learning, Random Forest RegressionAbstract
In this paper, I adopt a data-driven approach to analyze global life expectancy. Using World Bank data from 174 countries covering the period 2001–2019, I model economic, social, and health factors that may predict a nation’s life expectancy. I first performed data preprocessing, which included median imputation for missing values, and then tested several regression models, as well as Random Forest, Boosting, and Decision Tree models. The Random Forest Regressor exhibited the best predictive performance and stability, as it can capture both non-linear and multi-level relationships between income and region. Feature importance analysis showed that undernourishment, communicable diseases, and healthcare expenditure had the most significant impacts on life expectancy, while education expenditure and injury had the least. These results support the view that income alone cannot guarantee population health; instead, investments in health infrastructure and access to basic resources are key to extending overall life expectancy.
Downloads
References
[1] Barber, R. M., et al. (2017). Healthcare access and quality index based on mortality from causes amenable to personal health care in 195 countries and territories, 1990–2015. The Lancet, 390(10091), 231–266. https://doi.org/10.1016/S0140-6736(17)30818-8.
[2] Balaj, M., et al. (2024). Education and life expectancy: Cross-national evidence. Social Science & Medicine, 347, 116833. https://doi.org/10.1016/j.socscimed.2024.116833.
[3] Deaton, A. (2003). Health, inequality, and economic development. Journal of Economic Literature, 41(1), 113–158. https://doi.org/10.1257/jel.41.1.113.
[4] Preston, S. H. (1975). The changing relation between mortality and level of economic development. Population Studies, 29(2), 231–248. https://doi.org/10.1080/00324728.1975.10410201.
[5] Zarulli, V., et al. (2021). Health system efficiency and the reallocation of resources. European Journal of Public Health, 31(4), 617–623. https://doi.org/10.1093/eurpub/ckab026.
[6] Morina, F., Komoni, A., Kilaj, D., Selmonaj, D., Grima, S. (2022). The effect of health expenditure on life expectancy. International Journal of Sustainable Development and Planning, Vol. 17, No. 5, pp. 1389-1401. https://doi.org/10.18280/ijsdp.170502.
[7] Reynolds, M., et al. (2018). Social spending and life expectancy in high-income countries. Journal of Epidemiology & Community Health, 72(3), 240-246. 10.1016/j.amepre.2017.09.001.
[8] Shkolnikov, V. M., Andreev, E. M., Tursun-Zade, R., & Leon, D. A. (2019). Patterns in the relationship between life expectancy and gross domestic product in Russia in 2005-15: a cross-sectional analysis. The Lancet. Public health, 4(4), e181–e188. https://doi.org/10.1016/S2468-2667(19)30036-2.
[9] Global Burden of Disease Study Collaborators. (2024). Decomposition of global life expectancy gains. The Lancet, 403(10440), 2100–2132. https://doi.org/10.1016/S0140-6736(24)00367-2.
[10] Chavan, S. S. (n.d.). Life expectancy & socio-economic (World Bank) [Data set]. Kaggle. Retrieved October 29, 2025, from https://www.kaggle.com/datasets/mjshri23/life-expectancy-and-socio-economic-world-bank.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Academic Journal of Management and Social Sciences

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

