Predicting Heart Disease Risk Using Logistic Regression and Random Forest Models: Balancing Interpretability and Accuracy
DOI:
https://doi.org/10.54097/97hr8750Keywords:
Heart disease, Machine learning, Logistic regression, Random Forest, Predictive modeling.Abstract
Cardiovascular disease, particularly heart disease, continues to be a major contributor to global mortality, making timely detection a critical priority. Traditional diagnostic approaches often identify the illness only after significant progression, underscoring the need for predictive tools that rely on clinical and demographic information. This study utilizes the Cleveland subset of the UCI Machine Learning Repository, which contains 270 patient records with 14 attributes, to explore predictive modeling for heart disease. We developed and assessed two methods: logistic regression and random forest. Logistic regression achieved an accuracy of 85% and an AUC of 0.918, offering transparency in interpreting risk factors such as ST depression, abnormal vessel counts, and thallium test outcomes. The random forest model delivered a comparable accuracy of 85% with an AUC of 0.906, highlighting similar predictors while capturing nonlinear patterns within the data. Our findings indicate that combining interpretable models with machine learning techniques provides a balanced and reliable strategy to support early heart disease detection and improve clinical decision-making.
Downloads
References
[1] Tsao C W, Aday A W, Almarzooq Z I, et al. heart disease and stroke statistics—2023 update: a report from the American Heart Association[J]. Circulation, 2023, 147(8): e93-e621.
[2] Liu S, Li Y, Zeng X, et al. Burden of cardiovascular diseases in China, 1990–2016: findings from the 2016 Global Burden of Disease Study[J]. JAMA Cardiology, 2019, 4(4): 342-352.
[3] Detrano R, Janosi A, Steinbrunn W, et al. international application of a new probability algorithm for the diagnosis of coronary artery disease[J]. American Journal of Cardiology, 1989, 64(5): 304-310.
[4] Karna V V R, Reddy K, Shyamala K, et al. A comprehensive review on heart disease risk prediction using machine learning and deep learning algorithms[J]. Archives of Computational Methods in Engineering, 2025, 32(3): 1763-1795.
[5] Riyaz L, Reddy P K, Bhatia M P S, et al. heart disease prediction using machine learning techniques: a quantitative review[C]//Proceedings of the International Conference on Innovative Computing and Communications (ICICC). Singapore: Springer, 2021: 627-639.
[6] Azimi Lamir A, Razzagzadeh S, Rezaei Z. A comprehensive machine learning framework for heart disease prediction: performance evaluation and future perspectives[J]. arXiv preprint arXiv:2505.09969, 2025.
Downloads
Published
Issue
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

