Cardiovascular Diseases Risk Prediction Based on Machine Learning Models

Authors

  • Steven Kalung Li
  • Yuli Peng

DOI:

https://doi.org/10.54097/wdsr1890

Keywords:

Machine learning, CVD, logistic regression, decision tree.

Abstract

Cardiovascular Diseases (CVD) are the leading cause of death worldwide, CVD has a high death rate of 95% as it causes a blockage in blood vessels, which prevents blood from flowing to the brain or heart. Necessitating accurate risk prediction for effective prevention and management becomes a necessity of lowering death caused by cardiovascular disease. This paper presents a comprehensive study utilizing Logistic Regression and Decision Tree models implemented via Python to predict CVD risk. The models were trained on a dataset comprising numerous variables such as age, checkup, sex, and BMI which are essential factors and indicators that are related to health and wellbeing. A detailed evaluation would be produced, including accuracy, precision, and confusion matrix, demonstrates the efficiency of the proposed approach, with the Logistic Regression model achieving an accuracy of 91.94% and a precision of 71.94%. followed by the decision tree model achieving an accuracy of 86.14% and a precision of 19.7%.

Downloads

Download data is not yet available.

References

World Health Organization Cardiovascular diseases, 2023, https://www.who.int/health-topics/cardiovascular-diseases#tab=tab_1

Lloyd-Jones, D M et al. Framingham risk score and prediction of lifetime risk for coronary heart disease. The American Journal of Cardiology 94.1 2004, 20-24

Lakshmanarao, A Y et al. Machine learning techniques for heart disease prediction. Forest 95.99, 2019, 97.

Pal, M, and Smita P. Prediction of heart diseases using random forest. Journal of Physics: Conference Series. Vol. 1817. No. 1. IOP Publishing, 2021.

Ambrish, G, et al. Logistic regression technique for prediction of cardiovascular disease." Global Transitions Proceedings 3.1, 2022, 127-130.

Pasha, S N, et al. Cardiovascular disease prediction using deep learning techniques. IOP conference series: materials science and engineering. Vol. 981. No. 2. IOP Publishing, 2020.

LaValley M P. Logistic regression. Circulation, 2008, 117(18): 2395-2399.

Myles A J, Feudale R N, Liu Y, et al. An introduction to decision tree modeling. Journal of Chemometrics: A Journal of the Chemometrics Society, 2004, 18(6): 275-285.

Ambrish, G., et al. Logistic regression technique for prediction of cardiovascular disease. Global Transitions Proceedings 3.1, 2022, 127-130.

Pal, M, and Smita P. Prediction of heart diseases using random forest. Journal of Physics: Conference Series. Vol. 1817. No. 1. IOP Publishing, 2021.

Downloads

Published

26-01-2024

How to Cite

Li, S. K., & Peng, Y. (2024). Cardiovascular Diseases Risk Prediction Based on Machine Learning Models. Highlights in Science, Engineering and Technology, 81, 473-477. https://doi.org/10.54097/wdsr1890