Influencing Factors and Prediction of Heart Disease

Authors

  • Changhai Xia

DOI:

https://doi.org/10.54097/p88ad172

Keywords:

Heart Disease, Logistic Regression, Accuracy, Random Forest, Support Vector Machine.

Abstract

Heart disease (HD) is a key issue of concern in the global safety and health field nowadays. Some researchers listed the predisposing factors and established some models to predict the incidence rate. However, there is still a gap in the research on the influence of related factors and the comparison of different models. Therefore, the research topic of this article is the importance of HD factors and the comparison of model predictions. Following are this article’s research methods: Firstly, this project preprocesses and visualizes the data based on the Heart Failure Prediction Data Set. Then this article establishes three mainstream research models, including Logistic Regression (LR), Random Forest (RF), and Support Vector Machine (SVM) model. Finally, this article introduces the Receiver Operating Characteristic (ROC) curve for better comparison of models and illustrates the importance of variables on HD through the ranking graph. Research has found that the RF model has the highest prediction accuracy, reaching 86.34%, and its Area Under Curve (AUC) is also the largest. Therefore, through comparison, the predicted accuracy of the RF model is the best. Meanwhile, in the ranking image of variables, it can be concluded that the Slope of peak exercise ST segment (ST_Slope) has the greatest impact on HD. Due to the possibility of prediction errors and limited applicability, it is possible to consider improving the variables to enhance model accuracy and contribute to health.

Downloads

Download data is not yet available.

References

[1] Titti R R, Pukkella S, Radhika L S T. Augmenting HD prediction with explainable AI: A study of classification models. Computational and Mathematical Biophysics, 2024, 12 (1).

[2] Darrab S, Broneske D, Saake G. Exploring the predictive factors of HD using rare association rule mining. Scientific Reports, 2024, 14 (1): 18178 - 18178.

[3] Mienye D I, Jere N. Optimized ensemble learning approach with explainable AI for improved HD prediction. Information, 2024, 15 (7): 394 - 394.

[4] Islam A M, Majumder H Z M, Miah S M, et al. Precision healthcare: A deep dive into ML algorithms and feature selection strategies for accurate HD prediction. Computers in biology and medicine, 2024, 176108432 - 108432.

[5] A. H A A, Prabu P, Chandra R P, et al. Comprehensive evaluation and performance analysis of ML in HD prediction. Scientific Reports, 2024, 14 (1): 7819 - 7819.

[6] Omar D E, Mat H, Karim A Z A, et al. Comparative analysis of LR, Gradient Boosted Trees, SVM, and RF algorithms for prediction of acute kidney injury requiring dialysis after cardiac surgery. International journal of nephrology and renovascular disease, 2024, 17197 - 204.

[7] Yang Yanping, Li Rong Factors affecting HD based on Decision Tree LR model. Industrial control computer, 2024, 37 (08): 114 - 116.

[8] Sridharan K. An automated HD prediction approach using linearly support vector regression and stacked linear swarm optimization. Journal of Intelligent & Fuzzy Systems, 2023, 44 (2): 3189 - 3202.

[9] Mert O, Serhat P. A classification and regression tree algorithm for HD modeling and prediction. Healthcare Analytics, 2023, 3.

[10] Hendriks M P, Bosch D V E A, Geenen W L, et al. Blood biomarkers predict 10-year clinical outcomes in adult patients with congenital HD. JACC. Advances, 2024, 3 (9): 101130.

Downloads

Published

24-12-2024

How to Cite

Xia, C. (2024). Influencing Factors and Prediction of Heart Disease. Highlights in Science, Engineering and Technology, 123, 586-592. https://doi.org/10.54097/p88ad172