Study of Factors for Heart Disease Prediction between Men and Women based on Multiple Regression Models

Authors

  • Jingyi Guo

DOI:

https://doi.org/10.54097/hset.v54i.9755

Keywords:

Heart disease; prediction; important factors; gender; regression models.

Abstract

Although previous studies have shown that there are differences in heart disease between men and women, the importance of some specific physical and chemical factors in the prediction of heart disease in different genders has not been clearly clarified. In this research, K-means clustering, multiple linear regression, logistic regression and random forest are adopted to analyze the UCI Heart Disease Data Set, which contains various physical and chemical indicators worth studying. The results demonstrate that exercise induced angina is more significant to the judgement of heart disease in women, while number of major vessels colored by fluoroscopy is more significant to the judgement of heart disease in men and type of chest pain is a statistically significant variable for both men and women. Thalassemia, ST depression induced by exercise relative to rest, greatest number of heartbeats per minute, age, resting blood pressure also have reference value for the judgment of heart disease. In terms of each model's fit to heart disease prediction, for women, the accuracy of random forest is the first, logistic regression is the second, and multiple linear regression is the third, while for men, the accuracy of random forest is the first, multiple linear regression is the second, and logistic regression is the third. These conclusions are an optimization of previous studies, and to a certain extent reflect that this study is of great significance to the prevention of heart disease in different groups of people.

Downloads

Download data is not yet available.

References

Chenxi Xia, Xiang Wang, Xuyang Meng, et al. Risk factors analysis of coronary heart disease in Chinese elderly patients with severe valvular heart disease: a national multicenter cross-sectional study. Chinese Journal of Molecular Cardiology, 2022, 22(06): 5000-5004.

Yin Ouyang, Ye Tan, Xin Deng. Application of regression analysis in clinical research of coronary atherosclerotic heart disease. China Medicine and Pharmacy, 2021, 11(18): 44-47.

Ying Yang, et al. Current status and etiology of valvular heart disease in China: a population-based survey. BMC Cardiovasc Disord, 2021, 21(01): 339.

Mengmeng Chen, Zhenhong Fang, Wenyi Tu, et al. Construction and effect analysis of heart disease prediction model based on logistic regression model. Hospital Management Forum, 2022, 39(02): 32-35.

Xiaodan Nie, Saisai Cui, Bo Sun, et al. Analysis of risk factors of coronary atherosclerotic heart disease. Trauma and Critical Care Medicine, 2022, 10(01): 68-70.

Jinchao Zhao, Yi Li, Dong Wang, et al. Heart disease prediction algorithm based on optimization random forest. Journal of Qingdao University of Science and Technology (Natural Science Edition), 2021, 42(02): 112-118.

Li Yang, Haibin Wu, Xiaoqing Jin, et al. Study of cardiovascular disease prediction model based on random forest in eastern China. Scientific Reports, 2020, 10(01): 5245.

Yu Liu, Mu Qiao. Prediction of heart disease based on clustering and XGboost algorithm. Computer system application, 2019, 28(01): 228-232.

Norris C M, et al. State of the science in women's cardiovascular disease: a Canadian perspective on the influence of sex and gender. J Am Heart Assoc, 2020, 9(04): e015634.

Nussbaum S S, et al. Sex-specific considerations in the presentation, diagnosis, and management of ischemic heart disease: JACC focus seminar 2/7, Journal of the American College of Cardiology, 2021, 78(02): 189-192.

Downloads

Published

04-07-2023

How to Cite

Guo, J. (2023). Study of Factors for Heart Disease Prediction between Men and Women based on Multiple Regression Models. Highlights in Science, Engineering and Technology, 54, 189-198. https://doi.org/10.54097/hset.v54i.9755