A Predictive Study of Heart Attack Triggers Based on Machine Learning
DOI:
https://doi.org/10.54097/f66wpg13Keywords:
Heart attack, machine learning, feature importance, random forest.Abstract
With increasing health concerns in society and the enormous surge in digital data generated from technological advancements, the need for efficient and accurate methods of value extraction from data is more evident than ever. Thus, such methods are developed to help medical professionals make thoughtful decisions. This paper explores a piece of health data from the Centers for Disease Control and Prevention (CDC) to find the most influential factors contributing to the occurrences of heart attacks and the best algorithm to help predict such occurrences. To achieve that goal, this paper employed algorithms including Support Vector Machine (SVM), Random Forest, and Logistic Regression, along with a technique called permutation feature importance to identify important features. In the end, Random Forest was found to be the most powerful and consistent algorithm to complete the task of prediction in this case, while also showing that average sleep hours, BMI, and mental health status of individuals can be potential indicators of heart attacks. With this result, this paper marks significant features and algorithms that can be subjects of further and more extensive studies, and eventually be applied to the work of real medical professionals.
Downloads
References
Eric J, Topol. COVID-19 can affect the heart. Science, 2020, 370: 408 - 409.
Ralph S. Physical activity as an index of heart attack risk in college alumni, American Journal of Epidemiology, 1978, 108 (3): 161 – 175.
Obasi. T and Omair Shafiq M. Towards comparing and using machine learning techniques for detecting and predicting heart attack and diseases, IEEE International Conference on Big Data (Big Data), Los Angeles, CA, USA, 2019, 2393 - 2402.
Hasib K. M. A survey of methods for managing the classification and solution of data imbalance problem,2020, arXiv preprint arXiv: 2012. 11870.
Gregory P. Dietl, Mary E. On the measurement of repair frequency: how important is data standardization? Palaios, 2013, 28 (6): 394 – 402.
Thara D.K., Prema Sudha B.G, Fan X. Auto-detection of epileptic seizure events using deep neural network with different feature scaling techniques, Pattern Recognition Letters, 2019, 128, 544 - 550.
Iwendi C. COVID-19 patient health prediction using boosted random forest algorithm, Frontiers in Public Health, 2020,8.
Karen M. Health differences between European countries, Social Science & Medicine, 2007, 64 (8): 1665 - 1678.
Aaby A, Friis K, Christensen B, Rowlands G, Maindal HT. Health literacy is associated with health behaviour and self-reported health: A large population-based study in individuals with cardiovascular disease. European Journal of Preventive Cardiology, 2017, 24 (17): 1880 - 1888.
Ram Kumar, R.P., Polepaka, S. Performance comparison of random forest classifier and convolution neural network in predicting heart diseases. In: Raju, K., Govardhan, A., Rani, B., Sridevi, R., Murty, M. (eds) Proceedings of the Third International Conference on Computational Intelligence and Informatics. Advances in Intelligent Systems and Computing, Springer, Singapore. 2020, 1090.
Downloads
Published
Issue
Section
License
Copyright (c) 2024 Highlights in Science, Engineering and Technology

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







