Predicting Stroke Risk: A Logistic Regression and Data Visualization Approach
DOI:
https://doi.org/10.54097/sv423433Keywords:
Stroke prediction, Losgistic regression, machine learning, data visualization.Abstract
Strokes significantly threaten global health, underscoring the urgent need for accurate predictive models to aid in prevention efforts. This study delves into the application of logistic regression and data visualization techniques to analyze a dataset comprising 5110 stroke patients and their characteristics. This paper identifies key attributes associated with stroke risk, including age, hypertension, heart disease, and glucose levels. The logistic regression model demonstrates a high accuracy of approximately 95%, with nuanced insights provided by comparing single-year versus multi-year data models. This paper demonstrates the power of data visualization and machine learning to analyze complex relationships between patient attributes and stroke incidence, concluding that the integration of advanced analytics into clinical practice can significantly bolster stroke prevention, emphasizing a personalized approach to healthcare that prioritizes early detection and intervention. The study contributes to the medical community’s arsenal against strokes, offering a robust healthcare predictive modeling framework. The synergy between data science and medicine plays a critical role in mitigating the impact of stroke, enhancing outcomes, and optimizing allocations.
Downloads
References
[1] E. D. Simon, O. Olatunji, U. D. Itanyi, & J. O. Aiyekomogbon. Computerized tomographic brain findings in hypertensives with stroke in Abuja, Nigeria. Journal of Radiation Medicine in the Tropics, 5 (1), 2024, 7 - 13.
[2] I. Kortazar-Zubizarreta, A. Pinedo-Brochado, I. Azkune-Calle, U. Aguirre-Larracoechea, M. Gomez-Beldarrain, & J. C. Garcia-Monco. Predictors of in‐hospital mortality after ischemic stroke: a prospective, single‐center study. Health Science Reports, 2 (4) 2019.
[3] J. V. E. Gerstl, S. E. Blitz, Q. R. Qu, A. G. Yearley, Philipp Lassarén, & R. Lindberg, et al. Global, regional, and national economic consequences of stroke. Stroke, 54 (9), 2023, 2380 - 2389.
[4] A. Bachmeier, & M. Heroux. Data Visualization through Graphic Representation, 2023.
[5] J. Shoultz. Geographic information systems in public health data visualization, analysis and sharing, 2015.
[6] A. Callahan, & N. H. Shah. Machine learning in healthcare. Key Advances in Clinical Informatics, 2017, 279 - 291.
[7] C. G. Soh, Y. Zhu, & T. L. Toh. A regularized logistic regression model with structured features for classification of geographical origin in olive oils. Chemometrics and Intelligent Laboratory Systems 2023.
[8] Lijuan Zhang. Data quality: the accuracy dimension (book review).
[9] B. Gaye, & A. Wulamu. Sentiment Analysis of Text Classification Algorithms Using Confusion Matrix, 2019.
[10] S. Simsin, & J. Khiewyoo. A Report of the Management of Continuous Independent Variables in Logistic Regression Analysis in Medical and Public Health Journals in Thailand 2017.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Academic Journal of Science and Technology

This work is licensed under a Creative Commons Attribution 4.0 International License.








