Introdu Research on the Influencing Factors of Diabetes Prediction Model among Pima Indian Heritages Female Data
DOI:
https://doi.org/10.54097/rf0dyd88Keywords:
Diabetes; multiple linear regression; influencing factors.Abstract
Diabetes, as the chronic disease that has the greatest impact on the health of the world, has caused an immeasurable impact on people's lives. This research employed Multiple Linear Regression to explore various factors contributing to the incidence of diabetes. The data contains multiple variables that may predict diabetes. The goal was to assess their predictive value on the diabetes binary outcome through a multiple linear regression model. Diagnostic plots indicated potential model violations, with the residuals vs. fitted values plot suggesting non-linear relationships and variable residual variance. The normal Q-Q plot revealed minor departures from normality, principally at the data extremes. The Scale-Location plot raised questions about the consistency of variance across the data, while leverage analysis did not identify any significantly influential data points. The MLR model yielded an R-squared value of 0.3033, suggesting that approximately 30% of the variability in diabetes outcomes was accounted for by the model. There were some variables that were found to be very significant: Glucose and BMI, which showed p-values that suggested strong associations with diabetes. The findings from this study have identified important determinants of health and have also indicated reasons why there is a need for improvements in the model diagnostics for diabetes. For better health care outcomes, these findings show that diabetes risks are complex and that predictive capacities could benefit from more sophisticated analytical models.
Downloads
References
Xie Z, Nikolayeva O, Luo J, Li D. Building Risk Prediction Models for Type 2 Diabetes Using Machine Learning Techniques. Prev Chronic Dis, 2019, 16: 190109.
Ambhrish G, Bharathi G, Anitha G, Chetana S, Dhanraj, Kiran M. Logistic regression technique for prediction of cardiovascular disease. Global Transitions Proceedings, 2020, 3(1), 127-130.
Wu Y, Ding Y, Tanaka Y, Zhang W. Risk factors contributing to type 2 diabetes and recent advances in the treatment and prevention. Int J Med Sci., 2014, 11(11): 1185-1200.
Chang V, Bailey J, Xu Q A, Sun Z. Pima Indians diabetes mellitus classification based on machine learning (ML) algorithms. Neural Comput Appl., 2022, 24: 1-17.
Pierannunzi C, Hu S S, Balluz L. A systematic review of publications assessing reliability and validity of the Behavioral Risk Factor Surveillance System (BRFSS), 2004-2011. BMC Med Res Methodol., 2013, 13: 49.
Butt U M, Letchmunan S, Ali M, Hassan F H, Baqir A, Sherazi H H R. Machine Learning Based Diabetes Classification and Prediction for Healthcare Applications. J Healthc Eng., 2021.
Rath S, Tripathy A, Tripathy A R. Prediction of new active cases of coronavirus disease (COVID-19) pandemic using multiple linear regression model. Diabetes Metab Syndr., 2020, 14(5): 1467-1474.
Chakraborty S, Bhattacharyya R, Banerjee D. Infections: A Possible Risk Factor for Type 2 Diabetes. Adv Clin Chem, 2017, 80: 227-251.
Massouh N, Jaffa A A, Tamim H, Jaffa M A. Social and racial inequalities in diabetes and cancer in the United States. Front Public Health, 2023, 11: 1178979.
Tan S Y, et al. Type 1 and 2 diabetes mellitus: A review on current treatment approach and gene therapy as potential intervention. Diabetes Metab Syndr, 2019, 13(1): 364-372.
Downloads
Published
Issue
Section
License
Copyright (c) 2024 Highlights in Science, Engineering and Technology

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







