Pdiapredict-Explainer: An Interpretable AI System for Personalized Diabetes Risk Management
DOI:
https://doi.org/10.54097/kyn9yq85Keywords:
Interpretable AI, Diabetes Risk Prediction, Shap Values, Personalized Medicine, Behavioral Intervention.Abstract
This study presents an interpretable LightGBM model designed to predict diabetes risk, developed using the BRFSS2015 dataset, which includes more than 250,000 records. To make which is called the model more understandable and tackle the common "black box" issue, we incorporated feature engineering—such as creating a Metabolic. Risk score—and applied SMOTE to handle class imbalance. The resulting model achieved an AUC-ROC of 0.8264 and was particularly effective at identifying low-risk individuals, with a 98% recall rate, which can help simplify screening efforts. Using SHAP (SHapley Additive exPlanations), we visualized how different factors contribute to each person's risk, including waterfall and decision plots that break down individual cases—like showing how metabolic dysfunction might account for roughly 31% of a person's risk. This approach supports personalized intervention strategies aligned with ADA guidelines. However, the model's ability to catch high-risk cases was limited, with only an 18% recall, emphasizing the difficulty of detecting complex risk profiles without biochemical data like HbA1c. Moving forward, integrating such markers and tracking patients over time with electronic health records could improve performance. Overall, this framework illustrates how explainable AI can build trust in preventive diabetes strategies by making predictions transparent and clinically relevant.
References
[1] Saeedi, P., Petersohn, I., Salpea, P., Malanda, B., Karuranga, S., Unwin, N., ... Williams, R. (2019). Global and regional diabetes prevalence estimates for 2019 and projections for 2030 and 2045. Diabetes Research and Clinical Practice, *157*, 107843. https://doi.org/10.1016/j.diabres.2019.107843
[2] Choubey, S., Agrahari, S., Shaw, A., Dhar, S., Sarma, R. R., Singh, S. K., Das, P., & Saha, B. (2023). Diabetes prediction using ML. International Journal for Research in Applied Science & Engineering Technology, *11*(6), 4209–4212. https://doi.org/10.22214/jiraset.2023.54415
[3] Amann, J., Blasimme, A., Vayena, E., Frey, D., & Madai, V. I. (2020). Explainability for artificial intelligence in healthcare: a multidisciplinary perspective. BMC Medical Informatics and Decision Making, *20*(1), 310. https://doi.org/10.1186/s12911-020-01332-6
[4] Mishra, V. T., Mohan, R., Sreedharan, J. (2025). Prediction of 30-day readmission in diabetes management using Machine learning. Computers in Biology and Medicine, Volume 108, Article 110616. https://doi.org/10.1016/j.compbiomed.2025.110616
[5] Rudin, C. (2019). Stop explaining black box machine learning models. Nature Machine Intelligence, *1*(5), 206-215. https://doi.org/10.1038/s42256-019-0048-x
[6] Arrieta, A. B., Díaz-Rodríguez, N., Bennetot, A., Tabik, S., Barbado, A., García, S., ... Herrera, F. (2020). Explainable AI in healthcare. Artificial Intelligence in Medicine, *110*, 101964. https:// doi.org/10.1016/j.artmed.2020.101964.
[7] American Diabetes Association. (2022). American Diabetes Association releases 2023 standards of care in diabetes to guide prevention, diagnosis, and treatment for people living with diabetes [Press release]. https://diabetes.org/newsroom/american-diabetes-association-2023-standards-care-diabetes-guide-for-prevention-diagnosis-treatment-people-living-with-diabetes
[8] Lundberg, S. M., & Lee, S. I. (2017). A unified approach to interpreting model predictions. Advances in Neural Information Processing Systems, *30*, 4765–4774. https: // proceedings.neurips.cc/paper/2017/hash/8a20a8621978632d76c43dfd28b67767-Abstract.html.
[9] American Diabetes Association Professional Practice Committee. (2024). Introduction and Methodology: Standards of Care in Diabetes—2024. Diabetes Care, Volume 47(Supplement_1), S1–S4. https: //doi.org/10.2337/dc24-SINT
[10] American Diabetes Association. (2022). Standards of medical care in diabetes—2022. American Diabetes Association. https://doi.org/10.2337/dc22-Sint.
Downloads
Published
Issue
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







