A Comparison of Machine Learning-Based Classification and Recognition Methods for Parkinson's Prediction

Authors

  • Jiayu Zhang

DOI:

https://doi.org/10.54097/2dq7z209

Keywords:

Phonetic character analysis; Parkinson's disease prediction; Logistic regression; XGBoost; Support vector machines.

Abstract

Parkinson's disease (PD) is a neurodegenerative disease that is often associated with abnormal voice function. Early recognition of speech features is crucial for the diagnosis and intervention of PD, and the development of efficient and reliable early screening tools has become a research hotspot in clinical medicine. Current research mostly focuses on single model optimization and lacks a multidimensional assessment framework. This study compares the classification performance of three machine learning models, Logistic Regression, XGBoost and Support Vector Machine (SVM), in PD prediction based on the publicly available Parkinson's speech feature dataset from UCI. Key features are extracted through data feature engineering, oversampling is used to address data imbalance, and metrics such as confusion matrices are used for multidimensional assessment. The results of the study showed that logistic regression had the best overall performance on the validation set, with an accuracy of 81.47% and a recall rate of 95% for healthy people, but its patient recall rate (71%) still needs to be optimized. XGBoost does not adequately deal with class imbalance, under-recognizes minority classes, and SVM performs poorly with the current data distribution. This study provides a theoretical basis for model selection for disease diagnosis and lays down a methodology for intelligent processing of multimodal medical data.

Downloads

Download data is not yet available.

References

[1] O. P. Neto, Harnessing Voice Analysis and Machine Learning for Early Diagnosis of Parkinson's Disease: A Comparative Study Across Three Datasets, Journal of Voice, Journal of Voice, 2024, ISSN 0892-1997.

[2] G. Solana-Lavalle, R. Rosas-Romero, Analysis of voice as an assisting tool for detection of Parkinson's disease and its subsequent clinical interpretation, Biomedical Signal Processing and Control, 2021, ISSN 1746-8094.

[3] N. D. Pah, V. Indrawati and D. K. Kumar, Voice-Based SVM Model Reliability for Identifying Parkinson’s Disease, IEEE Access, 2023, ISSN 2169-3536.

[4] C.O. Sakar, G. Serbes, A. Gunduz, H.C. Tunc, H. Nizam, B.E. Sakar, et al., A comparative analysis of speech signal processing algorithms for Parkinson’s disease classification and the use of the tunable Q-factor wavelet transform, Applied Soft Computing, Applied Soft Computing, 2019, ISSN 1568-4946.

[5] S. Lahmiri, A. Shmuel, Detection of Parkinson's disease based on voice patterns ranking and optimized support vector machine, Biomedical Signal Processing and Control, 2019, ISSN 1746-8094.

[6] Y.C. Tai, P.G. Bryan, F. Loayza, E. Peláez, A voice analysis approach for recognizing Parkinson’s disease patterns, IFAC-PapersOnLine, 2021, ISSN 2405-8963.

[7] T. J. Wroge, Y. Özkanca, C. Demiroglu, D. Si, D. C. Atkins and R. H. Ghomi, Parkinson’s Disease Diagnosis Using Machine Learning and Voice, 2018 IEEE Signal Processing in Medicine and Biology Symposium (SPMB), 2018, ISSN 2473-716X.

[8] P. Schober, T.R. Vetter, Logistic Regression in Medical Research. Anesthesia & Analgesia 132(2), 2021, ISSN 0003-2999.

[9] R. Santhanam, N. Uzir, S. Raman, S. Banerjee, Experimenting XGBoost Algorithm for Prediction and Classification of Different Datasets, 2017, ISSN 0974–5572.

[10] D.M. Abdullah, A.M. Abdulazeez, Machine Learning Applications based on SVM Classification A Review. Qubahan Academic Journal, 2021, ISSN 2709-8206

Downloads

Published

13-11-2025

Issue

Section

Articles

How to Cite

Zhang, J. (2025). A Comparison of Machine Learning-Based Classification and Recognition Methods for Parkinson’s Prediction. Academic Journal of Science and Technology, 17(1), 1-11. https://doi.org/10.54097/2dq7z209