Parkinson's Disease Classification Using Vocal Biomarkers, XGBoost, and SHAP

Authors

  • Wafiq Mariatul Azizah Writer
  • Irma Amelia Dewi

DOI:

https://doi.org/10.35194/mji.v18i1.6458

Keywords:

Parkinson's disease , XGBoost , SHAP , SMOTE , Voice biomarkers

Abstract

Parkinson's disease (PD) is a progressive neurodegenerative disorder affecting more than 11.77 million people worldwide. Voice signal analysis has gained attention as a non-invasive screening approach because nearly 90% of PD patients experience measurable speech impairments. However, previous machine learning studies on PD voice datasets commonly face several limitations, including class imbalance that may lead to data leakage, the use of accuracy as the primary evaluation metric, and limited utilization of model interpretability methods. This study proposes a PD classification pipeline integrating SMOTE, XGBoost, and SHAP using the UCI Parkinson dataset, which consists of 195 samples and 22 acoustic features. A quantitative experimental approach was employed using 5-fold stratified cross-validation, where SMOTE was applied only to the training data within each fold to prevent data leakage, while SHAP was used for feature analysis and feature reduction experiments. The results showed that SMOTE improved the F1-Score from 0.9400 to 0.9527 and the Accuracy from 0.9077 to 0.9282. The final model achieved a mean AUC-ROC of 0.9614 and a Recall of 0.9592 across five folds. SHAP analysis showed differences between SHAP feature rankings and XGBoost built-in importance, with MDVP:Shimmer exhibiting the largest ranking change. In addition, the top-8 SHAP-ranked features achieved performance comparable to the full 22-feature model, obtaining an Accuracy of 0.9282 and an AUC of 0.9612. These findings indicate that the proper application of SMOTE and SHAP-based feature selection can improve model evaluation and provide additional information for feature analysis in Parkinson's disease classification.

References

[1] Y. Luo, L. Qiao, M. Li, X. Wen, W. Zhang, and X. Li, “Global, regional, national epidemiology and trends of Parkinson’s disease from 1990 to 2021: findings from the Global Burden of Disease Study 2021,” Front. Aging Neurosci., vol. 16, no. January, pp. 1–12, 2024, doi: 10.3389/fnagi.2024.1498756.

[2] B. Egbo, Z. Nigmetolla, N. A. Khan, and P. K. Jamwal, “Explainable machine learning for early detection of Parkinson’s disease in aging populations using vocal biomarkers,” Front. Aging Neurosci., vol. 17, no. September, 2025, doi: 10.3389/fnagi.2025.1672971.

[3] M. A. Little, P. E. McSharry, E. J. Hunter, J. Spielman, and L. O. Ramig, “Suitability of dysphonia measurements for telemonitoring of Parkinson’s disease,” IEEE Trans. Biomed. Eng., vol. 56, no. 4, pp. 1–7, 2009, doi: 10.1109/TBME.2008.2005954.Suitability.

[4] R. Alshammri, G. Alharbi, E. Alharbi, and I. Almubark, “Machine learning approaches to identify Parkinson’s disease using voice signal features,” Front. Artif. Intell., vol. 6, 2023, doi: 10.3389/frai.2023.1084001.

[5] M. Meral, F. Ozbilgin, and F. Durmus, “Fine-Tuned Machine Learning Classifiers for Diagnosing Parkinson’s Disease Using Vocal Characteristics: A Comparative Analysis,” Diagnostics, vol. 15, no. 5, 2025, doi: 10.3390/diagnostics15050645.

[6] Md Abu Sayed et al., “Parkinson’s Disease Detection through Vocal Biomarkers and Advanced Machine Learning Algorithms,” Journal of Computer Science and Technology Studies, vol. 5, no. 4, pp. 142–149, 2023, doi: 10.32996/jcsts.2023.5.4.14.

[7] S. Sheikhi and M. T. Kheirabadi, “An Efficient Rotation Forest-Based Ensemble Approach for Predicting Severity of Parkinson’s Disease,” J. Healthc. Eng., vol. 2022, 2022, doi: 10.1155/2022/5524852.

[8] S. Srinivasan, P. Ramadass, S. K. Mathivanan, K. Panneer Selvam, B. D. Shivahare, and M. A. Shah, “Detection of Parkinson disease using multiclass machine learning approach,” Sci. Rep., vol. 14, no. 1, pp. 1–17, 2024, doi: 10.1038/s41598-024-64004-9.

[9] A. H. Rahayu and A. Sudrajat, “Improving Heart Disease Severity Prediction Using SMOTE for Imbalanced Data,” Journal of Applied Intelligent System, vol. 9, no. 2, pp. 250–259, 2024, doi: 10.62411/jais.v9i2.11180.

[10] H. Sedigh Malekroodi, B. Il Lee, and M. Yi, “Voice-Based Detection of Parkinson’s Disease Using Machine and Deep Learning Approaches: A Systematic Review,” Bioengineering, vol. 12, no. 11, pp. 1–29, 2025, doi: 10.3390/bioengineering12111279.

[11] D. Y. Saputra, L. K. Wardhani, and H. Nanang, “Multi-Class Fault Detection under Class-Imbalance in Wireless Sensor Network Using Random Undersampling and Extra Trees,” Media Jurnal Informatika, vol. 17, no. 2, pp. 367–378, 2025.

[12] G. Vandewiele et al., “Overly optimistic prediction results on imbalanced data: a case study of flaws and benefits when applying over-sampling,” Artif. Intell. Med., vol. 111, 2021, doi: 10.1016/j.artmed.2020.101987.

[13] S. S. Putra and D. Riminarsih, “Evaluating Machine Learning Models Across Feature Extraction and Data Balancing Scenarios for Coretax Sentiment Analysis,” Media Jurnal Informatika, vol. 17, no. 2, pp. 379–396, 2025.

[14] N. M. Nayan, A. M. Rana, M. M. Islam, J. Uddin, T. Yasmin, and J. Uddin, “An interpretable and balanced machine learning framework for Parkinson’s disease prediction using feature engineering and explainable AI,” PLoS One, vol. 20, no. 10 October, pp. 1–34, 2025, doi: 10.1371/journal.pone.0333418.

[15] N. A. Azhar, M. S. Mohd Pozi, A. M. Din, and A. Jatowt, “An investigation of SMOTE based methods for imbalanced datasets with data complexity analysis,” IEEE Trans. Knowl. Data Eng., vol. 35, no. 7, pp. 6651–6672, 2023, doi: 10.1109/TKDE.2022.3179381.

[16] T. Chen and C. Guestrin, “XGBoost: A Scalable Tree Boosting System,” KDD ’16: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, vol. KDD ’16, no. 10, pp. 785–794, 2016, doi: 10.1145/2939672.2939785.

[17] S. M. Lundberg and S. I. Lee, “A unified approach to interpreting model predictions,” Adv. Neural Inf. Process. Syst., vol. 2017-Decem, no. Section 2, pp. 4766–4775, 2017.

[18] F. Pedregosa et al., “Scikit-learn: Machine Learning in Python,” in Journal ofMachine Learning Research 12, vol. 12, 2011, pp. 633–642. [Online]. Available: https://link.springer.com/10.1007/978-3-031-40336-1_56

[19] S. Prusty, S. Patnaik, and S. K. Dash, “SKCV: Stratified K-fold cross-validation on ML classifiers for predicting cervical cancer,” Frontiers in Nanotechnology, vol. 4, no. August, pp. 1–12, 2022, doi: 10.3389/fnano.2022.972421.

[20] H. Hairani, T. Widiyaningtyas, and D. D. Prasetya, “Addressing Class Imbalance of Health Data: a Systematic Literature Review on Modified Synthetic Minority Oversampling Technique (SMOTE) Strategies,” International Journal on Informatics Visualization, vol. 8, no. 3, pp. 1310–1318, 2024, doi: 10.62527/joiv.8.3.2283.

Downloads

Published

2026-06-28

How to Cite

Azizah, W. M., & Irma Amelia Dewi. (2026). Parkinson’s Disease Classification Using Vocal Biomarkers, XGBoost, and SHAP. Media Jurnal Informatika, 18(1), 126–138. https://doi.org/10.35194/mji.v18i1.6458