探花系列

This website stores cookies on your computer. These cookies are used to collect information about how you interact with our website and allow us to remember your browser. We use this information to improve and customize your browsing experience, for analytics and metrics about our visitors both on this website and other media, and for marketing purposes. By using this website, you accept and agree to be bound by UVic鈥檚 Terms of Use and Protection of Privacy Policy.聽聽If you do not agree to the above, you can configure your browser鈥檚 setting to 鈥渄o not track.鈥

Skip to main content

Sadam Hussain

Notice of the Final Oral Examination for the Degree of Master of Applied Science

Topic

Explainable Machine Learning for Diabetes Prediction

Department of Electrical and Computer Engineering

Date & location

  • Wednesday, March 4, 2026
  • 1:30 P.M.
  • Engineering Office Wing, Room 430

Examining Committee

Supervisory Committee

  • Dr. T. Aaron Gulliver, Department of Electrical and Computer Engineering, 探花系列 (Supervisor)
  • Dr. Mihai Sima, Department of Electrical and Computer Engineering, UVic (Member)

External Examiner

  • Dr. Daniela Constantinescu, Department of Mechanical Engineering, UVic

Chair of Oral Examination

  • Dr. Tao Wang, Department of Economics, UVic

Abstract

Diabetes is a growing global health concern, contributing to significant morbidity, mortality, and long-term economic burden. Machine Learning (ML) methods are increasingly applied to diabetes prediction, however, selecting appropriate classifiers and understanding the key features driving model decisions remain essential for reliable and clinically acceptable performance. This is particularly important in healthcare settings where clinicians may have limited familiarity with ML techniques and where transparency and trust in predictive outputs are critical. This study evaluates eight ML classifiers, Logistic Regression (LR), Random Forest (RF), Gradient Boosting (GB), Support Vector Machine (SVM), K-Nearest Neighbors (KNN), AdaBoost (AB), Decision Tree (DT) and a Neural Network (NN) using a dataset of 100,000 patient records for diabetes prediction. Models are evaluated using various configurations which includes baseline training and hyperparameter optimization using RandomizedSearchCV. The global and local interpretability is examined using SHapley Additive exPlanations (SHAP), Local Interpretable Model-Agnostic Explanations (LIME) and Explain Like I’m 5 (ELI5) to identify the most influential features contributing to predictions. These findings show that ensemble based models achieve strongest predictive performance with RF and GB outperforming other evaluated classifiers. Interpretability analyses consistently highlight that Hamoglobin A1c (HbA1c), blood glucose, Body Mass Index (BMI) and age are the dominant predictive features. A final evaluation using a reduced feature set derived with the help of Explainable AI (XAI) demonstrates that strong predictive accuracy can be maintained while improving model simplicity and interpretability. This work underscores the importance of combining the ML performance with transparent feature explanations in order to support trustworthy and clinically meaningful decision support systems for diabetes prediction.