|
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
|
| Volume 187 - Issue 61 |
| Published: December 2025 |
| Authors: Yogita Soni, Ritesh Kumar Yadav |
10.5120/ijca2025925986
|
Yogita Soni, Ritesh Kumar Yadav . Analysis Diabetes Disease Prediction for Healthcare System Using Machine Learning Technique. International Journal of Computer Applications. 187, 61 (December 2025), 36-39. DOI=10.5120/ijca2025925986
@article{ 10.5120/ijca2025925986,
author = { Yogita Soni,Ritesh Kumar Yadav },
title = { Analysis Diabetes Disease Prediction for Healthcare System Using Machine Learning Technique },
journal = { International Journal of Computer Applications },
year = { 2025 },
volume = { 187 },
number = { 61 },
pages = { 36-39 },
doi = { 10.5120/ijca2025925986 },
publisher = { Foundation of Computer Science (FCS), NY, USA }
}
%0 Journal Article
%D 2025
%A Yogita Soni
%A Ritesh Kumar Yadav
%T Analysis Diabetes Disease Prediction for Healthcare System Using Machine Learning Technique%T
%J International Journal of Computer Applications
%V 187
%N 61
%P 36-39
%R 10.5120/ijca2025925986
%I Foundation of Computer Science (FCS), NY, USA
This study compares traditional statistical models, tree-based ensembles, and neural networks to examine machine-learning methods for Type-2 diabetes prediction in healthcare systems. We employ stringent preprocessing (missing-value techniques, feature engineering, imbalance handling) on public and clinical datasets (e.g., Pima Indians, regional EHR cohorts), and we assess models using nested cross-validation and measures that are resistant to class imbalance (ROC-AUC, F1, recall, MCC). Top predictors (HbA1c, fasting glucose, BMI, age, and waist circumference) are identified using explainability techniques (SHAP/feature permutation), and clinical value is evaluated using calibration and decision-curve analysis. We demonstrate model fairness across age/sex subgroups and suggest an ensemble stacking process (base learners: logistic regression, Random Forest, XGBoost, LightGBM; meta learner: calibrated logistic regression). In addition to offering suggestions for incorporation into EHR decision support with privacy and bias prevention principles, the results will quantify tradeoffs between accuracy, interpretability, and clinical readiness.