International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
|
Volume 187 - Issue 19 |
Published: July 2025 |
Authors: Justine Aku Azigi, Frederick Adrah |
![]() |
Justine Aku Azigi, Frederick Adrah . Toward Smart Biosensing: A Machine Learning Approach for Early Diabetes Detection. International Journal of Computer Applications. 187, 19 (July 2025), 8-11. DOI=10.5120/ijca2025925268
@article{ 10.5120/ijca2025925268, author = { Justine Aku Azigi,Frederick Adrah }, title = { Toward Smart Biosensing: A Machine Learning Approach for Early Diabetes Detection }, journal = { International Journal of Computer Applications }, year = { 2025 }, volume = { 187 }, number = { 19 }, pages = { 8-11 }, doi = { 10.5120/ijca2025925268 }, publisher = { Foundation of Computer Science (FCS), NY, USA } }
%0 Journal Article %D 2025 %A Justine Aku Azigi %A Frederick Adrah %T Toward Smart Biosensing: A Machine Learning Approach for Early Diabetes Detection%T %J International Journal of Computer Applications %V 187 %N 19 %P 8-11 %R 10.5120/ijca2025925268 %I Foundation of Computer Science (FCS), NY, USA
Diabetes is a global metabolic disorder characterized by impaired glucose metabolism, leading to hyperglycemia and severe complications if untreated. With 1 in 10 Americans affected and rising incidence among youth, early detection is critical. Traditional diagnostic methods, though effective, face limitations in scalability and human error. This study proposes a machine learning (ML) framework for early diabetes prediction using the Behavioral Risk Factor Surveillance System (BRFSS) 2015 dataset (N=70,692), balanced with 50% diabetic cases. We analyze 22 features spanning clinical indicators (e.g., HighBP, HighChol, BMI), lifestyle factors (smoking, exercise), and socioeconomic variables (income, education). Feature engineering introduces interaction terms (BMI×GenHlth, Age×PhysHlth), aggregated chronic conditions, and binned health metrics. Correlation analysis reveals key predictors: HighBP (r=0.38), GenHlth (r=0.32), BMI (r=0.29), and Age (r=0.28), while physical activity and education exhibit protective effects (r=−0.16 to −0.22). Multi-collinearity is observed between health constructs (e.g., GenHlth–PhysHlth: r=0.55). Three ensemble models (Random Forest, XGBoost, LightGBM) consistently rank GenHlth, BMI, and chronic conditions as top predictors. Our approach demonstrates how engineered features enhance ML performance, offering a scalable tool for identifying at-risk individuals missed by conventional screening. This work underscores AI’s potential to transform diabetes surveillance through computational biosensing, bridging gaps in preventive healthcare.