|
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
|
| Volume 187 - Issue 64 |
| Published: December 2025 |
| Authors: A.S.M. Sabiqul Hassan, Tanzina Tazreen Meem, Md. Ruhul Amin, Tasniah Mohiuddin, Muhammed Samsuddoha Alam, Mst. Najnin Sultana |
10.5120/ijca2025926079
|
A.S.M. Sabiqul Hassan, Tanzina Tazreen Meem, Md. Ruhul Amin, Tasniah Mohiuddin, Muhammed Samsuddoha Alam, Mst. Najnin Sultana . A Machine Learning Approach for Optimized Heart Disease Diagnosis with SMOTE and Voting Classifiers. International Journal of Computer Applications. 187, 64 (December 2025), 30-36. DOI=10.5120/ijca2025926079
@article{ 10.5120/ijca2025926079,
author = { A.S.M. Sabiqul Hassan,Tanzina Tazreen Meem,Md. Ruhul Amin,Tasniah Mohiuddin,Muhammed Samsuddoha Alam,Mst. Najnin Sultana },
title = { A Machine Learning Approach for Optimized Heart Disease Diagnosis with SMOTE and Voting Classifiers },
journal = { International Journal of Computer Applications },
year = { 2025 },
volume = { 187 },
number = { 64 },
pages = { 30-36 },
doi = { 10.5120/ijca2025926079 },
publisher = { Foundation of Computer Science (FCS), NY, USA }
}
%0 Journal Article
%D 2025
%A A.S.M. Sabiqul Hassan
%A Tanzina Tazreen Meem
%A Md. Ruhul Amin
%A Tasniah Mohiuddin
%A Muhammed Samsuddoha Alam
%A Mst. Najnin Sultana
%T A Machine Learning Approach for Optimized Heart Disease Diagnosis with SMOTE and Voting Classifiers%T
%J International Journal of Computer Applications
%V 187
%N 64
%P 30-36
%R 10.5120/ijca2025926079
%I Foundation of Computer Science (FCS), NY, USA
Heart disease is globally considered a primary cause of a notable number of deaths. Each year, 17.9 million people die from heart disease, according to a report by the World Health Organization (WHO). In this study, a machine learning based optimal model has been developed that primarily includes SMOTE for handling class imbalance in the dataset, and ensemble learning strategies to improve the performance and reliability of heart disease diagnosis. This research work has been conducted on a publicly available dataset from the Kaggle online dataset repository, which includes relevant attributes for heart disease patients. Several base models: LR, KNN, DT, RF, and SVM have been trained for performance evaluation in terms of Accuracy, Precision, Recall, F1-score, and ROC-AUC values. SMOTE has been applied to address the class imbalance issue in the dataset and Soft Voting and Hard Voting classifiers have been used to optimize the model performance by combining all base classifiers. Finally, the Soft Voting classifier has achieved the optimal result: an Accuracy of 70.5%, Precision of 69.8%, Recall of 72.2%, F1-score of 71%, and ROC-AUC of 77%. This optimal model can be used as a decision making tool in the healthcare sector for the early diagnosis of heart diseases followed by necessary steps to prevent those diseases.