International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
|
Volume 182 - Issue 12 |
Published: Aug 2018 |
Authors: Priyanka Lahoti, Ajeet Kumar Rai |
![]() |
Priyanka Lahoti, Ajeet Kumar Rai . Imbalanced Data Classification using Sampling Techniques and XGBoost. International Journal of Computer Applications. 182, 12 (Aug 2018), 19-22. DOI=10.5120/ijca2018917735
@article{ 10.5120/ijca2018917735, author = { Priyanka Lahoti,Ajeet Kumar Rai }, title = { Imbalanced Data Classification using Sampling Techniques and XGBoost }, journal = { International Journal of Computer Applications }, year = { 2018 }, volume = { 182 }, number = { 12 }, pages = { 19-22 }, doi = { 10.5120/ijca2018917735 }, publisher = { Foundation of Computer Science (FCS), NY, USA } }
%0 Journal Article %D 2018 %A Priyanka Lahoti %A Ajeet Kumar Rai %T Imbalanced Data Classification using Sampling Techniques and XGBoost%T %J International Journal of Computer Applications %V 182 %N 12 %P 19-22 %R 10.5120/ijca2018917735 %I Foundation of Computer Science (FCS), NY, USA
While implementing any machine learning algorithms it is good to have the descriptive knowledge of the dataset. In any dataset, in case having more than 90% of the data in target variable is from class 1 and the remaining data is from class 2. In such type of dataset, error evaluation metric accuracy is not going to help much. Having the unknown dataset with only class 1 itself gives more than 90% accuracy, which shows accuracy as evaluation metric should be ignored. Such a problem with highly skewed target outcome is known as an Imbalanced classification problem. There is a number of techniques to deal with imbalanced dataset. In this paper, we are interested to see how sampling techniques and XGBoost can be used while working with the Imbalanced dataset.