loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Sixth IEEE International Conference on Data Mining (ICDM'06)
Exploratory Under-Sampling for Class-Imbalance Learning
Hong Kong
December 18-December 22
ISBN: 0-7695-2701-9
Xu-Ying Liu, Nanjing University, China
Jianxin Wu, Georgia Institute of Technology, USA
Zhi-Hua Zhou, Nanjing University, China
Under-sampling is a class-imbalance learning method which uses only a subset of major class examples and thus is very efficient. The main deficiency is that many major class examples are ignored. We propose two algorithms to overcome the deficiency. EasyEnsemble samples several subsets from the major class, trains a learner using each of them, and combines the outputs of those learners. BalanceCascade is similar to EasyEnsemble except that it removes correctly classified major class examples of trained learners from further consideration. Experiments show that both of the proposed algorithms have better AUC scores than many existing class-imbalance learning methods. Moreover, they have approximately the same training time as that of under-sampling, which trains significantly faster than other methods.
Citation:
Xu-Ying Liu, Jianxin Wu, Zhi-Hua Zhou, "Exploratory Under-Sampling for Class-Imbalance Learning," icdm, pp.965-969, Sixth IEEE International Conference on Data Mining (ICDM'06), 2006
Usage of this product signifies your acceptance of the Terms of Use.