loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Seventh IEEE International Conference on Data Mining Workshops (ICDMW 2007)
Skewed Class Distributions and Mislabeled Examples
Omaha, Nebraska, USA
October 28-October 31
ISBN: 0-7695-3033-8
Both imbalanced data and class noise are problems which have received attention in data mining research, how- ever learning from imbalanced data with labeling errors has not been adequately addressed. We present system- atic experimentation on imbalanced datasets with simulated class noise and evaluate the impact on various classifica- tion algorithms. Our results show that class noise is a sig- nificant detriment to learning from skewed data, but more importantly, we demonstrate that the class in which the noise is located is critical. This has significant repercus- sions for noise treatment procedures, which often handle noise equally in both classes. In addition, an examination of 11 classifiers demonstrates that the learners react very differently when confronted with class noise.
Citation:
Jason Van Hulse, Taghi M. Khoshgoftaar, Amri Napolitano, "Skewed Class Distributions and Mislabeled Examples," icdmw, pp.477-482, Seventh IEEE International Conference on Data Mining Workshops (ICDMW 2007), 2007
Usage of this product signifies your acceptance of the Terms of Use.