loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Sixth IEEE International Conference on Data Mining (ICDM'06)
A Balanced Ensemble Approach to Weighting Classifiers for Text Classification
Hong Kong
December 18-December 22
ISBN: 0-7695-2701-9
Gabriel Pui Cheong Fung, The Chinese University of Hong Kong, Hong Kong
Jeffrey Xu Yu, The Chinese University of Hong Kong, Hong Kong
Haixun Wang, IBM T.J. Watson Research Center, USA
David W. Cheung, The University of Hong Kong, Hong Kong
Huan Liu, Arizona State University, USA
This paper studies the problem of constructing an effective heterogeneous ensemble classifier for text classification. One major challenge of this problem is to formulate a good combination function, which combines the decisions of the individual classifiers in the ensemble. We show that the classification performance is affected by three weight components and they should be included in deriving an effective combination function. They are: (1) Global effectiveness, which measures the effectiveness of a member classifier in classifying a set of unseen documents; (2) Local effectiveness, which measures the effectiveness of a member classifier in classifying the particular domain of an unseen document; and (3) Decision confidence, which describes how confident a classifier is when making a decision when classifying a specific unseen document. We propose a new balanced combination function, called Dynamic Classifier Weighting (DCW), that incorporates the afore-mentioned three components. The empirical study demonstrates that the new combination function is highly effective for text classification.
Citation:
Gabriel Pui Cheong Fung, Jeffrey Xu Yu, Haixun Wang, David W. Cheung, Huan Liu, "A Balanced Ensemble Approach to Weighting Classifiers for Text Classification," icdm, pp.869-873, Sixth IEEE International Conference on Data Mining (ICDM'06), 2006
Usage of this product signifies your acceptance of the Terms of Use.