Third IEEE International Conference on Data Mining (ICDM'03)
Building Text Classifiers Using Positive and Unlabeled Examples
Melbourne, Florida
November 19-November 22
ISBN: 0-7695-1978-4
Bing Liu, University of Illinois at Chicago
Yang Dai, University of Illinois at Chicago
Xiaoli Li, National University of Singapore/Singapore-MIT Alliance
Wee Sun Lee, National University of Singapore/Singapore-MIT Alliance
This paper studies the problem of building text classifiers using positive and unlabeled examples. The key feature of this problem is that there is no negative example for learning. Recently, a few techniques for solving this problem were proposed in the literature. These techniques are based on the same idea, which builds a classifier in two steps. Each existing technique uses a different method for each step. In this paper, we first introduce some new methods for the two steps, and perform a comprehensive evaluation of all possible combinations of methods of the two steps. We then propose a more principled approach to solving the problem based on a biased formulation of SVM, and show experimentally that it is more accurate than the existing techniques.
Citation:
Bing Liu, Yang Dai, Xiaoli Li, Wee Sun Lee, Philip S. Yu, "Building Text Classifiers Using Positive and Unlabeled Examples," icdm, pp.179, Third IEEE International Conference on Data Mining (ICDM'03), 2003