2010 IEEE International Conference on Data Mining
Rare Category Characterization
Sydney, Australia
December 13-17, 2010
ISBN: 978-0-7695-4256-0
Rare categories abound, yet their characterization has so far received little attention. Fraudulent banking transactions, network intrusions, and rare diseases are examples of rare classes whose detection and characterization are of high value. However, accurate characterization is challenging because of high skewness and non-separability from the majority classes; for example, fraudulent transactions masquerade as legitimate ones. This paper proposes the RACH algorithm, which exploits the compactness property of rare categories. It is based on an optimization framework that encloses the rare examples in a minimum-radius hyperball. The framework is then converted into a convex optimization problem, which is in turn solved effectively in its dual form by the projected subgradient method. RACH can be naturally kernelized. Experimental results validate the effectiveness of RACH.
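To make the "minimum-radius hyperball solved in the dual" idea concrete, here is a minimal Python sketch of a much simpler, related problem: the plain minimum enclosing ball of a point set, solved in its standard dual over the probability simplex by projected gradient ascent. It is not the RACH algorithm, whose full objective is not given in this abstract. The function names, the step-size rule, and the iteration count are assumptions chosen for illustration, and because this toy dual is smooth, the subgradient reduces to the ordinary gradient.

```python
import math


def project_to_simplex(v):
    """Euclidean projection of v onto {a : a_i >= 0, sum(a) = 1} (sort-based)."""
    u = sorted(v, reverse=True)
    cumulative = 0.0
    tau = 0.0
    for k, u_k in enumerate(u, start=1):
        cumulative += u_k
        t = (cumulative - 1.0) / k
        if u_k - t > 0:
            tau = t  # the last k that satisfies the condition sets the threshold
    return [max(x - tau, 0.0) for x in v]


def min_enclosing_ball(points, iterations=5000):
    """Toy dual solver (illustrative only, not RACH).

    Maximizes  sum_i a_i <x_i, x_i> - sum_ij a_i a_j <x_i, x_j>
    over the simplex; the ball center is then sum_i a_i x_i.
    """
    n = len(points)
    # The dual touches the data only through inner products (a Gram matrix).
    gram = [[sum(p_d * q_d for p_d, q_d in zip(p, q)) for q in points]
            for p in points]
    trace = sum(gram[i][i] for i in range(n))
    # Assumed step size: 1 / (2 * trace) is a safe bound on 1 / Lipschitz constant.
    step = 1.0 / (2.0 * trace) if trace > 0 else 1.0
    alpha = [1.0 / n] * n
    for _ in range(iterations):
        k_alpha = [sum(gram[i][j] * alpha[j] for j in range(n)) for i in range(n)]
        grad = [gram[i][i] - 2.0 * k_alpha[i] for i in range(n)]
        # Ascent step followed by projection back onto the simplex.
        alpha = project_to_simplex([a + step * g for a, g in zip(alpha, grad)])
    dim = len(points[0])
    center = [sum(alpha[i] * points[i][d] for i in range(n)) for d in range(dim)]
    # Report the smallest radius around this center that covers every point.
    radius = max(math.sqrt(sum((c - x) ** 2 for c, x in zip(center, p)))
                 for p in points)
    return center, radius, alpha


if __name__ == "__main__":
    pts = [(0.0, 0.0), (4.0, 0.0), (1.0, 1.0), (3.0, 0.5), (2.0, -1.0)]
    center, radius, alpha = min_enclosing_ball(pts)
    print(center, radius)
```

Because the iteration only reads the Gram matrix, the same loop could in principle run on kernel values instead of raw inner products, which is one way to read the abstract's remark that RACH can be kernelized; recovering an explicit center, as done here, assumes the linear kernel.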
Index Terms:
rare category, minority class, characterization, compactness, optimization, hyperball, subgradient
Jingrui He, Hanghang Tong, Jaime Carbonell, "Rare Category Characterization," 2010 IEEE International Conference on Data Mining (ICDM), pp. 226-235, 2010