The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.02 - Feb. (2013 vol.25)
pp: 374-386
Timothy M. Hospedales , Queen Mary University of London, London
Shaogang Gong , Queen Mary University of London, London
Tao Xiang , Queen Mary University of London, London
ABSTRACT
Discovering rare categories and classifying new instances of them are important data mining issues in many fields, but fully supervised learning of a rare class classifier is prohibitively costly in labeling effort. There has therefore been increasing interest both in active discovery: to identify new classes quickly, and active learning: to train classifiers with minimal supervision. These goals occur together in practice and are intrinsically related because examples of each class are required to train a classifier. Nevertheless, very few studies have tried to optimise them together, meaning that data mining for rare classes in new domains makes inefficient use of human supervision. Developing active learning algorithms to optimise both rare class discovery and classification simultaneously is challenging because discovery and classification have conflicting requirements in query criteria. In this paper, we address these issues with two contributions: a unified active learning model to jointly discover new categories and learn to classify them by adapting query criteria online; and a classifier combination algorithm that switches generative and discriminative classifiers as learning progresses. Extensive evaluation on a batch of standard UCI and vision data sets demonstrates the superiority of this approach over existing methods.
INDEX TERMS
Uncertainty, Data models, Adaptation models, Training, Support vector machines, Entropy, Approximation methods, discriminative models, Active learning, rare class discovery, imbalanced learning, classification, generative models
CITATION
Timothy M. Hospedales, Shaogang Gong, Tao Xiang, "Finding Rare Classes: Active Learning with Generative and Discriminative Models", IEEE Transactions on Knowledge & Data Engineering, vol.25, no. 2, pp. 374-386, Feb. 2013, doi:10.1109/TKDE.2011.231
REFERENCES
[1] D. Pelleg and A. Moore, "Active Learning for Anomaly and Rare-Category Detection," Proc. Neural Information Processing Systems, 2004.
[2] S. Bay, K. Kumaraswamy, M.G. Anderle, R. Kumar, and D.M. Steier, "Large Scale Detection of Irregularities in Accounting Data," Proc. Int'l Conf. Data Mining, pp. 75-86, 2006.
[3] J.W. Stokes, J.C. Platt, J. Kravis, and M. Shilman, "Aladin: Active Learning of Anomalies to Detect Intrusions," Technical Report 2008-24, MSR, 2008.
[4] T. Hospedales, S. Gong, and T. Xiang, "A Markov Clustering Topic Model for Behaviour Mining in Video," Proc. IEEE Int'l Conf. Computer Vision, 2009.
[5] T. Hospedales, J. Li, S. Gong, and T. Xiang, "Identifying Rare and Subtle Behaviours: A Weakly Supervised Joint Topic Model," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 33, no. 12, pp. 2451-2464, Dec. 2011.
[6] B. Settles, "Active Learning Literature Survey," Technical Report 1648, Univ. Wisconsin-Madison, 2009.
[7] M. Wang and X.-S. Hua, "Active Learning in Multimedia Annotation and Retrieval: A Survey," ACM Trans. Intelligent Systems and Technology, vol. 2, pp. 10:1-10:21, 2011.
[8] A. Ng and M. Jordan, "On Discriminative vs. Generative Classifiers: A Comparison of Logistic Regression and Naive Bayes," Proc. Neural Information Processing Systems, 2001.
[9] V.J. Hodge and J. Austin, "A Survey of Outlier Detection Methodologies," Artificial Intelligence Rev., vol. 22, pp. 85-126, 2004.
[10] R. Sillito and R. Fisher, "Incremental One-Class Learning with Bounded Computational Complexity," Proc. Int'l Conf. Artificial Neural Networks, 2007.
[11] P. Jain and A. Kapoor, "Active Learning for Large Multi-Class Problems," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 762-769, 2009.
[12] S. Tong and D. Koller, "Support Vector Machine Active Learning with Applications to Text Classification," Proc. Int'l Conf. Machine Learning, 2000.
[13] H.S. Seung, M. Opper, and H. Sompolinsky, "Query by Committee," Proc. Int'l Conf. Learning Theory, 1992.
[14] N. Roy and A. McCallum, "Toward Optimal Active Learning through Sampling Estimation of Error Reduction," Proc. Int'l Conf. Machine Learning, pp. 441-448, 2001.
[15] D.A. Cohn, Z. Ghahramani, and M.I. Jordan, "Active Learning with Statistical Models," J. Artificial Intelligence Research, vol. 4, pp. 129-145, 1996.
[16] M. Wang, X.-S. Hua, Y. Song, J. Tang, and L.-R. Dai, "Multi-Concept Multi-Modality Active Learning for Interactive Video Annotation," Proc. Int'l Conf. Semantic Computing (ICSC), 2007.
[17] Y. Baram, R. El-Yaniv, and K. Luz, "Online Choice of Active Learning Algorithms," J. Machine Learning Research, vol. 5, pp. 255-291, 2004.
[18] J. He and J. Carbonell, "Nearest-Neighbor-Based Active Learning for Rare Category Detection," Proc. Neural Information Processing Systems, 2007.
[19] P. Vatturi and W.-K. Wong, "Category Detection Using Hierarchical Mean Shift," Proc. ACM SIGKDD Int'l Conf. Knowledge Discovery and Data Mining, pp. 847-856, 2009.
[20] H. Huang, Q. He, J. He, and L. Ma, "Radar: Rare Category Detection via Computation of Boundary Degree," Proc. Pacific-Asia Conf. Knowledge Discovery and Data Mining (PAKDD), 2011.
[21] J.-H. Xue and D.M. Titterington, "Comment on "Discriminative versus Generative Classifiers: A Comparison of Logistic Regression and Naive Bayes"," Neural Processing Letters, vol. 28, no. 3, pp. 169-187, 2008.
[22] T. Deselaers, G. Heigold, and H. Ney, "Object Classification by Fusing Svms and Gaussian Mixtures," Pattern Recognition, vol. 43, no. 7, pp. 2476-2484, 2010.
[23] C. Campbell, N. Cristianini, and A. Smola, "Query Learning with Large Margin Classifiers," Proc. Int'l Conf. Machine Learning, 2000.
[24] D. Shen, J. Zhang, J. Su, G. Zhou, and C.-L. Tan, "Multi-Criteria-Based Active Learning for Named Entity Recognition," Proc. 42nd Ann. Meeting Assoc. Computational Linguistics, 2004.
[25] P. Donmez, J.G. Carbonell, and P.N. Bennett, "Dual Strategy Active Learning," Proc. European Conf. Machine Learning, 2007.
[26] N. Cebron and M.R. Berthold, "Active Learning for Object Classification: From Exploration to Exploitation," Data Mining and Knowledge Discovery, vol. 18, no. 2, pp. 283-299, 2009.
[27] J. Goldberger and S. Roweis, "Hierarchical Clustering of a Mixture Model," Proc. Neural Information Processing Systems, 2004.
[28] T.-F. Wu, C.-J. Lin, and R.C. Weng, "Probability Estimates for Multi-Class Classification by Pairwise Coupling," J. Machine Learning Research, vol. 5, pp. 975-1005, 2004.
[29] J. Kittler, M. Hatef, R.P.W. Duin, and J. Matas, "On Combining Classifiers," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 20, no. 3, pp. 226-239, Mar. 1998.
[30] A. Asuncion and D. Newman, "UCI Machine Learning Repository," http://www.ics.uci.eduml/, 2007.
[31] S. Ertekin, J. Huang, L. Bottou, and L. Giles, "Learning on the Border: Active Learning in Imbalanced Data Classification," Proc. ACM Conf. Conf. Information and Knowledge Management, 2007.
[32] J. Han and B. Bhanu, "Individual Recognition Using Gait Energy Image," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 28, no. 2, pp. 316-322, Feb. 2006.
[33] R. Raina, Y. Shen, A. Ng, and A. McCallum, "Classification with Hybrid Generative/Discriminative Models," Proc. Neural Information Processing Systems, 2003.
[34] A.D. Holub, M. Welling, and P. Perona, "Combining Generative Models and Fisher Kernels for Object Recognition," Proc. IEEE Int'l Conf. Computer Vision, pp. 136-143, 2005.
[35] T.S. Jaakkola and D. Haussler, "Exploiting Generative Models in Discriminative Classifiers," Proc. Neural Information Processing Systems, pp. 487-493, 1998.
[36] H. He and E. Garcia, "Learning from Imbalanced Data," IEEE Trans. Data and Knowledge Eng., vol. 21, no. 9, pp. 1263-1284, Sept. 2009.
[37] C.C. Loy, T. Xiang, and S. Gong, "Stream Based Active Anomaly Detection," Proc. Asian Conf. Computer Vision, 2010.
18 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool