loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Using One-Class and Two-Class SVMs for Multiclass Image Annotation
October 2005 (vol. 17 no. 10)
pp. 1333-1346
We propose using one-class, two-class, and multiclass SVMs to annotate images for supporting keyword retrieval of images. Providing automatic annotation requires an accurate mapping of images' low-level perceptual features (e.g., color and texture) to some high-level semantic labels (e.g., landscape, architecture, and animals). Much work has been performed in this area; however, there is a lack of ability to assess the quality of annotation. In this paper, we propose a confidence-based dynamic ensemble (CDE), which employs a three-level classification scheme. At the base-level, CDE uses one-class Support Vector Machines (SVMs) to characterize a confidence factor for ascertaining the correctness of an annotation (or a class prediction) made by a binary SVM classifier. The confidence factor is then propagated to the multiclass classifiers at subsequent levels. CDE uses the confidence factor to make dynamic adjustments to its member classifiers so as to improve class-prediction accuracy, to accommodate new semantics, and to assist in the discovery of useful low-level features. Our empirical studies on a large real-world data set demonstrate CDE to be very effective.

[1] 1333 E.L. Allwein , R.E. Schapire , and Y. Singer , “Reducing Multiclass to Binary: A Unifying Approach for Margin Classifiers,” J. Machine Learning Research, vol. 1, 2000.[2] A.B. Benitez and S.-F. Chang , “Semantic Knowledge Construction from Annotated Image Collection,” Proc. IEEE Int'l Conf. Multimedia, Aug. 2002.[3] D. Bouchaffra , V. Govindaraju , and S.N. Srihari , “A Methodology for Mapping Scores to Probabilities,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 21, no. 9, pp. 923-927, 1999. [4] L. Breiman , “Bagging Predicators,” Machine Learning, pp. 123–140, 1996.[5] C. Burges , “A Tutorial on Support Vector Machines for Pattern Recognition,” Data Mining and Knowledge Discovery, vol. 2, pp. 121-167, 1998.[6] E. Chang , K. Goh , G. Sychay , and G. Wu , “Content-Based Soft Annotation for Multimodal Image Retrieval Using Bayes Point Machines,” IEEE Trans. Circuits and Systems for Video Technology, special issue on conceptual and dynamical aspects of multimedia content description, vol. 13, no. 1, pp. 26-38, 2003. [7] S.-F. Chang , W. Chen , and H. Sundaram , “Semantic Visual Templates: Linking Visual Features to Semantics,” Proc. IEEE Int'l Conf. Image Processing, 1998.[8] C.K. Chow , “On Optimum Recognition Error and Reject Tradeoff,” IEEE Trans. Information Theory, vol. 16, no. 1, pp. 41-46, 1970. [9] R. Collobert and S. Bengio , “SVMtorch: Support Vector Machines for Large-Scale Regression Problems,” J. Machine Learning Research, vol. 1, pp. 143-160, 2001. [10] T. Dietterich and G. Bakiri , “Solving Multiclass Learning Problems via Error-Correcting Output Codes,” J. Artifical Intelligence Research, vol. 2, 1995.[11] J. Fan , Y. Gao , and H. Luo , “Multi-Level Annotation of Natural Scenes Using Dominant Image Components and Semantic Concepts,” Proc. ACM Int'l Conf. Multimedia, Oct. 2004.[12] K. Fukunaga , Introduction to Statistical Pattern Recognition, second ed. Boston, Mass.: Academic Press, 1990.[13] K. Goh , E. Chang , and K.T. Cheng , “SVM Binary Classifier Ensembles for Image Classification,” Proc. ACM Conf. Information and Knowledge Management, pp. 395-402, Nov. 2001.[14] T. Hastie and R. Tibshirani , “Classification by Pairwise Coupling,” Advances in Neural Information Processing Systems, M.I. Jordan, M.J. Kearns, and S.A. Solla, eds., vol. 10, The MIT Press, 1998.[15] X. He , W.-Y. Ma , O. King , M. Li , and H. Zhang , “Learning and Inferring a Semantic Space from User's Relevance Feedback for Image Retrieval,” Proc. ACM Int'l Conf. Multimedia, pp. 343-347, Dec. 2002.[16] J. Li and J.Z. Wang , “Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 25, no. 2, Feb. 2003.[17] P. Lipson , “Context and Configuration Based Scene Classification,” Phd Dissertation, MIT EECS Dept., Sept. 1996.[18] M. Moreira and E. Mayoraz , “Improving Pairwise Coupling Classification with Error Correcting Classifiers,” Proc. 10th European Conf. Machine Learning, Apr. 1998.[19] J. Platt , “Probabilistic Outputs for SVMs and Comparisons to Regularized Likelihood Methods,” Advances in Large Margin Classifiers. MIT Press, 1999.[20] J. Platt , N. Cristianini , and J. Shawe-Taylor , “Large Margin Dags for Multiclass Classification,” Advances in Neural Information Processing Systems, vol. 12, pp. 547-553, MIT Press, 2000.[21] P. Poddar and P. Rao , “Hierarchical Ensemble of Neural Networks,” Proc. Int'l Conf. Neural Networks, vol. 1, 1993. [22] G. Ritter and M.T. Gallegos , “Outliers in Statistical Pattern Recognition and an Application to Automatic Chromosome Classification,” Pattern Recognition Letters, vol. 18, pp. 525-539, 1997. [23] C. Rodriguez , J. Muguerza , M. Navarro , A. Zarate , J. Martin , and J. Perez , “A Two-Stage Classifier for Broken and Blurred Digits in Forms,” Proc. Int'l Conf. Pattern Recognition, vol. 2, pp. 1101-1105, 1998. [24] R.F. Schapire and Y. Singer , “Improved Boosting Algorithms Using Confidence-Rated Predictions,” Proc. 11th Ann. Conf. Computational Learning Theory, pp. 80-91, July 1998.[25] B. Scholkopf , J.C. Platt , J. Shawe-Taylor , A.J. Smola , and R.C. Williamson , “Estimating the Support of a High-Dimensional Distribution,” Technical Report MSR-TR-99-87, Microsoft, Nov. 1999.[26] B. Scholkopf , R.C. Williamson , A.J. Smola , J. Shawe-Taylor , and J.C. Platt , “Support Vector Method for Novelty Detection,” Advances in Neural Information Processing Systems, S.A. Solla, T.K. Leen, and K.R. Miller, eds., vol. 12, The MIT Press, 2000.[27] H.T. Shen , B.C. Ooi , and K.L. Tan , “Giving Meanings to WWW Images,” Proc. ACM Multimedia, pp. 39-48, Nov. 2000.[28] J.R. Smith and S.-F. Chang , “Multi-Stage Classification of Images from Features and Related Text,” Proc. Fourth DELOS Workshop, Aug. 1997.[29] R. Srihari , Z. Zhang , and A. Rao , “Intelligent Indexing and Semantic Retrieval of Multimodal Documents,” Information Retrieval, vol. 2, pp. 245-275, 2000.[30] D.M.J. Tax and R.P.W. Duin , “Data Domain Description by Support Vectors,” Proc. European Symp. Artificial Neural Networks, pp. 251-256, Apr. 1999.[31] S. Tong and E. Chang , “Support Vector Machine Active Learning for Image Retrieval,” Proc. ACM Int'l Conf. Multimedia, Oct. 2001.[32] V. Vapnik , The Nature of Statistical Learning Theory. New York: Springer, 1995.[33] V. Vapnik , Statistical Learning Theory. Wiley, 1998.[34] J. Wang , J. Li , and G. Wiederhold , “Simplicity: Semantics-Sensitive Integrated Matching for Picture Libraries,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 23, no. 9, pp. 947-963, 2001. [35] J.Z. Wang and J. Li , “Learning-Based Linguistic Indexing of Pictures with 2-D MHMMs,” Proc. ACM Multimedia, pp. 436-445, Dec. 2002.[36] L. Wenyin , S. Dumais , Y. Sun , H. Zhang , M. Czerwinski , and B. Field , “Semi-Automatic Image Annotation,” Proc. Interact 2001: Conf. Human-Computer Interaction, pp. 326-333, July 2001.[37] G. Wu and E. Chang , “Adaptive Feature-Space Conformal Transformation for Learning Imbalanced Data,” Proc. Int'l Conf. Machine Learning, Aug. 2003.[38] H. Wu , M. Li , H. Zhang , and W.-Y. Ma , “Improving Image Retrieval with Semantic Classification Using Relevance Feedback,” Proc. Sixth Conf. Visual Database Systems, pp. 327-339, 2002.[39] X.Z.Y. Chen and T.S. Huang , “One-Class SVM for Learning in Image Retrieval,” Proc. IEEE Int'l Conf. Image Processing, 2001.[40] K. Goh , B. Li , and E.Y. Chang , “Semantics and feature Discovery via Confidence-Based Dynamic Ensemble,” ACM Trans. Multimedia, vol. 1, no. 2, pp. 168-189, 2005. [41] B. Li , K. Goh , and E.Y. Chang , “Confidence-Based Dynamic Ensemble for Image Annotation and Semantics Discovery,” Proc. Int'l Conf. Multimedia, 2003.

Index Terms:
Index Terms- Pattern recognition, models, statistical, artificial intelligence, learning.
Citation:
King-Shy Goh, Edward Y. Chang, Beitao Li, "Using One-Class and Two-Class SVMs for Multiclass Image Annotation," IEEE Transactions on Knowledge and Data Engineering, vol. 17, no. 10, pp. 1333-1346, Oct. 2005, doi:10.1109/TKDE.2005.170
Usage of this product signifies your acceptance of the Terms of Use.