The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.10 - Oct. (2013 vol.35)
pp: 2442-2453
Gang Wang , Adv. Digital Sci. Center, Nanyang Technol. Univ., Singapore, Singapore
D. Forsyth , Dept. of Comput. Sci., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
D. Hoiem , Dept. of Comput. Sci., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
ABSTRACT
Due to the intrinsic long-tailed distribution of objects in the real world, we are unlikely to be able to train an object recognizer/detector with many visual examples for each category. We have to share visual knowledge between object categories to enable learning with few or no training examples. In this paper, we show that local object similarity information--statements that pairs of categories are similar or dissimilar--is a very useful cue to tie different categories to each other for effective knowledge transfer. The key insight: Given a set of object categories which are similar and a set of categories which are dissimilar, a good object model should respond more strongly to examples from similar categories than to examples from dissimilar categories. To exploit this category-dependent similarity regularization, we develop a regularized kernel machine algorithm to train kernel classifiers for categories with few or no training examples. We also adapt the state-of-the-art object detector to encode object similarity constraints. Our experiments on hundreds of categories from the Labelme dataset show that our regularized kernel classifiers can make significant improvement on object categorization. We also evaluate the improved object detector on the PASCAL VOC 2007 benchmark dataset.
INDEX TERMS
Training, Kernel, Detectors, Adaptation models, Support vector machines, Visualization, Object detection,sharing, Comparative object similarity, object categorization, object detection, kernel machines, SVM, deformable part model, PASCAL VOC
CITATION
Gang Wang, D. Forsyth, D. Hoiem, "Improved Object Categorization and Detection Using Comparative Object Similarity", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol.35, no. 10, pp. 2442-2453, Oct. 2013, doi:10.1109/TPAMI.2013.58
REFERENCES
[1] K. Barnard, P. Duygulu, D. Forsyth, N. de Freitas, D.M. Blei, and M.I. Jordan, "Matching Words and Pictures," The J. Machine Learning Research, vol. 3, pp. 1107-1135, 2003.
[2] E. Bart, I. Porteous, P. Perona, and M. Welling, "Unsupervised Learning of Visual Taxonomies," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2008.
[3] E. Bart and S. Ullman, "Cross-Generalization: Learning Novel Classes from a Single Example by Feature Replacement," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2005.
[4] L. Breiman, "Arcing Classifier," The Annals of Statistics, vol. 26, no. 3, pp. 801-849, 1998.
[5] N. Dalai, B. Triggs, I. Rhone-Alps, and F. Montbonnot, "Histograms of Oriented Gradients for Human Detection," Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 1, 2005.
[6] J. Deng, W. Dong, R. Socher, L.J. Li, K. Li, and L. Fei-Fei, "Imagenet: A Large-Scale Hierarchical Image Database," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2009.
[7] M. Everingham, L. Van Gool, C.K.I. Williams, J. Winn, and A. Zisserman, "The PASCAL Visual Object Classes Challenge 2007 (VOC2007) Results," 2007.
[8] A. Farhadi, I. Endres, D. Hoiem, and D. Forsyth, "Describing Objects by Their Attributes," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2009.
[9] L. Fei-Fei, R. Fergus, and P. Perona, "One-Shot Learning of Object Categories," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 28, no. 4, pp. 594-611, Apr. 2006.
[10] P. Felzenszwalb, R. Girshick, D. McAllester, and D. Ramanan, "Object Detection with Discriminatively Trained Part Based Models," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 32, no. 9, pp. 1627-1645, Sept. 2010.
[11] R. Fergus, Y. Weiss, and A. Torralba, "Semi-Supervised Learning in Gigantic Image Collections," Proc. Neural Information Processing Systems Conf., 2009.
[12] A. Frome, Y. Singer, F. Sha, and J. Malik, "Learning Globally Consistent Local Distance Functions for Shape-Based Image Retrieval an Classification," Proc. IEEE Int'l Conf. Computer Vision, 2007.
[13] R. Herbrich, T. Graepel, and K. Obermayer, "Large Margin Rank Boundaries for Ordinal Regression," Proc. ACM Conf. Knowledge Discovery and Data Mining, 2002.
[14] E. Hüllermeier, J. Fürnkranz, W. Cheng, and K. Brinker, "Label Ranking by Learning Pairwise Preferences," Artificial Intelligence, vol. 172, nos. 16/17, pp. 1897-1916, 2008.
[15] R. Salakhutdinov, J.J. Lim, and A. Torralba, "Transfer Learning by Borrowing Examples for Multiclass Object Detection," Proc. Neural Information Processing Systems Conf., 2011.
[16] D.W. Jacobs, D. Weinshall, and Y. Gdalyahu, "Classification with Nonmetric Distances: Image Retrieval and Class Representation," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 22, no. 6, pp. 583-600, June 2000.
[17] T. Joachims, "Optimizing Search Engines Using Clickthrough Data," Proc. Eighth ACM SIGKDD Int'l Conf. Knowledge Discovery and Data Mining, pp. 133-142, 2002.
[18] J. Kivinen, A.J. Smola, and R.C. Williamson, "Online Learning with Kernels," IEEE Trans. Signal Processing, vol. 52, no. 8, pp. 2165-2176, Aug. 2004.
[19] N. Kumar, A.C. Berg, P.N. Belhumeur, and S.K. Nayar, "Describable Visual Attributes for Face Verification and Image Search," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 33, no. 10, pp. 1962-1977, Oct. 2011.
[20] C.H. Lampert, H. Nickisch, and S. Harmeling, "Learning to Detect Unseen Object Classes by Between-Class Attribute Transfer," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2009.
[21] J. Langford, L. Li, and T. Zhang, "Sparse Online Learning via Truncated Gradient," The J. Machine Learning Research, vol. 10, pp. 777-801, 2009.
[22] S. Lazebnik, C. Schmid, and J. Ponce, "Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2006.
[23] T.Y. Liu, "Learning to Rank for Information Retrieval," Foundations and Trends in Information Retrieval, vol. 3, no. 3, pp. 225-331, 2009.
[24] D.G. Lowe, "Distinctive Image Features from Scale-Invariant Keypoints," Int'l J. Computer Vision, vol. 60, no. 2, pp. 91-110, 2004.
[25] E.G. Miller, N.E. Matsakis, and P.A. Viola, "Learning from One Example through Shared Densities on Transforms," Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 1, pp. 464-471, 2000.
[26] A. Opelt, A. Pinz, and A. Zisserman, "Incremental Learning of Object Detectors Using a Visual Shape Alphabet," Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 1, pp. 3-10, 2006.
[27] M. Palatucci, D. Pomerleau, G. Hinton, and T.M. Mitchell, "Zero-Shot Learning with Semantic Output Codes," Proc. Neural Information Processing Systems Conf., 2009.
[28] C.H. Papadimitriou and K. Steiglitz, Combinatorial Optimization: Algorithms and Complexity. Dover Publications, 1998.
[29] J.C. Platt, "Fast Training of Support Vector Machines Using Sequential Minimal Optimization," Advances in Kernel Methods: Support Vector Learning, 1999.
[30] B.C. Russell, A. Torralba, K.P. Murphy, and W.T. Freeman, "LabelMe: A Database and Web-Based Tool for Image Annotation," Int'l J. Computer Vision, vol. 77, pp. 157-173, 2008.
[31] R. Salakhutdinov, A. Torralba, and J. Tenenbaum, "Learning to Share Visual Appearance for Multiclass Object Detection," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2011.
[32] S. Shalev-Shwartz, Y. Singer, and N. Srebro, "Pegasos: Primal Estimated Sub-Gradient Solver for SVM," Proc. 24th Int'l Conf. Machine Learning, pp. 807-814, 2007.
[33] J. Sivic, B.C. Russell, A. Zisserman, W.T. Freeman, and A.A. Efros, "Unsupervised Discovery of Visual Object Class Hierarchies," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2008.
[34] A. Torralba, K. Murphy, and W.T. Freeman, "Sharing Features: Efficient Boosting Procedures for Multiclass Object Detection," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2004.
[35] A. Tversky et al., "Features of Similarity," Psychological Rev., vol. 84, pp. 327-352, 1977.
[36] P. Viola and M. Jones, "Rapid Object Detection Using a Boosted Cascade of Simple," Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 1, 2001.
[37] G. Wang and D. Forsyth, "Joint Learning of Visual Attributes Object Classes and Visual Saliency," Proc. IEEE Int'l Conf. Computer Vision, 2009.
[38] G. Wang, D. Forsyth, and D. Hoiem, "Comparative Object Similar for Improved Recognition with Few or Zero Examples," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2010.
[39] G. Wang, D. Hoiem, and D. Forsyth, "Building Text Features for Object Image Classification," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2009.
[40] G. Wang, D. Hoiem, and D. Forsyth, "Learning Image Similarity from Flickr Groups Using Stochastic Intersection Kernel Machines," Proc. IEEE Int'l Conf. Computer Vision, 2009.
[41] K.Q. Weinberger and L.K. Saul, "Distance Metric Learning for Large Margin Nearest Neighbor Classification," The J. Machine Learning Research, vol. 10, pp. 207-244, 2009.
[42] E.P. Xing, A.Y. Ng, M.I. Jordan, and S. Russell, "Distance Metric Learning with Application to Clustering with Side-Information," Proc. Neural Information Processing Systems Conf., pp. 521-528, 2003.
[43] J. Xu and H. Li, "Adarank: A Boosting Algorithm for Information Retrieval," Proc. 30th Ann. Int'l ACM SIGIR Conf. Research and Development in Information Retrieval, pp. 391-398, 2007.
[44] J. Zhang, M. Marszalek, S. Lazebnik, and C. Schmid, "Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study," Int'l J. Computer Vision, vol. 73, no. 2, pp. 213-238, 2007.
26 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool