The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.02 - February (2012 vol.34)
pp: 346-358
Yong Jae Lee , Dept. of Electr. & Comput. Eng., Univ. of Texas at Austin, Austin, TX, USA
K. Grauman , Dept. of Comput. Sci., Univ. of Texas at Austin, Austin, TX, USA
ABSTRACT
How can knowing about some categories help us to discover new ones in unlabeled images? Unsupervised visual category discovery is useful to mine for recurring objects without human supervision, but existing methods assume no prior information and thus tend to perform poorly for cluttered scenes with multiple objects. We propose to leverage knowledge about previously learned categories to enable more accurate discovery, and address challenges in estimating their familiarity in unsegmented, unlabeled images. We introduce two variants of a novel object-graph descriptor to encode the 2D and 3D spatial layout of object-level co--occurrence patterns relative to an unfamiliar region and show that by using them to model the interaction between an image's known and unknown objects, we can better detect new visual categories. Rather than mine for all categories from scratch, our method identifies new objects while drawing on useful cues from familiar ones. We evaluate our approach on several benchmark data sets and demonstrate clear improvements in discovery over conventional purely appearance-based baselines.
INDEX TERMS
ubiquitous computing, graph theory, object detection, benchmark data sets, object graphs, context aware visual category discovery, human supervision, cluttered scenes, leverage knowledge, object graph descriptor, 3D spatial layout, 2D spatial layout, Image segmentation, Object recognition, Visualization, Unsupervised learning, Layout, Three dimensional displays, unsupervised learning., Object recognition, context, category discovery
CITATION
Yong Jae Lee, K. Grauman, "Object-Graphs for Context-Aware Visual Category Discovery", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol.34, no. 2, pp. 346-358, February 2012, doi:10.1109/TPAMI.2011.122
REFERENCES
[1] K. Grauman and T. Darrell, "Unsupervised Learning of Categories from Sets of Partially Matching Image Features," Proc. IEEE CS Conf. Computer Vision and Pattern Recognition, 2006.
[2] B. Russell, A. Efros, J. Sivic, W. Freeman, and A. Zisserman, "Using Multiple Segmentations to Discover Objects and Their Extent in Image Collections," Proc. IEEE CS Conf. Computer Vision and Pattern Recognition, 2006.
[3] D. Liu and T. Chen, "Unsupervised Image Categorization and Object Localization Using Topic Models and Correspondences between Images," Proc. IEEE Int'l Conf. Computer Vision, 2007.
[4] G. Kim, C. Faloutsos, and M. Hebert, "Unsupervised Modeling of Object Categories Using Link Analysis Techniques," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2008.
[5] Y.J. Lee and K. Grauman, "Foreground Focus: Unsupervised Learning from Partially Matching Images," Int'l J. Computer Vision, vol. 85, pp. 143-166, 2009.
[6] A. Kaplan and G. Murphy, "The Acquisition of Category Structure in Unsupervised Learning," Memory and Cognition, vol. 27, pp. 699-712, 1999.
[7] R. Weischedel, "Adaptive Natural Language Processing," Proc. Workshop Speech and Natural Language, 1990.
[8] S. Gould, R. Fulton, and D. Koller, "Decomposing a Scene into Geometric and Semantically Consistent Regions," Proc. IEEE Int'l Conf. Computer Vision, 2009.
[9] Y.J. Lee and K. Grauman, "Object-Graphs for Context-Aware Category Discovery," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2010.
[10] D. Dueck and B. Frey, "Non-Metric Affinity Propagation for Unsupervised Image Categorization," Proc. IEEE Int'l Conf. Computer Vision, 2007.
[11] M. Cho, Y.M. Shin, and K.M. Lee, "Unsupervised Detection and Segmentation of Identical Objects," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2010.
[12] L. Fei-Fei, R. Fergus, and P. Perona, "A Bayesian Approach to Unsupervised One-Shot Learning of Object Categories," Proc. IEEE Int'l Conf. Computer Vision, 2003.
[13] E. Bart and S. Ullman, "Cross-Generalization: Learning Novel Classes from a Single Example by Feature Replacement," Proc. IEEE CS Conf. Computer Vision and Pattern Recognition, 2005.
[14] A. Torralba, "Contextual Priming for Object Detection," Int'l J. Computer Vision, vol. 53, pp. 169-191, 2003.
[15] X. He, R. Zemel, and M. Carreira-Perpinan, "Multiscale Conditional Random Fields for Image Labeling," Proc. IEEE CS Conf. Computer Vision and Pattern Recognition, 2004.
[16] J. Shotton, J. Winn, C. Rother, and A. Criminisi, "Textonboost: Joint Appearance, Shape and Context Modeling for Multi-Class Object Recognition and Segmentation," Proc. European Conf. Computer Vision, 2006.
[17] G. Heitz and D. Koller, "Learning Spatial Context: Using Stuff to Find Things," Proc. European Conf. Computer Vision, 2008.
[18] T. Malisiewicz and A. Efros, "Beyond Categories: The Visual Memex Model for Reasoning about Object Relationships," Proc. Neural Information Processing Systems, 2009.
[19] A. Singhal, J. Luo, and W. Zhu, "Probabilistic Spatial Context Models for Scene Content Understanding," Proc. IEEE CS Conf. Computer Vision and Pattern Recognition, 2003.
[20] C. Galleguillos, A. Rabinovich, and S. Belongie, "Object Categorization Using Co-Occurrence, Location and Appearance," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2008.
[21] Z. Tu, "Auto-Context and Its Application to High-Level Vision Tasks," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2008.
[22] D. Parikh, C.L. Zitnick, and T. Chen, "From Appearance to Context-Based Recognition: Dense Labeling in Small Images," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2008.
[23] S. Lazebnik and M. Raginsky, "An Empirical Bayes Approach to Contextual Region Classification," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2009.
[24] D. Hoiem, A.A. Efros, and M. Hebert, "Putting Objects in Perspective," Proc. IEEE CS Conf. Computer Vision and Pattern Recognition, 2006.
[25] D. Hoiem, A.N. Stein, A.A. Efros, and M. Hebert, "Recovering Occlusion Boundaries from a Single Image," Proc. IEEE Int'l Conf. Computer Vision, 2007.
[26] A. Saxena, M. Sun, and A. Ng, "Make3D: Learning 3D Scene Structure from a Single Still Image," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 31, no. 5, pp. 824-840, May 2009.
[27] D. Hoiem, A.A. Efros, and M. Hebert, "Geometric Context from a Single Image," Proc. IEEE Int'l Conf. Computer Vision, 2005.
[28] J. Shi and J. Malik, "Normalized Cuts and Image Segmentation," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 22, no. 8, pp. 888-905, Aug. 2000.
[29] T. Malisiewicz and A.A. Efros, "Improving Spatial Support for Objects via Multiple Segmentations," Proc. British Machine Vision Conf., 2007.
[30] F. Bach, G. Lanckriet, and M. Jordan, "Multiple Kernel Learning, Conic Duality, and the SMO Algorithm," Proc. Int'l Conf. Machine Learning, 2004.
[31] A. Vedaldi and S. Soatto, "Relaxed Matching Kernels for Object Recognition," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2008.
[32] T. Gartner, P. Flach, and S. Wrobel, "On Graph Kernels: Hardness Results and Efficient Alternatives," Proc. Ann. Conf. Computational Learning Theory, 2003.
[33] H. Kashima, K. Tsuda, and A. Inokuchi, "Kernels on Graphs," Kernels and Bioinformatics, 2004.
[34] A. Ng, M. Jordan, and Y. Weiss, "On Spectral Clustering: Analysis and an Algorithm," Proc. Neural Information Processing Systems, 2001.
[35] A. Bosch, A. Zisserman, and X. Munoz, "Representing Shape with a Spatial Pyramid Kernel," Proc. Int'l Conf. Image and Video Retrieval 2007.
[36] J. Platt, "Probabilistic Outputs for Support Vector Machines and Comparisons to Regularized Likelihood Methods," Advances in Large Margin Classifiers, MIT Press, 1999.
[37] P. Tan, M. Steinbach, and V. Kumar, Introduction to Data Mining. Pearson Addison Wesley, 2005.
[38] P. Arbelaez, M. Maire, C. Fowlkes, and J. Malik, "Contour Detection and Hierarchical Image Segmentation," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 33, no. 5, pp. 898-916, May 2011.
[39] Y.J. Lee and K. Grauman, "Collect-Cut: Segmentation with Top-Down Cues Discovered in Multi-Object Images," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2010.
[40] P. Arbelaez, M. Maire, C. Fowlkes, and J. Malik, "From Contours to Regions: An Empirical Evaluation," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2009.
6 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool