The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.09 - Sept. (2012 vol.34)
pp: 1758-1772
A. Y. Chia , Institutefor Infocomm Res., Singapore, Singapore
D. Rajan , Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
M. K. Leung , Dept. of Comput. Sci., Univ. Tunku Abdul Rahman, Kampar, Malaysia
S. Rahardja , Institutefor Infocomm Res., Singapore, Singapore
ABSTRACT
We present a novel contour-based approach that recognizes object classes in real-world scenes using simple and generic shape primitives of line segments and ellipses. Compared to commonly used contour fragment features, these primitives support more efficient representation since their storage requirements are independent of object size. Additionally, these primitives are readily described by their geometrical properties and hence afford very efficient feature comparison. We pair these primitives as shape-tokens and learn discriminative combinations of shape-tokens. Here, we allow each combination to have a variable number of shape-tokens. This, coupled with the generic nature of primitives, enables a variety of class-specific shape structures to be learned. Building on the contour-based method, we propose a new hybrid recognition method that combines shape and appearance features. Each discriminative combination can vary in the number and the types of features, where these two degrees of variability empower the hybrid method with even more flexibility and discriminative potential. We evaluate our methods across a large number of challenging classes, and obtain very competitive results against other methods. These results show the proposed shape primitives are indeed sufficiently powerful to recognize object classes in complex real-world scenes.
INDEX TERMS
shapes (structures), image classification, image representation, object detection, object recognition, object detection, line segment, ellipse, appearance feature, contour-based approach, object classes recognition, generic shape primitive, representation efficiency, storage requirement, geometrical property, shape-token, shape structure, hybrid recognition method, image classification, Shape, Feature extraction, Training, Image edge detection, Vectors, Robustness, Image segmentation, category-level object detection., Shape primitives, appearance features, image classification
CITATION
A. Y. Chia, D. Rajan, M. K. Leung, S. Rahardja, "Object Recognition by Discriminative Combinations of Line Segments, Ellipses, and Appearance Features", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol.34, no. 9, pp. 1758-1772, Sept. 2012, doi:10.1109/TPAMI.2011.220
REFERENCES
[1] A.-S. Chia, S. Rahardja, D. Rajan, and M. Leung, "Structural Descriptors for Category Level Object Detection," IEEE Trans. Multimedia, vol. 11, no. 8, pp. 1407-1421, Dec. 2009.
[2] K. Mikolajczyk, A. Zisserman, and C. Schmid, "Shape Recognition with Edge-Based Features," Proc. British Machine Vision Conf., pp. 779-788, 2003.
[3] F. Jurie and C. Schmid, "Scale Invariant Shape Features for Recognition of Object Categories," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 90-96, 2004.
[4] P.M. Kumar, P.H.S. Torr, and A. Zisserman, "Extending Pictorial Structures for Object Recognition," Proc. British Machine Vision Conf., pp. 789-798, 2004.
[5] A. Opelt, A. Pinz, and A. Zisserman, "Learning an Alphabet of Shape and Appearance for Multi-Class Object Detection," Int'l J. Computer Vision, vol. 80, no. 1, pp. 16-44, 2008.
[6] J. Shotton, A. Blake, and R. Cipolla, "Multi-Scale Categorical Object Recognition Using Contour Fragments," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 30, no. 7, pp. 1270-1281, July 2008.
[7] P. David and D. DeMenthon, "Object Recognition in High Clutter Images Using Line Features," Proc. Int'l Conf. Computer Vision, pp. 1581-1588, 2005.
[8] X. Ren, "Learning and Matching Line Aspects for Articulated Objects," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 1-8, 2007.
[9] A.-S. Chia, S. Rahardja, D. Rajan, and M. Leung, "Object Recognition by Discriminative Combinations of Line Segments and Ellipses," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 2225-2232, 2010.
[10] V. Ferrari, L. Fevrier, F. Jurie, and C. Schmid, "Groups of Adjacent Contour Segments for Object Detection," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 30, no. 1, pp. 36-51, Jan. 2008.
[11] D.R. Martin, C.C. Fowlkes, and J. Malik, "Learning to Detect Natural Image Boundaries Using Local Brightness, Color and Texture Cues," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 26, no. 5, pp. 530-549, May 2004.
[12] J. Gall and V. Lempitsky, "Class-Specific Hough Forests for Object Detection," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 1022-1029, 2009.
[13] M. Leordeanu, M. Hebert, and R. Sukthankar, "Beyond Local Appearance: Category Recognition from Pairwise Interactions of Simple Features," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 1-8, 2007.
[14] R. Fergus, P. Perona, and A. Zisserman, "A Visual Category Filter for Google Images," Proc. European Conf. Computer Vision, pp. 242-256, 2004.
[15] J. Shotton, A. Blake, and R. Cipolla, "Efficiently Combining Contour and Texture Cues for Object Recognition," Proc. British Machine Vision Conf., 2008.
[16] M. Leung and Y. Yang, "Dynamic Two-Strip Algorithm in Curve Fitting," Pattern Recognition, vol. 23, nos. 1/2, pp. 69-79, 1990.
[17] A.-S. Chia, S. Rahardja, D. Rajan, and M. Leung, "A Split and Merge Based Ellipse Detector with Self-Correcting Capability," IEEE Trans. Image Processing, vol. 20, no. 7, pp. 1991-2006, July 2011.
[18] V. Ferrari, T. Tuytelaars, and L. Gool, "Object Detection by Contour Segment Networks," Proc. European Conf. Computer Vision, pp. 14-28, 2006.
[19] D. Comaniciu and P. Meer, "Mean Shift: A Robust Approach toward Feature Space Analysis," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 24, no. 5, pp. 603-619, May 2002.
[20] E. Borenstein and S. Ullman, "Class-Specific, Top-Down Segmentation," Proc. European Conf. Computer Vision, pp. 639-641, 2002.
[21] L. Breiman, J. Friedman, C. Stone, and R. Olshen, Classification and Regression Trees. Wadsworth and Brooks, 1984.
[22] D.G. Lowe, "Distinctive Image Features from Scale-Invariant Keypoints," Int'l J. Computer Vision, vol. 60, no. 2, pp. 91-110, 2004.
[23] G. Csurka, C.R. Dance, L. Fan, J. Willamowski, and C. Bray, "Visual Categorization with Bags of Keypoints," Proc. European Conf. Computer Vision Workshop, pp. 1-22, 2004.
[24] B. Leibe and B. Schiele, "Interleaved Object Categorization and Segmentation," Proc. British Machine Vision Conf., pp. 759-768, 2003.
[25] G. Griffin, A. Holub, and P. Perona, "Caltech-256 Object Category Data Set," technical report, California Inst. of Technology, no. 24, 2007.
[26] X. Bai, X. Wang, L.J. Latecki, W. Liu, and Z. Tu, "Active Skeleton for Non-Rigid Object Detection," Proc. Int'l Conf. Computer Vision, 2009.
[27] J. Sivic, B.C. Russell, A.A. Efros, A. Zisserman, and W.T. Freeman, "Discovering Objects and Their Location in Images," Proc. Int'l Conf. Computer Vision, pp. 370-377, 2005.
[28] D. Crandall and D. Huttenlocher, "Weakly Supervised Learning of Part-Based Spatial Models for Visual Object Recognition," Proc. European Conf. Computer Vision, pp. 16-29, 2006.
[29] R. Fergus, P. Perona, and A. Zisserman, "Weakly Supervised Scale-Invariant Learning of Models for Visual Recognition," Int'l J. Computer Vision, vol. 71, no. 3, pp. 273-303, 2007.
[30] A. Bar-Hillel and D. Weinshall, "Efficient Learning of Relational Object Class Models," Int'l J. Computer Vision, vol. 7, nos. 1-3, pp. 175-198, 2008.
[31] Y. Chen, L.L. Zhu, A. Yuille, and H. Zhang, "Unsupervised Learning of Probabilistic Object Models (POMs) for Object Classification, Segmentation and Recognition Using Knowledge Propagation," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 31, no. 10, pp. 1747-1761, Oct. 2009.
[32] L.L. Zhu, Y. Chen, and A. Yuille, "Unsupervised Learning of Probabilistic Grammar-Markov Models for Object Categories," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 31, no. 1, pp. 114-128, Jan. 2009.
[33] E. Nowak, F. Jurie, and B. Triggs, "Sampling Strategies for Bag-of-Features Image Classification," Proc. European Conf. Computer Vision, pp. 490-503, 2006.
[34] J. Zhang, M. Marszalek, S. Lazebnik, and C. Schmid, "Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study," Int'l J. Computer Vision, vol. 73, no. 2, pp. 213-238, 2007.
43 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool