loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
2009 IEEE Conference on Computer Vision and Pattern Recognition
Linear spatial pyramid matching using sparse coding for image classification
Miami, FL, USA
June 20-June 25
ISBN: 978-1-4244-3992-8
Jianchao Yang, Beckman Inst., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
T. Huang, Beckman Inst., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
Recently SVMs using spatial pyramid matching (SPM) kernel have been highly successful in image classification. Despite its popularity, these nonlinear SVMs have a complexity O(n2 ~ n3) in training and O(n) in testing, where n is the training size, implying that it is nontrivial to scaleup the algorithms to handle more than thousands of training images. In this paper we develop an extension of the SPM method, by generalizing vector quantization to sparse coding followed by multi-scale spatial max pooling, and propose a linear SPM kernel based on SIFT sparse codes. This new approach remarkably reduces the complexity of SVMs to O(n) in training and a constant in testing. In a number of image categorization experiments, we find that, in terms of classification accuracy, the suggested linear SPM based on sparse coding of SIFT descriptors always significantly outperforms the linear SPM kernel on histograms, and is even better than the nonlinear SPM kernels, leading to state-of-the-art performance on several benchmarks by using a single type of descriptors.
Index Terms:
SIFT descriptor, linear spatial pyramid matching, sparse coding, image classification, SPM kernel, nonlinear SVM, training images, vector quantization, multiscale spatial max pooling, SIFT sparse codes, computational complexity, image categorization
Citation:
Jianchao Yang, Kai Yu, Yihong Gong, T. Huang, "Linear spatial pyramid matching using sparse coding for image classification," cvpr, pp.1794-1801, 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009
Usage of this product signifies your acceptance of the Terms of Use.