loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1
Modeling Scenes with Local Descriptors and Latent Aspects
Beijing, China
October 17-October 20
ISBN: 0-7695-2334-X
P. Quelhas, IDIAP Research Institute
F. Monay, IDIAP Research Institute
J.-M. Odobez, IDIAP Research Institute
D. Gatica-Perez, IDIAP Research Institute
T. Tuytelaars, IDIAP Research Institute
L. Van Gool, Katholieke Universiteit Leuven

We present a new approach to model visual scenes in image collections, based on local invariant features and probabilistic latent space models. Our formulation provides answers to three open questions:(1) whether the invariant local features are suitable for scene (rather than object) classification; (2) whether unsupervised latent space models can be used for feature extraction in the classification task; and (3) whether the latent space formulation can discover visual co-occurrence patterns, motivating novel approaches for image organization and segmentation.

Using a 9500-image dataset, our approach is validated on each of these issues. First, we show with extensive experiments on binary and multi-class scene classification tasks, that a bag-of-visterm representation, derived from local invariant descriptors, consistently outperforms state-of-the-art approaches. Second, we show that Probabilistic Latent Semantic Analysis (PLSA) generates a compact scene representation, discriminative for accurate classification, and significantly more robust when less training data are available. Third, we have exploited the ability of PLSA to automatically extract visually meaningful aspects, to propose new algorithms for aspect-based image ranking and context-sensitive image segmentation.

Citation:
P. Quelhas, F. Monay, J.-M. Odobez, D. Gatica-Perez, T. Tuytelaars, L. Van Gool, "Modeling Scenes with Local Descriptors and Latent Aspects," iccv, vol. 1, pp.883-890, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, 2005
Usage of this product signifies your acceptance of the Terms of Use.