The Community for Technology Leaders
2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (2010)
San Francisco, CA, USA
June 13, 2010 to June 18, 2010
ISBN: 978-1-4244-6984-0
pp: 1086-1093
Zhiqi Zhang , Department of Computer Science and Engineering, University of South Carolina, Columbia, SC 29208, USA
Yu Cao , Department of Computer Science and Engineering, University of South Carolina, Columbia, SC 29208, USA
Dhaval Salvi , Department of Computer Science and Engineering, University of South Carolina, Columbia, SC 29208, USA
Kenton Oliver , Department of Computer Science and Engineering, University of South Carolina, Columbia, SC 29208, USA
Jarrell Waggoner , Department of Computer Science and Engineering, University of South Carolina, Columbia, SC 29208, USA
Song Wang , Department of Computer Science and Engineering, University of South Carolina, Columbia, SC 29208, USA
ABSTRACT
Object localization in an image is usually handled by searching for an optimal subwindow that tightly covers the object of interest. However, the subwindows considered in previous work are limited to rectangles or other specified, simple shapes. With such specified shapes, no subwindow can cover the object of interest tightly. As a result, the desired subwindow around the object of interest may not be optimal in terms of the localization objective function, and cannot be detected by a subwindow search algorithm. In this paper, we propose a new graph-theoretic approach for object localization by searching for an optimal subwindow without pre-specifying its shape. Instead, we require the resulting subwindow to be well aligned with edge pixels that are detected from the image. This requirement is quantified and integrated into the localization objective function based on the widely-used bag of visual words technique. We show that the ratio-contour graph algorithm can be adapted to find the optimal free-shape subwindow in terms of the new localization objective function. In the experiment, we test the proposed approach on the PASCAL VOC 2006 and VOC 2007 databases for localizing several categories of animals. We find that its performance is better than the previous efficient subwindow search algorithm.
INDEX TERMS
CITATION

D. Salvi, K. Oliver, J. Waggoner, Z. Zhang, Y. Cao and S. Wang, "Free-shape subwindow search for object localization," 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition(CVPR), San Francisco, CA, USA, 2010, pp. 1086-1093.
doi:10.1109/CVPR.2010.5540095
88 ms
(Ver 3.3 (11022016))