2017 IEEE International Conference on Computer Vision Workshop (ICCVW) (2017)
Venice, Italy
Oct. 22, 2017 to Oct. 29, 2017
ISSN: 2473-9944
ISBN: 978-1-5386-1034-3
pp: 1041-1049
ABSTRACT
Feature pooling is a method that summarizes local descriptors in an image using spatial information. Spatial pyramid matching uses the statistics of local features in an image subregion as a global feature. However, this method has several disadvantages: there is no theoretical guideline for selecting the pooling region; robustness to small image translations is lost around the edges of the pooling region; and the information encoded in the different feature pyramids overlaps, so recognition performance stagnates as a larger pyramid size is selected. In this research, we propose a novel interpretation that regards feature pooling as an orthogonal projection in the space of functions that map the image space to the local feature space. Moreover, we propose a novel feature-pooling method that orthogonally projects the function form of local descriptors into the space of low-degree polynomials. We also evaluate the robustness of the proposed method. Experimental results demonstrate the effectiveness of the proposed methods.
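The projection idea in the abstract can be illustrated with a small NumPy sketch. This is not the authors' implementation: the function names `polynomial_basis` and `orthogonal_pool`, the use of Legendre polynomial products, the regular sampling of positions in [-1, 1] x [-1, 1], and the QR-based orthonormalization are all assumptions made for illustration. The sketch treats the local descriptors as samples of a function from image position to feature space and projects that function onto polynomials of low total degree.

```python
import numpy as np
from numpy.polynomial import legendre


def polynomial_basis(xs, ys, max_degree):
    """Hypothetical helper: (n_points, n_basis) matrix of 2-D polynomials
    of total degree <= max_degree, orthonormal on the sample points."""
    columns = []
    for total in range(max_degree + 1):
        for i in range(total + 1):
            j = total - i
            # Legendre polynomial of degree i in x times degree j in y.
            px = legendre.legval(xs, [0] * i + [1])
            py = legendre.legval(ys, [0] * j + [1])
            columns.append(px * py)
    raw = np.stack(columns, axis=1)
    # QR orthonormalizes the columns under the discrete inner product.
    q, r = np.linalg.qr(raw)
    # Fix column signs so the constant basis function is positive.
    return q * np.sign(np.diag(r))


def orthogonal_pool(features, xs, ys, max_degree=2):
    """Project descriptors (n_points, feature_dim) onto the polynomial
    basis and return the flattened coefficients as a global feature."""
    basis = polynomial_basis(xs, ys, max_degree)
    coeffs = basis.T @ features  # shape: (n_basis, feature_dim)
    return coeffs.reshape(-1)
```

Under these assumptions, the degree-0 coefficients equal the average-pooled descriptor scaled by the square root of the number of sample points, so ordinary average pooling appears as the lowest-degree case of the projection; higher-degree coefficients add smoothly varying spatial weights rather than hard region boundaries.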
INDEX TERMS
Manganese, Spatial resolution, Robustness, Feature extraction, Standards, Encoding
CITATION

Y. Mukuta, Y. Ushiku and T. Harada, "Spatial-Temporal Weighted Pyramid Using Spatial Orthogonal Pooling," 2017 IEEE International Conference on Computer Vision Workshop (ICCVW), Venice, Italy, 2017, pp. 1041-1049.
doi:10.1109/ICCVW.2017.127