2017 IEEE International Conference on Computer Vision Workshop (ICCVW) (2017)
Oct. 22, 2017 to Oct. 29, 2017
Feature pooling is a method that summarizes the local descriptors in an image using spatial information. Spatial pyramid matching, a common instance, uses the statistics of the local features in each image subregion as a global feature. However, this method has several disadvantages: there is no theoretical guideline for selecting the pooling regions; robustness to small image translations is lost around the edges of the pooling regions; and the information encoded at different pyramid levels overlaps, so recognition performance stagnates as the pyramid size increases. In this research, we propose a novel interpretation that regards feature pooling as an orthogonal projection in the space of functions that map the image space to the local feature space. Building on this interpretation, we propose a novel feature-pooling method that orthogonally projects the function form of the local descriptors onto the space of low-degree polynomials. We also evaluate the robustness of the proposed method. Experimental results demonstrate the effectiveness of the proposed methods.
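To make the projection view concrete, the following is a minimal sketch rather than the authors' implementation. It assumes descriptor positions are normalized to [-1, 1]^2 and sampled roughly uniformly, and it uses products of Legendre polynomials as the low-degree polynomial basis; the basis choice, the scaling, and the function names `average_pool` and `polynomial_pool` are illustrative assumptions not taken from the paper. Under these assumptions the degree-0 coefficient coincides with ordinary average pooling, and higher-degree coefficients add smooth spatial information without hard region boundaries.

```python
import numpy as np
from numpy.polynomial.legendre import legvander


def average_pool(descriptors):
    """Plain average pooling: mean of all local descriptors, shape (D,)."""
    return descriptors.mean(axis=0)


def polynomial_pool(positions, descriptors, degree=2):
    """Approximate projection coefficients onto low-degree polynomials.

    positions:   (N, 2) array, coordinates assumed to lie in [-1, 1].
    descriptors: (N, D) array of local descriptors.
    Returns a (K * D,) vector, K = number of (p, q) pairs with p + q <= degree.

    Assumption: a Legendre product basis, scaled to be orthonormal under the
    uniform distribution on [-1, 1]^2. The paper's basis may differ.
    """
    vx = legvander(positions[:, 0], degree)  # (N, degree + 1), columns P_0..P_degree
    vy = legvander(positions[:, 1], degree)
    pooled = []
    for p in range(degree + 1):
        for q in range(degree + 1 - p):
            # Orthonormal scaling: E[P_p(x)^2] = 1 / (2p + 1) for uniform x.
            scale = np.sqrt((2 * p + 1) * (2 * q + 1))
            weights = scale * vx[:, p] * vy[:, q]  # (N,)
            # Sample mean approximates the inner product with the basis function.
            pooled.append((weights[:, None] * descriptors).mean(axis=0))
    return np.concatenate(pooled)


# Synthetic stand-in data (not from the paper): 500 descriptors of dimension 8.
rng = np.random.default_rng(0)
positions = rng.uniform(-1.0, 1.0, size=(500, 2))
descriptors = rng.normal(size=(500, 8))

pooled = polynomial_pool(positions, descriptors, degree=2)
```

With `degree=2` there are six basis functions, so `pooled` concatenates six blocks of length 8; the first block equals `average_pool(descriptors)` because the degree-0 basis function is constant.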
Manganese, Spatial resolution, Robustness, Feature extraction, Standards, Encoding
Y. Mukuta, Y. Ushiku and T. Harada, "Spatial-Temporal Weighted Pyramid Using Spatial Orthogonal Pooling," 2017 IEEE International Conference on Computer Vision Workshop (ICCVW), Venice, Italy, 2017, pp. 1041-1049.