Issue No. 02 - Mar./Apr. (2018 vol. 20)
ISSN: 1521-9615
pp: 52-63
Qing Li, Southwest Jiaotong University
Qiang Peng, Southwest Jiaotong University
Chuan Yan, Southwest Jiaotong University
ABSTRACT
Despite the effectiveness of convolutional neural networks (CNNs), especially for image classification tasks, the representations learned from convolutional features remain limited: they focus mainly on an image's salient object and ignore the variation information from clutter and local objects. The authors propose a multiple vector of locally aggregated descriptors (VLAD) encoding method with CNN features for image classification. To improve the VLAD coding method's performance, they explore the multiplicity of VLAD encoding with the extension of three encoding algorithms. Moreover, they equip the spatial pyramid patch (SPM) on VLAD encoding to add spatial information to CNN features. The addition of SPM, in particular, allows their proposed framework to yield better performance compared to the traditional method.
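For readers unfamiliar with the building blocks named in the abstract, the following is a minimal sketch of the standard VLAD formulation (summed residuals to the nearest codebook centroid, followed by signed square-root and L2 normalization), pooled over a simple spatial grid. It is not the authors' multiple VLAD method, whose three encoding extensions are not described in this abstract. The function names, the 1x1 and 2x2 grid levels, and the choice of normalization are assumptions made for illustration only.

```python
import numpy as np


def vlad_encode(descriptors, centroids):
    """Standard VLAD: sum residuals of descriptors to their nearest centroid.

    descriptors: (n, d) array of local features (e.g. CNN activations).
    centroids:   (k, d) codebook, assumed to be learned beforehand (e.g. k-means).
    Returns a normalized vector of length k * d.
    """
    k, d = centroids.shape
    # Squared Euclidean distance from every descriptor to every centroid: (n, k).
    dists = ((descriptors[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    nearest = dists.argmin(axis=1)

    vlad = np.zeros((k, d), dtype=np.float64)
    for i in range(k):
        assigned = descriptors[nearest == i]
        if len(assigned):
            vlad[i] = (assigned - centroids[i]).sum(axis=0)

    vlad = vlad.ravel()
    # Signed square-root (power) normalization, then L2 normalization.
    vlad = np.sign(vlad) * np.sqrt(np.abs(vlad))
    norm = np.linalg.norm(vlad)
    return vlad / norm if norm > 0 else vlad


def spatial_pyramid_vlad(feature_map, centroids, levels=(1, 2)):
    """Hypothetical spatial pooling: encode each grid cell and concatenate.

    feature_map: (h, w, d) array of convolutional features.
    levels:      grid sizes per pyramid level (assumed; 1x1 and 2x2 here).
    """
    h, w, d = feature_map.shape
    parts = []
    for g in levels:
        for rows in np.array_split(np.arange(h), g):
            for cols in np.array_split(np.arange(w), g):
                cell = feature_map[np.ix_(rows, cols)].reshape(-1, d)
                parts.append(vlad_encode(cell, centroids))
    return np.concatenate(parts)
```

With a codebook of k centroids in d dimensions, `vlad_encode` returns a vector of length k * d, and `spatial_pyramid_vlad` with the assumed levels returns five such vectors concatenated (one for the whole map plus four for the 2x2 cells).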
INDEX TERMS
feature extraction, feedforward neural nets, image classification, image coding, image representation, learning (artificial intelligence)
CITATION

Q. Li, Q. Peng and C. Yan, "Multiple VLAD Encoding of CNNs for Image Classification," in Computing in Science & Engineering, vol. 20, no. 2, pp. 52-63, 2018.
doi:10.1109/MCSE.2018.108164530