The Community for Technology Leaders
Issue No. 04 - Dec. (2015 vol. 1)
ISSN: 2332-7790
pp: 162-171
Yang Yang , School of Computer Science and Engineering, University of Electronic Science and Technology of China
Fumin Shen , School of Computer Science and Engineering, University of Electronic Science and Technology of China
Heng Tao Shen , School of Computer Science and Engineering, University of Electronic Science and Technology of China
Hanxi Li , School of Computer and Information Engineering, Jiangxi Normal University, China
Xuelong Li , Center for OPTical IMagery Analysis and Learning (OPTIMAL), State Key Laboratory of Transient Optics and Photonics, Xi?an Institute of Optics and Precision Mechanics, Chinese Academy of Sciences, Xi’an, Shaanxi, P.R. China
ABSTRACT
In big data era, the ever-increasing image data has posed significant challenge on modern image retrieval. It is of great importance to index images with semantic keywords efficiently and effectively, especially confronted with fast-evolving property of the web. Learning-based hashing has shown its power in handling large-scale high-dimensional applications, such as image retrieval. Existing solutions normally separate the process of learning binary codes and hash functions into two independent stages to bypass challenge of the discrete constraints on binary codes. In this work, we propose a novel unsupervised hashing approach, namely robust discrete hashing (RDSH), to facilitate large-scale semantic indexing of image data. Specifically, RDSH simultaneously learns discrete binary codes as well as robust hash functions within a unified model. In order to suppress the influence of unreliable binary codes and learn robust hash functions, we also integrate a flexible $\ell _{2,p}$ loss with nonlinear kernel embedding to adapt to different noise levels. Finally, we devise an alternating algorithm to efficiently optimize RDSH model. Given a test image, we first conduct $r$-nearest-neighbor search based on Hamming distance of binary codes, and then propagate semantic keywords of neighbors to the test image. Extensive experiments have been conducted on various real-world image datasets to show its superiority to the state-of-the-arts in large-scale semantic indexing.
INDEX TERMS
Binary codes, Semantics, Robustness, Indexing, Optimization, Hamming distance, Electronic mail
CITATION

Y. Yang, F. Shen, H. T. Shen, H. Li and X. Li, "Robust Discrete Spectral Hashing for Large-Scale Image Semantic Indexing," in IEEE Transactions on Big Data, vol. 1, no. 4, pp. 162-171, 2015.
doi:10.1109/TBDATA.2016.2516024
CITATIONS
SHARE
624 ms
(Ver 3.3 (11022016))