|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
| ASCII Text | x | ||
| Jun Wang, S. Kumar, Shih-Fu Chang, "Semi-Supervised Hashing for Large-Scale Search," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 34, no. 12, pp. 2393-2406, Dec., 2012. | |||
| BibTex | x | ||
| @article{ 10.1109/TPAMI.2012.48, author = { Jun Wang and S. Kumar and Shih-Fu Chang}, title = {Semi-Supervised Hashing for Large-Scale Search}, journal ={IEEE Transactions on Pattern Analysis and Machine Intelligence}, volume = {34}, number = {12}, issn = {0162-8828}, year = {2012}, pages = {2393-2406}, doi = {http://doi.ieeecomputersociety.org/10.1109/TPAMI.2012.48}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - JOUR JO - IEEE Transactions on Pattern Analysis and Machine Intelligence TI - Semi-Supervised Hashing for Large-Scale Search IS - 12 SN - 0162-8828 SP2393 EP2406 EPD - 2393-2406 A1 - Jun Wang, A1 - S. Kumar, A1 - Shih-Fu Chang, PY - 2012 KW - learning (artificial intelligence) KW - content-based retrieval KW - file organisation KW - image retrieval KW - orthogonal hashing KW - semisupervised hashing method KW - large-scale search KW - hashing-based approximate nearest neighbor search KW - ANN search KW - computational efficiency KW - memory efficiency KW - locality sensitive hashing KW - spectral hashing KW - random projections KW - principal projections KW - semantic similarity KW - SSH framework KW - information theoretic regularizer KW - unlabeled sets KW - nonorthogonal hashing KW - sequential learning paradigm KW - content-based image retrieval KW - sequential hashing method KW - Artificial neural networks KW - Semantics KW - Encoding KW - Extraterrestrial measurements KW - Binary codes KW - Semisupervised learning KW - Sequential analysis KW - sequential hashing KW - Hashing KW - nearest neighbor search KW - binary codes KW - semi-supervised hashing KW - pairwise labels VL - 34 JA - IEEE Transactions on Pattern Analysis and Machine Intelligence ER - | |||
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TPAMI.2012.48
Hashing-based approximate nearest neighbor (ANN) search in huge databases has become popular due to its computational and memory efficiency. The popular hashing methods, e.g., Locality Sensitive Hashing and Spectral Hashing, construct hash functions based on random or principal projections. The resulting hashes are either not very accurate or are inefficient. Moreover, these methods are designed for a given metric similarity. On the contrary, semantic similarity is usually given in terms of pairwise labels of samples. There exist supervised hashing methods that can handle such semantic similarity, but they are prone to overfitting when labeled data are small or noisy. In this work, we propose a semi-supervised hashing (SSH) framework that minimizes empirical error over the labeled set and an information theoretic regularizer over both labeled and unlabeled sets. Based on this framework, we present three different semi-supervised hashing methods, including orthogonal hashing, nonorthogonal hashing, and sequential hashing. Particularly, the sequential hashing method generates robust codes in which each hash function is designed to correct the errors made by the previous ones. We further show that the sequential learning paradigm can be extended to unsupervised domains where no labeled pairs are available. Extensive experiments on four large datasets (up to 80 million samples) demonstrate the superior performance of the proposed SSH methods over state-of-the-art supervised and unsupervised hashing techniques.
Index Terms:
learning (artificial intelligence),content-based retrieval,file organisation,image retrieval,orthogonal hashing,semisupervised hashing method,large-scale search,hashing-based approximate nearest neighbor search,ANN search,computational efficiency,memory efficiency,locality sensitive hashing,spectral hashing,random projections,principal projections,semantic similarity,SSH framework,information theoretic regularizer,unlabeled sets,nonorthogonal hashing,sequential learning paradigm,content-based image retrieval,sequential hashing method,Artificial neural networks,Semantics,Encoding,Extraterrestrial measurements,Binary codes,Semisupervised learning,Sequential analysis,sequential hashing,Hashing,nearest neighbor search,binary codes,semi-supervised hashing,pairwise labels
Citation:
Jun Wang, S. Kumar, Shih-Fu Chang, "Semi-Supervised Hashing for Large-Scale Search," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 34, no. 12, pp. 2393-2406, Dec. 2012, doi:10.1109/TPAMI.2012.48
Usage of this product signifies your acceptance of the Terms of Use.

