The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.12 - Dec. (2012 vol.34)
pp: 2481-2488
Mani Malek Esmaeili , Electr. & Comput. Eng. Dept., Univ. of British Columbia, Vancouver, BC, Canada
R. K. Ward , Electr. & Comput. Eng. Dept., Univ. of British Columbia, Vancouver, BC, Canada
M. Fatourechi , Electr. & Comput. Eng. Dept., Univ. of British Columbia, Vancouver, BC, Canada
ABSTRACT
A fast approximate nearest neighbor search algorithm for the (binary) Hamming space is proposed. The proposed Error Weighted Hashing (EWH) algorithm is up to 20 times faster than the popular locality sensitive hashing (LSH) algorithm and works well even for large nearest neighbor distances where LSH fails. EWH significantly reduces the number of candidate nearest neighbors by weighing them based on the difference between their hash vectors. EWH can be used for multimedia retrieval and copy detection systems that are based on binary fingerprinting. On a fingerprint database with more than 1,000 videos, for a specific detection accuracy, we demonstrate that EWH is more than 10 times faster than LSH. For the same retrieval time, we show that EWH has a significantly better detection accuracy with a 15 times lower error rate.
INDEX TERMS
Nearest neighbor searches, Indexes, Signal processing algorithms, Hamming distance, Algorithm design and analysis, Approximation algorithms, binary embedding, Nearest neighbor search, Hamming space, multimedia fingerprinting, copy retrieval
CITATION
Mani Malek Esmaeili, R. K. Ward, M. Fatourechi, "A Fast Approximate Nearest Neighbor Search Algorithm in the Hamming Space", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol.34, no. 12, pp. 2481-2488, Dec. 2012, doi:10.1109/TPAMI.2012.170
REFERENCES
[1] G. Shakhnarovich, P. Viola, and T. Darrell, "Fast Pose Estimation with Parameter-Sensitive Hashing," Proc. IEEE Int'l Conf. Computer Vision, vol. 2, pp. 750 -757, Oct. 2003.
[2] M.M. Esmaeili, M. Fatourechi, and R.K. Ward, "A Robust and Fast Video Copy Detection System Using Content-Based Fingerprinting," IEEE Trans. Information Forensics and Security, vol. 6, no. 1, pp. 213 -226, Mar. 2011.
[3] M. Fatourechi, X. Lv, M. MalekEsmaeili, Z.J. Wang, and R.K. Ward, Image and Video Copy Detection Using Content-Based Fingerprinting, ser. Multimedia Image and Video Processing, second ed., ch. 18, pp. 459-487, 2011.
[4] A.L. Varna, A. Swaminathan, and M. Wu, "A Decision Theoretic Framework for Analyzing Binary Hash-Based Content Identification Systems," Proc. ACM Workshop Digital Rights Management, http://doi.acm.org/10.11451456520.1456532 , pp. 67-76, 2008.
[5] B. Kulis and T. Darrell, "Learning to Hash with Binary Reconstructive Embeddings," Proc. Advances in Neural Information Processing Systems 22, Y. Bengio, D. Schuurmans, J. Lafferty, C.K.I. Williams, and A. Culotta, eds., pp. 1042-1050, 2009.
[6] J. Oostveen, T. Kalker, and J. Haitsma, "Feature Extraction and a Database Strategy for Video Fingerprinting," Proc. Int'l Conf. Recent Advances in Visual Information Systems, pp. 117-128, 2002.
[7] F. Hao, J. Daugman, and P. Zielinski, "A Fast Search Algorithm for a Large Fuzzy Database," IEEE Trans. Information Forensics and Security, vol. 3, no. 2, pp. 203-212, June 2008.
[8] R.R. Salakhutdinov and G.E. Hinton, "Semantic Hashing," Proc. SIGIR Workshop Information Retrieval and Applications of Graphical Models, 2007.
[9] Y. Weiss, A. Torralba, and R. Fergus, "Spectral Hashing," Proc. Ann. Conf. Neural Information Processing Systems, 2008.
[10] R. Chaudhry and Y. Ivanov, "Fast Approximate Nearest Neighbor Methods for Non-Euclidean Manifolds with Applications to Human Activity Analysis in Videos," Proc. European Conf. Computer Vision, pp. 735-748, 2010.
[11] A. Gionis, P. Indyk, and R. Motwani, "Similarity Search in High Dimensions via Hashing," Proc. Int'l Conf. Very Large Data Bases, pp. 518-529, 1999.
[12] M.S. Charikar, "Similarity Estimation Techniques from Rounding Algorithms," Proc. ACM Symp. Theory of Computing, pp. 380-388, 2002.
[13] B. Kulis and K. Grauman, "Kernelized Locality-Sensitive Hashing for Scalable Image Search," Proc. IEEE Int'l Conf. Computer Vision, pp. 2130-2137, Oct. 2009.
[14] J.A. Haitsma, A.A.C.M. Kalker, C.P.M.J. Baggen, and J.C. Oostveen, "Generating and Matching Hashes of Multimedia Content," US 2002/0178410, Nov. 2002.
[15] C. Bauer, R. Radhakrishnan, and W. Jiang, "Optimal Configuration of Hash Table Based Multimedia Fingerprint Databases Using Weak Bits," Proc. IEEE Int'l Conf. Multimedia and Expo, pp. 1667 -1672, July 2010.
[16] M. Malekesmaeili, R.K. Ward, and M. Fatourechi, "Fast Matching for Video/Audio Fingerprinting Algorithms," Proc. IEEE Int'l Workshop Information Forensics and Security, Nov. 2011.
[17] M.L. Miller, "Audio Fingerprinting: Nearest Neighbour Search in High Dimensional Binary Spaces," Proc. IEEE Workshop Multimedia Signal Processing, pp. 182-185, 2002.
[18] http://ipl.ece.ubc.camultimedia.html, 2012.
[19] http://groups.csail.mit.edu/visiontinyimages /, 2012.
41 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool