The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.01 - Jan. (2014 vol.36)
pp: 188-194
Xin Yang , Electr. & Comput. Eng. Dept., Univ. of California, Santa Barbara, Santa Barbara, CA, USA
Kwang-Ting Cheng , Electr. & Comput. Eng. Dept., Univ. of California, Santa Barbara, Santa Barbara, CA, USA
ABSTRACT
The efficiency and quality of a feature descriptor are critical to the user experience of many computer vision applications. However, the existing descriptors are either too computationally expensive to achieve real-time performance, or not sufficiently distinctive to identify correct matches from a large database with various transformations. In this paper, we propose a highly efficient and distinctive binary descriptor, called local difference binary (LDB). LDB directly computes a binary string for an image patch using simple intensity and gradient difference tests on pairwise grid cells within the patch. A multiple-gridding strategy and a salient bit-selection method are applied to capture the distinct patterns of the patch at different spatial granularities. Experimental results demonstrate that compared to the existing state-of-the-art binary descriptors, primarily designed for speed, LDB has similar construction efficiency, while achieving a greater accuracy and faster speed for mobile object recognition and tracking tasks.
INDEX TERMS
Robustness, Training, Real-time systems, Face, Training data, Detectors, Databases,augmented reality, Binary feature descriptor, mobile devices, object recognition, tracking
CITATION
Xin Yang, Kwang-Ting Cheng, "Local Difference Binary for Ultrafast and Distinctive Feature Description", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol.36, no. 1, pp. 188-194, Jan. 2014, doi:10.1109/TPAMI.2013.150
REFERENCES
[1] H. Bay, A. Ess, T. Tuytelaars, and L.V. Gool, "SURF: Speeded-Up Robust Features," Proc. European Conf. Computer Vision (ECCV), 2006.
[2] D.G. Lowe, "Distinctive Image Features from Scale-Invariant Keypoints," Int'l J. Computer Vision, vol. 60, no. 2, pp. 91-110, 2004.
[3] Y. Ke and R. Sukthankar, "PCA-SIFT: A More Distinctive Representation for Local Image Descriptors," Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR), 2004.
[4] M. Calonder, V. Lepetit, C. Strecha, and P. Fua, "BRIEF: Binary Robust Independent Elementary Features," Proc. European Conf. Computer Vision (ECCV), 2010.
[5] E. Rublee, V. Rabaud, K. Konolige, and G. Bradski, "ORB: An Efficient Alternative to SIFT or SURF," Proc. IEEE Int'l Conf. Computer Vision (ICCV), 2011.
[6] S. Leutengger, M. Chli, and R.Y. Siegwart, "BRISK: Binary Robust Invariant Scalable Keypoints," Proc. IEEE Int'l Conf. Computer Vision (ICCV), 2011.
[7] A. Alahi, R. Ortiz, and P. Vandergheynst, "FREAK: Fast Retinal Keypoint," Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR), 2012.
[8] E. Schechtman and M. Irani, "Matching Local Self-Similarities across Images and Videos," Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR), 2007.
[9] E. Rosten, R. Porter, and T. Drummond, "Faster and Better: A Machine Learning Approach to Corner Detection," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 32, no. 1, pp. 105-119, Jan. 2010.
[10] D. Wagner, G. Reitmayr, A. Mulloni, T. Drummond, and D. Schmalstieg, "Pose Tracking from Natural Features on Mobile Phones," Proc. Int'l Symp. Mixed and Augmented Reality (ISMAR), 2008.
[11] D. Wagner, D. Schmalstieg, and H. Bischof, "Multiple Target Detection and Tracking with Guaranteed Framerates on Mobile Phones," Proc. Int'l Symp. Mixed and Augmented Reality (ISMAR), 2009.
[12] K. Mikolajczyk and C. Schmid, "A Performance Evaluation of Local Descriptors," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 27, no. 10, pp. 1615-1630, Oct. 2005.
[13] O. Chum and J. Matas, "Matching with PROSAC—Progressive Sample Consensus," Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR '05), vol. 1, pp. 220-226, 2005.
[14] P. Simard, L. Bottou, P. Haffner, and Y. LeCun, "Boxlets: A Fast Convolution Algorithm for Signal Processing and Neural Networks," Proc. Neural Information Processing Systems (NIPS), 1998.
[15] A. Gionis, P. Indik, and R. Motwani, "Similarity Search in High Dimensions via Hashing," Proc. Int'l Conf. Very Large Databases (VLDB), 2004.
[16] A.J. Davison, N.D. Molton, I. Reid, and O. Stasse, "MonoSLAM: Real-Time Single Camera SLAM," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 29, no. 6, pp. 1052-1067, June 2007.
[17] P. Viola and M. Jones, "Robust Real-time Object Detection," Int'l J. Computer Vision, vol. 57, no. 2, pp. 137-154, May 2004.
[18] S. Winder, G. Hua, and M. Brown, "Picking the Best DAISY," Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR), 2009.
[19] X. Yang, Q. Liu, C.Y. Liao, and K.T. Cheng, "Large-Scale EMM Identification Based on Geometry-Constrained Visual Word Correspondence Voting," Proc. ACM Int'l Conf. Multimedia Retrieval (ICMR), 2011.
[20] X. Yang and K.T. Cheng, "LDB: An Ultra-Fast Feature for Scalable Augmented Reality on Mobile Devices," Proc. Int'l Symp. Mixed and Augmented Reality (ISMAR), 2012.
[21] J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman, "Object Retrieval with Large Vocabularies and Fast Spatial Matching," Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR), 2007.
[22] M. Calonder, V. Lepetit, K. Konolige, J. Bowman, P. Mihelich, and P. Fua, "Compact Signatures for High-Speed Interest Point Description and Matching," Proc. IEEE Int'l Conf. Computer Vision (ICCV), 2009.
[23] M.A. Fischler and R.C. Bolles, "Random Sample Consensus: A Paradigm for Model Fitting with Applications to Image Analysis and Automated Cartography," Comm. ACM, vol. 24, pp. 381-395, 1981.
58 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool