The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.10 - Oct. (2012 vol.34)
pp: 2031-2045
Bin Fan , Chinese Academy of Sciences, Beijing
Fuchao Wu , Chinese Academy of Sciences, Beijing
Zhanyi Hu , Chinese Academy of Sciences, Beijing
ABSTRACT
This paper proposes a novel method for interest region description which pools local features based on their intensity orders in multiple support regions. Pooling by intensity orders is not only invariant to rotation and monotonic intensity changes, but also encodes ordinal information into a descriptor. Two kinds of local features are used in this paper, one based on gradients and the other on intensities; hence, two descriptors are obtained: the Multisupport Region Order-Based Gradient Histogram (MROGH) and the Multisupport Region Rotation and Intensity Monotonic Invariant Descriptor (MRRID). Thanks to the intensity order pooling scheme, the two descriptors are rotation invariant without estimating a reference orientation, which appears to be a major error source for most of the existing methods, such as Scale Invariant Feature Transform (SIFT), SURF, and DAISY. Promising experimental results on image matching and object recognition demonstrate the effectiveness of the proposed descriptors compared to state-of-the-art descriptors.
INDEX TERMS
Image matching, Object recognition, Estimation error, Detectors, Robustness, Feature extraction, SIFT., Local image descriptor, rotation invariance, monotonic intensity invariance, image matching, intensity orders
CITATION
Bin Fan, Fuchao Wu, Zhanyi Hu, "Rotationally Invariant Descriptors Using Intensity Order Pooling", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol.34, no. 10, pp. 2031-2045, Oct. 2012, doi:10.1109/TPAMI.2011.277
REFERENCES
[1] S. Agarwal, N. Snavely, I. Simon, S.M. Seitz, and R. Szeliski, "Building Rome in a Day," Proc. 12th IEEE Int'l Conf. Computer Vision, pp. 72-79, 2009.
[2] Y. Furukawa and J. Ponce, "Accurate, Dense, and Robust Multiview Stereopsis," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 32, no. 8, pp. 1362-1376, Aug. 2010.
[3] J.-M. Frahm, P. Fite-Georgel, D. Gallup, T. Johnson, R. Raguram, C. Wu, Y.-H. Jen, E. Dunn, B. Clipp, S. Lazebnik, and M. Pollefeys, "Building Rome on a Cloudless Day," Proc. 11th European Conf. Computer Vision, pp. 368-381, 2010.
[4] N. Snavely, S.M. Seitz, and R. Szeliski, "Photo Tourism: Exploring Photo Collections in 3D," ACM Trans. Graphics, vol. 25, pp. 835-846, 2006.
[5] S. Agarwal, Y. Furukawa, N. Snavely, B. Curless, S.M. Seitz, and R. Szeliski, "Reconstructing Rome," Computer, vol. 43, pp. 40-47, 2010.
[6] D.G. Lowe, "Distinctive Image Features from Scale-Invariant Keypoints," Int'l J. Computer Vision, vol. 60, no. 2, pp. 91-110, 2004.
[7] J. Zhang, M. Marszalek, S. Lazebnik, and C. Schmid, "Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study," Int'l J. Computer Vision, vol. 73, no. 2, pp. 213-238, 2007.
[8] R. Szeliski, "Image Alignment and Stitching: A Tutorial," Foundations and Trends in Computer Graphics and Vision, vol. 2, pp. 1-104, 2006.
[9] K. Mikolajczyk and C. Schmid, "An Affine Invariant Interest Point Detector," Proc. Seventh European Conf. Computer Vision, pp. 128-142, 2002.
[10] S. Lazebnik, C. Schmid, and J. Ponce, "A Sparse Texture Representation Using Local Affine Regions," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 27, no. 8, pp. 1265-1278, Aug. 2005.
[11] J.J. Koenderink and A.J.V. Doorn, "Representation of Local Geometry in the Visual System," Biological Cybernetics, vol. 55, pp. 367-375, 1987.
[12] A. Baumberg, "Reliable Feature Matching across Widely Separated Views," Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 1, 2000.
[13] F. Schaffalitzky and A. Zisserman, "Multi-View Matching for Unordered Image Sets, or How Do I Organize My Holiday Snaps?" Proc. Seventh European Conf. Computer Vision, pp. 414-431, 2002.
[14] M. Heikkila, M. Pietikainen, and C. Schmid, "Description of Interest Regions with Local Binary Patterns," Pattern Recognition, vol. 42, pp. 425-436, 2009.
[15] E. Mortensen, H. Deng, and L. Shapiro, "A SIFT Descriptor with Global Context," Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 1, pp. 184-190, 2005.
[16] E. Tola, V. Lepetit, and P. Fua, "A Fast Local Descriptor for Dense Matching," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2008.
[17] B. Fan, F. Wu, and Z. Hu, "Aggregating Gradient Distributions into Intensity Orders: A Novel Local Image Descriptor," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 2377-2384, 2011.
[18] C. Harris and M. Stephens, "A Combined Corner and Edge Detector," Proc. Fourth Alvey Vision Conf., pp. 147-151, 1988.
[19] J. Matas, O. Chum, M. Urba, and T. Pajdla, "Robust Wide Baseline Stereo from Maximally Stable Extremal Regions," Proc. British Machine Vision Conf., pp. 384-396, 2002.
[20] T. Tuytelaars and L.V. Gool, "Matching Widely Separated Views Based on Affine Invariant Regions," Int'l J. Computer Vision, vol. 59, no. 1, pp. 61-85, 2004.
[21] K. Mikolajczyk, T. Tuytelaars, C. Schmid, A. Zisserman, J. Matas, F. Schaffalitzky, T. Kadir, and L.V. Gool, "A Comparison of Affine Region Detectors," Int'l J. Computer Vision, vol. 65, nos. 1/2, pp. 43-72, 2005.
[22] K. Mikolajczyk and C. Schmid, "A Performance Evaluation of Local Descriptors," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 27, no. 10, pp. 1615-1630, Oct. 2005.
[23] S. Belongie, J. Malik, and J. Puzicha, "Shape Context: A New Descriptor for Shape Matching and Object Recognition," Proc. Neural Information Processing Systems Conf., pp. 831-837, 2001.
[24] Y. Ke and R. Sukthankar, "PCA-SIFT: A More Distinctive Representation for Local Image Descriptors," Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 2, pp. 511-517, 2004.
[25] R. Gupta and A. Mittal, "SMD: A Locally Stable Monotonic Change Invariant Feature Descriptor," Proc. 10th European Conf. Computer Vision, pp. 265-277, 2008.
[26] J. Chen, S. Shan, G. Zhao, X. Chen, W. Gao, and M. Pietikainen, "A Robust Descriptor Based on Weber's Law," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2008.
[27] H. Bay, A. Ess, T. Tuytelaars, and L.V. Gool, "SURF: Speeded Up Robust Features," Computer Vision and Image Understanding, vol. 110, no. 3, pp. 346-359, 2008.
[28] F. Tang, S.H. Lim, N.L. Change, and H. Tao, "A Novel Feature Descriptor Invariant to Complex Brightness Changes," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 2631-2638, 2009.
[29] R. Gupta, H. Patil, and A. Mittal, "Robust Order-Based Methods for Feature Description," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 334-341, 2010.
[30] W.T. Freeman and E.H. Adelson, "The Design and Use of Steerable Filters," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 13, no. 9, pp. 891-906, Sept. 1991.
[31] L.V. Gool, T. Moons, and D. Ungureanu, "Affine/Photometric Invariants for Planar Intensity Patterns," Proc. Fourth European Conf. Computer Vision, pp. 642-651, 1996.
[32] I.T. Joliffe, Principal Component Analysis. Springer-Verlag, 1986.
[33] S. Winder and M. Brown, "Learning Local Image Descriptors," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 1-8, 2007.
[34] S. Winder, G. Hua, and M. Brown, "Picking the Best DAISY," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 178-185, 2009.
[35] A.C. Berg and J. Malik, "Geometric Blur for Template Matching," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 607-614, 2001.
[36] P.-E. Forssen and D.G. Lowe, "Shape Descriptor for Maximally Stable Extremal Regions," Proc. IEEE 11th Int'l Conf. Computer Vision, 2007.
[37] A.K. Jain, Fundamentals of Digital Image Signal Processing. Prentice-Hall, 1989.
[38] A. Mittal and V. Ramesh, "An Intensity-Augmented Ordinal Measure for Visual Correspondence," Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 1, pp. 849-856, 2006.
[39] R. Gupta and A. Mittal, "Illumination and Affine—Invariant Point Matching Using an Ordinal Approach," Proc. IEEE 11th Int'l Conf. Computer Vision, 2007.
[40] T. Ojala, M. Pietikainen, and D. Harwood, "A Comparative Study of Texture Measures with Classification Based on Feature Distributions," Pattern Recognition, vol. 29, pp. 51-59, 1996.
[41] T. Harada, H. Nakayama, and Y. Kuniyoshi, "Improving Local Descriptors by Embedding Global and Local Spatial Information," Proc. 11th European Conf. Computer Vision, pp. 736-749, 2010.
[42] H. Cheng, Z. Liu, N. Zheng, and J. Yang, "A Deformable Local Image Descriptor," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2008.
[43] http://lear.inrialpes.fr/peoplemikolajczyk /, 2011.
[44] http://www.robots.ox.ac.uk/~vgg/research affine/, 2012.
[45] P. Moreels and P. Perona, "Evaluation of Features Detectors and Descriptors Based on 3D Objects," Int'l J. Computer Vision, vol. 73, no. 3, pp. 263-284, 2007.
[46] http://www.cs.ubc.ca/~mbrown/patchdatapatchdata.html , 2012.
[47] http://www.vision.ee.ethz.ch/datasetsindex.en.htm , 2011.
[48] D. Nistér and H. Stewénius, "Scalable Recognition with a Vocabulary Tree," Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 2, pp. 2161-2168, 2006.
47 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool