The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.02 - February (2011 vol.33)
pp: 338-352
Hongping Cai , National University of Defense Technology, Changsha
Krystian Mikolajczyk , University of Surrey, Guildford
Jiri Matas , Czech Technical University, Prague
ABSTRACT
In this paper, we present Linear Discriminant Projections (LDP) for reducing dimensionality and improving discriminability of local image descriptors. We place LDP into the context of state-of-the-art discriminant projections and analyze its properties. LDP requires a large set of training data with point-to-point correspondence ground truth. We demonstrate that training data produced by a simulation of image transformations leads to nearly the same results as the real data with correspondence ground truth. This makes it possible to apply LDP as well as other discriminant projection approaches to the problems where the correspondence ground truth is not available, such as image categorization. We perform an extensive experimental evaluation on standard data sets in the context of image matching and categorization. We demonstrate that LDP enables significant dimensionality reduction of local descriptors and performance increases in different applications. The results improve upon the state-of-the-art recognition performance with simultaneous dimensionality reduction from 128 to 30.
INDEX TERMS
Linear discriminant projections, dimensionality reduction, image descriptors, image recognition, image matching.
CITATION
Hongping Cai, Krystian Mikolajczyk, Jiri Matas, "Learning Linear Discriminant Projections for Dimensionality Reduction of Image Descriptors", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol.33, no. 2, pp. 338-352, February 2011, doi:10.1109/TPAMI.2010.89
REFERENCES
[1] A.C. Berg and J. Malik, "Geometric Blur for Template Matching," Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 1, pp. I-607-I-614, 2001.
[2] S. Yan, D. Xu, B. Zhang, H.-J. J. Zhang, Q. Yang, and S. Lin, "Graph Embedding and Extensions: A General Framework for Dimensionality Reduction," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 29, no. 1, pp. 40-51, Jan. 2007.
[3] J.B. Tenenbaum, V. de Silva, and J.C. Langford, "A Global Geometric Framework for Nonlinear Dimensionality Reduction," Science, vol. 290, no. 5500, pp. 2319-2323, 2000.
[4] S.T. Roweis and L.K. Saul, "Nonlinear Dimensionality Reduction by Locally Linear Embedding," Science, vol. 290, no. 5500, pp. 2323-2326, 2000.
[5] M. Belkin and P. Niyogi, "Laplacian Eigenmaps and Spectral Techniques for Embedding and Clustering," Proc. Advances in Neural Information Processing Systems, vol. 14, pp. 585-591, 2001.
[6] X. He and P. Niyogi, "Locality Preserving Projections," Proc. Advances in Neural Information Processing Systems, 2003.
[7] E.P. Xing, A.Y. Ng, M.I. Jordan, and S. Russell, "Distance Metric Learning, with Application to Clustering with Side-Information," Proc. Advances in Neural Information Processing Systems, vol. 15, pp. 505-512, 2002.
[8] H.-T. Chen, H.-W. Chang, and T.-L. Liu, "Local Discriminant Embedding and Its Variants," Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 2, pp. 846-853, 2005.
[9] K.Q. Weinberger, J. Blitzer, and L. Saul, "Distance Metric Learning for Large Margin Nearest Neighbor Classification," Proc. Advances in Neural Information Processing Systems, vol. 18, pp. 1473-1480, 2006.
[10] S. Winder, G. Hua, and M. Brown, "Picking the Best DAISY," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 178-185, 2009.
[11] G. Hua, M. Brown, and S. Winder, "Discriminant Embedding for Local Image Descriptors," Proc. IEEE Int'l Conf. Computer Vision, pp. 1-8, 2007.
[12] K. Mikolajczyk and J. Matas, "Improving Descriptors for fast Tree Matching by Optimal Linear Projection," Proc. IEEE Int'l Conf. Computer Vision, pp. 1-8, 2007.
[13] H. Cai, K. Mikolajczyk, and J. Matas, "Learning Linear Discriminant Projections for Dimensionality Reduction of Image Descriptors," Proc. British Machine Vision Conf., 2008.
[14] D.G. Lowe, "Distinctive Image Features from Scale-Invariant Keypoints," Int'l J. Computer Vision, vol. 60, no. 2, pp. 91-110, Nov. 2004.
[15] K. Mikolajczyk and C. Schmid, "A Performance Evaluation of Local Descriptors," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 27, no. 10, pp. 1615-1630, Oct. 2005.
[16] K. Mikolajczyk, T. Tuytelaars, C. Schmid, A. Zisserman, J. Matas, F. Schaffalitzky, T. Kadir, and L. Van Gool, "A Comparison of Affine Region Detectors," Int'l J. Computer Vision, vol. 65, nos. 1/2, pp. 43-72, 2005.
[17] E. Tola, V. Lepetit, and P. Fua, "A Fast Local Descriptor for Dense Matching," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 1-8, 2008.
[18] V. Lepetit and P. Fua, "Keypoint Recognition Using Randomized Trees," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 28, no. 9, pp. 1465-1479, Sept. 2006.
[19] Y. Ke and R. Sukthankar, "Pca-Sift: A More Distinctive Representation for Local Image Descriptors," Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 2, pp. 506-513, 2004.
[20] K. Grauman and T. Darrell, "The Pyramid Match Kernel: Discriminative Classification with Sets of Image Features," Proc. IEEE Int'l Conf. Computer Vision, vol. 2, pp. 1458-1465, 2005.
[21] J.C. van Gemert, J.M. Geusebroek, C.J. Veenman, and A.W.M. Smeulders, "Kernel Codebooks for Scene Categorization," Proc. European Conf. Computer Vision, 2008.
[22] T. Tuytelaars and K. Mikolajczyk, "Local Invariant Feature Detectors: A Survey," Foundations and Trends in Computer Graphics and Vision, vol. 3, no. 3, pp. 177-280, 2008.
[23] W. Triggs, "Detecting Keypoints with Stable Position, Orientation, and Scale under Illumination Changes," Proc. European Conf. Computer Vision, pp. 100-113, 2004.
[24] L. Fei-Fei and P. Perona, "A Bayesian Hierarchical Model for Learning Natural Scene Categories," Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 2, pp. 524-531, 2005.
[25] S. Lazebnik, C. Schmid, and J. Ponce, "Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 2169-2178, 2006.
[26] A. Bar-Hillel, T. Hertz, N. Shental, and D. Weinshall, "Learning Distance Functions Using Equivalence Relations," Proc. Int'l Conf. Machine Learning, pp. 11-18, 2003.
[27] K. Fukunaga, Introduction to Statistical Pattern Recognition, second ed. Academic Press Professional, Inc., 1990.
[28] J. Matas, O. Chum, U. Martin, and T. Pajdla, "Robust Wide Baseline Stereo from Maximally Stable Extremal Regions," Proc. British Machine Vision Conf., vol. 1, pp. 384-393, 2002.
[29] L.F. Fei, R. Fergus, and P. Perona, "Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories," Proc. Computer Vision and Pattern Recognition Workshop Generative Model Based Vision, pp. 178-178, 2004.
[30] P. Jain, B. Kulis, and K. Grauman, "Fast Image Search for Learned Metrics," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 1-8, 2008.
[31] Data, Feature Detectors, Evaluation Protocols, http:/www. featurespace.org, 2010.
[32] M. Everingham, L. Van Gool, C.K.I. Williams, J. Winn, and A. Zisserman, "The PASCAL Visual Object Classes Challenge Results," http://www.pascal-network.org/challenges VOC/, 2010.
[33] J. Goldberger, S. Roweis, G. Hinton, and R. Salakhutdinov, "Neighborhood Component Analysis," Proc. Advances in Neural Information Processing Systems, 2004.
[34] M. Schultz and T. Joachims, "Learning a Distance Metric from Relative Comparisons," Proc. Advances in Neural Information Processing Systems, 2004.
[35] L. Yang and R. Jin, "Contents Distance Metric Learning: A Comprehensive Survey," technical report, Dept. of Computer Science and Eng., Michigan State Univ., 2006.
[36] M.A. Tahir, K. van de Sande, J. Uijlings, F. Yan, X. Li, K. Mikolajczyk, J. Kittler, T. Gevers, and A. Smeulders, "Surrey uva_srkda Method," Proc. Pascal Visual Object Classes Challenge Workshop, 2008.
[37] J. Liu and M. Shah, "Scene Modeling Using Co-Clustering," Proc. IEEE Int'l Conf. Computer Vision, pp. 1-7, 2007.
[38] A. Bosch, A. Zisserman, and X. Muoz, "Scene Classification Using a Hybrid Generative/Discriminative Approach," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 30, no. 4, pp. 712-727, Apr. 2008.
[39] S. Fidler, M. Boben, and A. Leonardis, "Similarity-Based Cross-Layered Hierarchical Representation for Object Categorization," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 1-8, 2008.
[40] A. Frome, Y. Singer, F. Sha, and J. Malik, "Learning Globally-Consistent Local Distance Functions for Shape-Based Image Retrieval and Classification," Proc. IEEE Int'l Conf. Computer Vision, pp. 1-8, 2007.
[41] G. Wang, D. Hoiem, and D. Forsyth, "Learning Image Similarity from Flickr Groups Using Stochastic Intersection Kernel Machines," Proc. IEEE Int'l Conf. Computer Vision, 2009.
[42] F.S. Khan, J. van de Weijer, and M. Vanrell, "Top-Down Color Attention for Object Recognition," Proc. IEEE Int'l Conf. Computer Vision, 2009.
[43] K. Grauman and T. Darrell, "The Pyramid Match Kernel: Efficient Learning with Sets of Features," J. Machine Learning Research, vol. 8, no. 4, pp. 725-760, 2007.
[44] A.C. Berg, T.L. Berg, and J. Malik, "Shape Matching and Object Recognition Using Low Distortion Correspondences," Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 1, pp. 26-33, 2005.
[45] H. Zhang, A.C. Berg, M. Maire, and J. Malik, "Svm-knn: Discriminative Nearest Neighbor Classification for Visual Category Recognition," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 2126-2136, 2006.
[46] K.E.A. van de Sande, T. Gevers, and C.G.M. Snoek, "Evaluating Color Descriptors for Object and Scene Recognition," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 32, no. 9, pp. 1582-1596, Sept. 2010.
17 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool