Issue No.07 - July (2012 vol.34)
pp: 1423-1436
Ritwik Kumar, IBM Research, Almaden
Arunava Banerjee, University of Florida, Gainesville
Baba C. Vemuri, University of Florida, Gainesville
Hanspeter Pfister, Harvard University, Cambridge
ABSTRACT
In this paper, we present a novel image classification system built around a core of trainable filter ensembles that we call Volterra kernel classifiers. Our system treats an image as a collection of possibly overlapping patches and is composed of three components: 1) A scheme for single-patch classification that seeks a smooth, possibly nonlinear, functional mapping of the patches into a range space where patches of the same class are close to one another and patches from different classes are far apart, in the L_2 sense. This mapping is accomplished using trainable convolution filters (or Volterra kernels), where the convolution kernel can be of any shape or order. 2) Given a corpus of Volterra classifiers with various kernel orders and shapes for each patch, a boosting scheme that automatically selects the best weighted combination of the classifiers to achieve a higher per-patch classification rate. 3) A scheme that aggregates the per-patch classification results by voting to classify the parent image. We demonstrate the effectiveness of the proposed technique using face recognition as an application area and provide extensive experiments on the Yale, CMU PIE, Extended Yale B, Multi-PIE, and MERL Dome benchmark face data sets. We call the Volterra kernel classifiers applied to face recognition Volterrafaces. We show that our technique, which falls into the broad class of embedding-based face image discrimination methods, consistently outperforms various state-of-the-art methods in the same category.
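As a rough illustration of components 1) and 3), the sketch below evaluates a truncated discrete Volterra filter (first-order term plus an optional second-order term) over a patch and then combines per-patch labels by majority vote. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names `volterra_response` and `vote_image_label`, the square kernel shape, and the use of plain majority voting are all hypothetical choices made for this example, and the kernels are taken as given rather than learned. The boosting step of component 2) is not shown.

```python
import numpy as np
from collections import Counter


def volterra_response(patch, k1, k2=None):
    """Slide a truncated Volterra filter over a 2D patch ('valid' positions only).

    patch : 2D array of pixel intensities.
    k1    : (b, b) first-order (linear) kernel.
    k2    : optional (b*b, b*b) second-order kernel acting on pixel pairs.

    Hypothetical helper: the kernel is applied without flipping
    (correlation form), which is equivalent up to a relabeling of
    the kernel entries when the kernel is trainable.
    """
    b = k1.shape[0]
    h, w = patch.shape
    out = np.empty((h - b + 1, w - b + 1))
    for i in range(h - b + 1):
        for j in range(w - b + 1):
            x = patch[i:i + b, j:j + b].ravel()
            y = k1.ravel() @ x          # first-order term: sum_a k1[a] * x[a]
            if k2 is not None:
                y += x @ k2 @ x         # second-order term: sum_{a,c} k2[a,c] * x[a] * x[c]
            out[i, j] = y
    return out


def vote_image_label(patch_labels):
    """Assumed aggregation rule: the image takes the most frequent patch label."""
    return Counter(patch_labels).most_common(1)[0][0]


if __name__ == "__main__":
    patch = np.arange(9, dtype=float).reshape(3, 3)
    linear = volterra_response(patch, np.ones((2, 2)))
    quadratic = volterra_response(patch, np.ones((2, 2)), np.eye(4))
    print(linear)
    print(quadratic)
    print(vote_image_label(["subject_a", "subject_b", "subject_a"]))
```

With a first-order kernel only, the mapping reduces to an ordinary linear filter; the second-order kernel adds products of pixel pairs, which is what makes the mapping nonlinear in the patch while remaining linear in the kernel coefficients.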
INDEX TERMS
Face recognition, convolution, filtering classifier, Volterra kernels, Fisher's linear discriminant, boosting.
CITATION
Ritwik Kumar, Arunava Banerjee, Baba C. Vemuri, Hanspeter Pfister, "Trainable Convolution Filters and Their Application to Face Recognition", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol.34, no. 7, pp. 1423-1436, July 2012, doi:10.1109/TPAMI.2011.225