The Community for Technology Leaders
Subscribe
Issue No.02 - February (2012 vol.34)
pp: 372-386
A. Wagner , Dept. of Electr. & Comput. Eng., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
J. Wright , Dept. of Electr. Eng., Columbia Univ., New York, NY, USA
A. Ganesh , Dept. of Electr. & Comput. Eng., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
Zihan Zhou , Dept. of Electr. & Comput. Eng., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
H. Mobahi , Comput. Sci. Dept., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
Yi Ma , Dept. of Electr. & Comput. Eng., Univ. of Illinois at Urbana- Champaign, Urbana, IL, USA
ABSTRACT
Many classic and contemporary face recognition algorithms work well on public data sets, but degrade sharply when they are used in a real recognition system. This is mostly due to the difficulty of simultaneously handling variations in illumination, image misalignment, and occlusion in the test image. We consider a scenario where the training images are well controlled and test images are only loosely controlled. We propose a conceptually simple face recognition system that achieves a high degree of robustness and stability to illumination variation, image misalignment, and partial occlusion. The system uses tools from sparse representation to align a test face image to a set of frontal training images. The region of attraction of our alignment algorithm is computed empirically for public face data sets such as Multi-PIE. We demonstrate how to capture a set of training images with enough illumination variation that they span test images taken under uncontrolled illumination. In order to evaluate how our algorithms work under practical testing conditions, we have implemented a complete face recognition system, including a projector-based training acquisition system. Our system can efficiently and effectively recognize faces under a variety of realistic conditions, using only frontal images under the proposed illuminations as training.
INDEX TERMS
image representation, face recognition, partial occlusion, face recognition system, robust alignment, robust illumination, sparse representation, public data sets, handling variations, image misalignment, illumination variation, Lighting, Face recognition, Image recognition, Databases, validation and outlier rejection., Face recognition, face alignment, illumination variation, occlusion and corruption, sparse representation, error correction
CITATION
A. Wagner, J. Wright, A. Ganesh, Zihan Zhou, H. Mobahi, Yi Ma, "Toward a Practical Face Recognition System: Robust Alignment and Illumination by Sparse Representation", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol.34, no. 2, pp. 372-386, February 2012, doi:10.1109/TPAMI.2011.112
REFERENCES
 [1] M. Turk and A. Pentland, "Eigenfaces for Recognition," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 1991. [2] P. Belhumeur, J. Hespanda, and D. Kriegman, "Eigenfaces versus Fisherfaces: Recognition Using Class Specific Linear Projection," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 19, no. 7, pp. 711-720, July 1997. [3] J. Wright, A. Yang, A. Ganesh, S. Sastry, and Y. Ma, "Robust Face Recognition via Sparse Representation," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 31, no. 2, pp. 210-227, Feb. 2009. [4] B. Amberg, A. Blake, A. Fitzgibbon, S. Romdhani, and T. Vetter, "Reconstructing High Quality Face-Surfaces Using Model Based Stereo," Proc. IEEE Int'l Conf. Computer Vision, pp. 1-8, 2007. [5] V. Blanz and T. Vetter, "Face Recognition Based on Fitting a 3D Morphable Model," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 25, no. 9, pp. 1063-1074, Sept. 2003. [6] T. Cootes, G. Edwards, and C. Taylor, "Active Appearance Models," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 23, no. 6, pp. 681-685, June 2001. [7] T. Cootes and C. Taylor, "Active Shape Models—'Smart Snakes'," Proc. British Machine Vision Conf., 1992. [8] B. Lucas and T. Kanade, "An Iterative Image Registration Technique with an Application to Stereo Vision," Proc. Int'l Joint Conf. Artificial Intelligence, vol. 3, pp. 674-679, 1981. [9] P. Belhumeur and G. Hager, "Tracking in 3D: Image Variability Decomposition for Recovering Object Pose and Illumination," Pattern Analysis and Applications, vol. 2, pp. 82-91, 1999. [10] H. Murase and S. Nayar, "Visual Learning and Recognition of 3D Objects from Appearance," Int'l J. Computer Vision, vol. 14, pp. 5-24, 1995. [11] E. Candès and T. Tao, "Decoding by Linear Programming," IEEE Trans. Information Theory, vol. 51, no. 12, pp. 4203-4215, Dec. 2005. [12] J. Wright and Y. Ma, "Dense Error Correction via $\ell^1$ -Minimization," IEEE Trans. Information Theory, vol. 56, no. 7, pp. 3540-3560, July 2010. [13] M. Osborne and R. Womersley, "Strong Uniqueness in Sequential Linear Programming," J. Australian Math. Soc., Series B, vol. 31, pp. 379-384, 1990. [14] K. Jittorntrum and M. Osborne, "Strong Uniqueness and Second Order Convergence in Nonlinear Discrete Approximation," Numerische Mathematik, vol. 34, pp. 439-455, 1980. [15] A.Y. Yang, A. Ganesh, Z. Zhou, S. Sastry, and Y. Ma, "Fast $\ell \_1$ -Minimization Algorithms and Application in Robust Face Recognition," Proc. Int'l Conf. Image Processing, 2010. [16] T. Chen, W. Yin, X. Zhou, D. Comaniciu, and T. Huang, "Total Variation Models for Variable Lighting Face Recognition," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 28, no. 9, pp. 1519-1524, Sept. 2006. [17] S. Zhou, G. Aggarwal, R. Chellappa, and D. Jacobs, "Appearance Characterization of Linear Lambertian Objects, Generalized Photometric Stereo, and Illumination-Invariant Face Recognition," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 29, no. 2, pp. 230-245, Feb. 2007. [18] A. Georghiades, P. Belhumeur, and D. Kriegman, "From Few to Many: Illumination Cone Models for Face Recognition under Variable Lighting and Pose," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 23, no. 6, pp. 643-660, June 2001. [19] P. Belhumeur and D. Kriegman, "What Is the Set of Images of an Object under All Possible Illumination Conditions?" Int'l J. Computer Vision, vol. 28, no. 3, pp. 245-260, 1998. [20] R. Basri and D. Jacobs, "Lambertian Reflectance and Linear Subspaces," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 25, no. 2, pp. 218-233, Feb. 2003. [21] P. Viola and M.J. Jones, "Robust Real-Time Face Detection," Int'l J. Computer Vision, vol. 57, pp. 137-154, 2004. [22] S. Baker and I. Matthews, "Lucas-Kanade 20 Years on: A Unifying Framework: Part 1: The Quantity Approximated, the Warp Update Rule, and the Gradient Descent Approximation," Int'l J. Computer Vision, vol. 56, no. 3, pp. 221-255, 2004. [23] L. Cromme, "Strong Uniqueness: A Far-Reaching Criterion for the Convergence Analysis of Iterative Procedures," Numerishe Mathematik, vol. 29, pp. 179-193, 1978. [24] A. Lewis and S. Wright, "A Proximal Method for Composite Minimization," technical report, Univ. of Wisconsin, 2008. [25] A. Wagner, J. Wright, A. Ganesh, Z. Zhou, and Y. Ma, "Toward a Practical Face Recognition System: Robust Pose and Illumination via Sparse Representation," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2009. [26] R. Gross, I. Matthews, J. Cohn, T. Kanade, and S. Baker, "Multi-PIE," Proc. IEEE Conf. Automatic Face and Gesture Recognition, 2008. [27] R. Gross, I. Matthews, and S. Baker, "Active Appearance Models with Occlusion," Image and Vision Computing, vol. 24, no. 6, pp. 593-604, 2006. [28] L. Wiskott, J. Fellous, N. Kuiger, and C. von der Malsburg, "Face Recognition by Elastic Bunch Graph Matching," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 19, no. 7, pp. 775-779, July 1997. [29] J. Huang, X. Huang, and D. Metaxas, "Simultaneous Image Transformation and Sparse Representation Recovery," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2008. [30] P. Debevec, T. Hawkins, C. Tchou, H. Duiker, W. Sarokin, and M. Sagar, "Acquiring the Reflectance Field of a Human Face," Proc. 27th Ann. Conf. Computer Graphics and Interactive Techniques, pp. 145-156, 2000. [31] A. Jones, A. Gardner, M. Bolas, I. McDowall, and P. Debevec, "Performance Geometry Capture for Spatially Varying Relighting," Proc. ACM Siggraph 2005 Sketches, p. 74, 2005. [32] V. Masselus, P. Dutré, and F. Anrys, "The Free-Form Light Stage," Proc. ACM Siggraph, p. 262, 2002. [33] L. Zhang, B. Curless, and S. Seitz, "Rapid Shape Acquisition Using Color Structured Light and Multi-Pass Dynamic Programming," Proc. First IEEE Int'l Symp. 3D Data Processing, Visualization, and Transmission, pp. 24-36, 2002. [34] Y. Schechner, S. Nayar, and P. Belhumeur, "Multiplexing for Optimal Lighting," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 29, no. 8, pp. 1339-1354, Aug. 2007. [35] J. Yang, K. Yu, and T. Huang, "Supervised Translation-Invariant Sparse Coding," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 1-8, 2010. [36] K. Lee, J. Ho, and D. Kriegman, "Acquiring Linear Subspaces for Face Recognition under Variable Lighting," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 27, no. 5, pp. 684-698, May 2005. [37] T. Ahonen, A. Hadid, and M. Pietikainen, "Face Description with Local Binary Patterns: Application to Face Recognition," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 28, no. 12, pp. 2037-2041, Dec. 2006. [38] X. Tan, S. Chen, Z.-H. Zhou, and F. Zhang, "Face Recognition from a Single Image per Person: A Survey," Pattern Recognition, vol. 39, pp. 1725-1745, 2006. [39] P. Phillips, H. Wechsler, J. Huang, and P. Rauss, "The FERET Database and Evaluation Procedure for Face-Recognition Algorithms," Image and Vision Computing, vol. 16, no. 5, pp. 295-306, 1998. [40] W. Deng, J. Hu, J. Guo, W. Cai, and D. Feng, "Robust, Accurate and Efficient Face Recognition from a Single Training Image: A Uniform Pursuit Approach," Pattern Recognition, vol. 43, pp. 1748-1762, 2010. [41] L. Wolf, T. Hassner, and Y. Taigman, "Descriptor Based Methods in the Wild," Proc. Faces in Real-Life Images Workshop in ECCV, 2008. [42] P. Phillips, W. Scruggs, A. O'Tools, P. Flynn, K. Bowyer, C. Schott, and M. Sharpe, "FRVT 2006 and ICE 2006 Large-Scale Results," Technical Report NISTIR 7408, NIST 2007. [43] A. Pentland, B. Moghaddam, and T. Starner, "View-Based and Modular Eigenspaces for Face Recognition," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 1994. [44] Z. Zhou, A. Wagner, J. Wright, H. Mobahi, and Y. Ma, "Face Recognition with Contiguous Occlusion Using Markov Random Fields," Proc. IEEE Int'l Conf. Computer Vision, pp. 1-8, 2009. [45] V. Cevher, M.F. Duarte, C. Hegde, and R.G. Baraniuk, "Sparse Signal Recovery Using Markov Random Fields," Proc. Neural Information and Processing Systems, 2008.