A Volumetric/Iconic Frequency Domain Representation for Objects With Application for Pose Invariant Face Recognition
May 1998 (vol. 20 no. 5)
pp. 449-457
Abstract: A novel method for representing 3D objects that unifies viewer-centered and model-centered object representations is presented. A unified 3D frequency-domain representation, called the Volumetric Frequency Representation (VFR), encapsulates both the spatial structure of the object and a continuum of its views in the same data structure. The frequency-domain image of an object viewed from any direction can be extracted directly by employing an extension of the Projection Slice Theorem, where each Fourier-transformed view is a planar slice of the volumetric frequency representation. The VFR is employed for pose-invariant recognition of complex objects, such as faces. Recognition and pose estimation are based on an efficient matching algorithm in a four-dimensional Fourier space. Experimental examples of pose estimation and recognition of faces in various poses are also presented.
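The slice relationship mentioned in the abstract can be illustrated with the classical Projection Slice Theorem. The following is a minimal sketch, not the paper's extended formulation: it assumes a plain density volume, orthographic projection along a coordinate axis, and NumPy's discrete Fourier transform. The function names and the random test volume are hypothetical and chosen only for illustration.

```python
import numpy as np

def view_spectrum_from_projection(volume, axis):
    """Project the volume along `axis`, then take the 2D DFT of that view."""
    projection = volume.sum(axis=axis)   # orthographic view (assumption)
    return np.fft.fft2(projection)

def view_spectrum_from_slice(volume, axis):
    """Take the 3D DFT once, then read the zero-frequency plane normal to `axis`."""
    spectrum = np.fft.fftn(volume)
    return np.take(spectrum, 0, axis=axis)

# Hypothetical stand-in for an object volume; not data from the paper.
rng = np.random.default_rng(0)
volume = rng.random((8, 8, 8))

for axis in range(3):
    from_projection = view_spectrum_from_projection(volume, axis)
    from_slice = view_spectrum_from_slice(volume, axis)
    # The two spectra agree up to floating-point error.
    print(axis, np.allclose(from_projection, from_slice))
```

In this sketch the 3D spectrum is computed once and each axis-aligned view is read out as a plane through the origin, which is the sense in which a single volumetric structure can hold many views. Views from arbitrary directions would need interpolation of an oblique plane through the origin, which the sketch does not attempt.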
Index Terms:
Volumetric frequency representation (VFR), object representation, projection-slice theorem, 4D Fourier space, face pose estimation, pose invariant face recognition.
Citation:
Jezekiel Ben-Arie, Dibyendu Nandy, "A Volumetric/Iconic Frequency Domain Representation for Objects With Application for Pose Invariant Face Recognition," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 20, no. 5, pp. 449-457, May 1998, doi:10.1109/34.682175