• Publication
  • 1998
  • Issue No. 5 - May
  • Abstract - A Volumetric/Iconic Frequency Domain Representation for Objects With Application for Pose Invariant Face Recognition
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
A Volumetric/Iconic Frequency Domain Representation for Objects With Application for Pose Invariant Face Recognition
May 1998 (vol. 20 no. 5)
pp. 449-457

Abstract—A novel method for representing 3D objects that unifies viewer and model centered object representations is presented. A unified 3D frequency-domain representation (called Volumetric Frequency Representation—VFR) encapsulates both the spatial structure of the object and a continuum of its views in the same data structure. The frequency-domain image of an object viewed from any direction can be directly extracted employing an extension of the Projection Slice Theorem, where each Fourier-transformed view is a planar slice of the volumetric frequency representation. The VFR is employed for pose-invariant recognition of complex objects, such as faces. The recognition and pose estimation is based on an efficient matching algorithm in a four-dimensional Fourier space. Experimental examples of pose estimation and recognition of faces in various poses are also presented.

[1] D.H. Ballard and C.M. Brown, Computer Vision, chap. 8. Englewood Cliffs, N.J.: Prentice-Hall, Inc., 1982.
[2] A.H. Barr, "Superquadrics and Angle Preserving Transformations," IEEE Computer Graphics and Applications, vol. 1, pp. 11-23, 1981.
[3] J. Ben-Arie and Z. Wang, “Pictorial Recognition Using Affine-Invariant Spectral Signatures,” Proc. IEEE CS Conf. Computer Vision and Pattern Recognition, pp. 34-39, San Juan, P.R., June 1997.
[4] J. Ben-Arie, Z. Wang, and K.R. Rao, "Affine Invariant Shape Representation and Recognition using Gaussian Kernels and Multi-dimensional Indexing," Proc. 1996 IEEE Int'l Conf. Speech, Acoustics, and Signal Processing (ICASSP '96), vol. 6, pp. 3,470-3,473,Atlanta, May 1996.
[5] J. Ben-Arie, Z. Wang, and K.R. Rao, "Iconic Recognition With Affine-Invariant Spectral Signatures," Proc. 1996 IAPR/IEEE Int'l Conf. Pattern Recognition (ICPR '96), vol. 1, pp. 672-676,Vienna, Austria, Aug. 1996.
[6] J. Ben-Arie, Z. Wang, and K.R. Rao, "Iconic Representation and Recognition Using Affine-Invariant Spectral Signatures," Proc. ARPA Image Understanding Workshop 1996, pp. 1,277-1,286,Palm Springs, Calif., Feb. 1996.
[7] D.J. Beymer, "Face Recognition Under Varying Pose," Technical Report A.I. Memo No. 1461, MIT Artificial Intelligence Lab., Dec. 1993.
[8] M. Brady, J. Ponce, A. Yuille, and H. Asada, "Describing Surfaces," Computer Vision, Graphics, and Image Processing, vol. 32, no. 1, pp. 1-28, Oct. 1985.
[9] R. Brunelli and T. Poggio, "Face Recognition: Features vs. Templates," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 15, no. 10, pp. 1,042-1,053, Oct. 1993.
[10] I. Craw, D. Tock, and A. Bennet, "Finding Face Features," Proc. European Conf. Computer Vision, pp. 92-96, 1992.
[11] D. DeMers and G.W. Cottrell, "Non-Linear Dimensionality Reduction," Advances in Neural Information Processing Systems 5, S.J. Hanson, J.D. Cowan, and C.L. Giles, eds., pp. 580-587.San Mateo, Calif.: Morgan Kaufmann Publishers, 1993.
[12] R.M. Haralick and L. Shapiro, "Object Models and Matching," Computer and Robotic Vision, vol. 2, chap. 18. Addison-Wesley Publishing Company, Inc., 1993.
[13] M.K. Hu, "Visual Pattern Recognition by Moment Invariants," A.K. Agarwal, R.O. Duda, and A. Rosenfeld, eds., Computer Methods in Image Analysis.Los Alam Calif.: IEEE CS Press, 1977.
[14] A.K. Jain, "Image Reconstruction From Projections," Fundamentals of Digital Image Processing, chap. 10, pp. 431-475.Englewood Cliffs, N.J.: Prentice-Hall Inc., N.J., 1989.
[15] R. Jain, R. Kasturi, and B.G. Schunk, "Object Recognition," Machine Vision, chap. 15. New York, N.Y.: McGraw-Hill, Inc., 1995.
[16] J.J. Koenderink and A.J. Van Doorn, "The Internal Representation of Solid Shape With Respect to Vision," Biological Cybernetics, vol. 32, pp. 211-216, 1979.
[17] M. Lades, J.C. Vorbruggen, J. Buhmann, J. Lange, C. von der Malsburg, R.P. Wurtz, and W. Konen, “Distortion Invariant Object Recognition in the Dynamic Link Architecture,” IEEE Trans. Computers, vol. 42, no. 3, pp. 300-311, Mar. 1993.
[18] S. Lawrence, C.L. Giles, A.C. Tsoi, and A.D. Back, “Face Recognition: A Convolutional Neural-Network Approach,” IEEE Trans. Neural Networks, vol. 8, pp. 98-113, 1997.
[19] C. Nastar and A.P. Pentland, "Matching and Recognition Using Deformable Intensity Surfaces," Proc. 1995 IEEE Int'l Symp. Computer Vision,Coral Gables, Fla., May 1995.
[20] Olivetti and Oracle Research Laboratory, "The Orl Database of Faces," http://www.cam-orl.co.ukfacedatabase.html , 1994.
[21] A.P. Pentland, "Perceptual Organization and the Representation of Natural Form," Artificial Intelligence, vol. 28, pp. 29-73, 1986.
[22] A. Pentland, B. Moghaddam, and Starner, "View-Based and Modular Eigenspaces for Face Recognition," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 1994, pp. 84-91.
[23] D. Reisfeld and Y. Yeshurun, "Robust Detection of Facial Features by Generalized Symmetry," Proc. 1992 IEEE/IAPR Int'l Conf. Pattern Recognition, vol. 1, pp. 117-120,The Hague, The Netherlands, 1992.
[24] A. Rosenfeld and A.C. Kak, Digital Image Processing.New York, N.Y.: Academic Press, 1982.
[25] F. Samaria and A. Harter, "Parameterisation of a Stochastic Model for Human Face Identification," Proc. Second IEEE Workshop Applications of Computer Vision,Sarasota, Fla., Dec. 1994.
[26] S. Budkowski, “Estelle Development Toolset (EDT),” Computer Networks and ISDN Systems, vol. 25, no. 1, pp. 63–82, 1992.
[27] Z. Wang and J. Ben-Arie, "SVD and Log-Log Frequency Sampling With Gabor Kernels for Invariant Pictorial Recognition," Proc. 1997 IEEE Int'l Conf. Image Processing (ICIP '97), vol. III, pp. 162-165,Santa Barbara, Calif., Oct.26-29 1997.
[28] J. Weng, N. Ahuja, and T.S. Huang, "Learning Recognition and Segmentation of 3D Objects from 2D Images," Proc. 1993 IEEE Int'l Conf. Computer Vision (ICCV '93), pp. 121-128, 1993.
[29] A.L. Yuille, P.W. Halliman, and D.S. Cohen, "Feature Extraction From Faces Using Deformable Templates," Int'l J. Computer Vision, vol. 8, no. 2, pp. 99-111, 1992.
[30] C.T. Zahn and R.Z. Roskies, "Fourier Descriptors for Plane Closed Curves," IEEE Trans. Computers, vol. 21, pp. 269-281, 1972.

Index Terms:
Volumetric frequency representation (VFR), object representation, projection-slice theorem, 4D Fourier space, face pose estimation, pose invariant face recognition.
Citation:
Jezekiel Ben-Arie, Dibyendu Nandy, "A Volumetric/Iconic Frequency Domain Representation for Objects With Application for Pose Invariant Face Recognition," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 20, no. 5, pp. 449-457, May 1998, doi:10.1109/34.682175
Usage of this product signifies your acceptance of the Terms of Use.