This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
3-D Motion Estimation in Model-Based Facial Image Coding
June 1993 (vol. 15 no. 6)
pp. 545-555

An approach to estimating the motion of the head and facial expressions in model-based facial image coding is presented. An affine nonrigid motion model is set up. The specific knowledge about facial shape and facial expression is formulated in this model in the form of parameters. A direct method of estimating the two-view motion parameters that is based on the affine method is discussed. Based on the reasonable assumption that the 3-D motion of the face is almost smooth in the time domain, several approaches to predicting the motion of the next frame are proposed. Using a 3-D model, the approach is characterized by a feedback loop connecting computer vision and computer graphics. Embedding the synthesis techniques into the analysis phase greatly improves the performance of motion estimation. Simulations with long image sequences of real-world scenes indicate that the method not only greatly reduces computational complexity but also substantially improves estimation accuracy.

[1] B. L. Yen and T. S. Huang, "Determining 3-D motion and structure of a rigid body using straight line correspondences,"Image Sequence Processing and Dynamic Scene Analysis. Heidelberg, Germany: Springer-Verlag, 1983.
[2] B. K. P. Horn,Robot Vision. Cambridge, MA: M.I.T. Press, 1986.
[3] A. Sommerfeld,Mechanics of Deformable Bodies, 1950.
[4] W.M. Newman and R.F. Sproull,Principles of Interactive Computer Graphics, 2nd Ed., McGraw Hill, Amsterdam, 1979.
[5] N. Thalmann and D. Thalmann,Image Synthesis, Theory and Practice, Springer-Verlag, New York, 1987, pp. 156-169.
[6] J. K. Aggarwal and N. Nandhakumar, "On the computation of motion from sequences of images--A review,"Proc. IEEE, vol. 76, no. 8, pp. 917-935, 1988.
[7] R. Tsai and T. Huang, "Estimating 3-D motion parameters of a rigid planar patch, i,"IEEE Trans. Acous. Speech Signal Processing, vol. ASSP-29, pp. 1147-1152, Dec. 1981.
[8] I.K. Sethi and R. Jain, "Finding trajectories of feature points in a monocular image sequence,"IEEE Trans. Pattern Anal. Machine Intell., vol. PAMI-9, pp. 56-73, Jan. 1987.
[9] G. Adiv, "Inherent ambiguities in recovering 3D motion and structure from a noisy flow field,"IEEE Trans. Patt. Anal. Machine Intell., vol. 11, no. 5, pp. 477-489, May 1989.
[10] T. Broida and R. Chellappa, "Estimation of object motion parameters from noisy images,"IEEE Trans. Pattern Anal. Machine Intell, vol. PAMI-8, no. 1, Jan. 1986.
[11] B. K. P. Horn and J. Weldon, "Robust algorithms for direct motion perception," inProc. 1st ICCV, 1987.
[12] R. Forchheimer and O. Fahlander, "Low bit-rate coding through animation," inProc. Picture Coding Symp. (PCS-83)(Davis), Mar. 1983, pp. 113-114.
[13] R. Forchheimer, "The motion estimation problem in semantic image coding," inProc. Picture Coding Symp. (PCS-87)(Stockholm), June 1987, pp. 171-172.
[14] R. Forchheimer and T. Kronander, "Image Coding--From Waveforms to Animation,"IEEE Trans. Acoustics, Speech and Image Processing, Vol. 37, No. 12, Dec. 1989, pp. 2008-2023.
[15] P. Roivainen, "Motion estimation in model-based coding of human faces," Licentiate Thesis LIU-TEK-LIC-1990:25, ISY, Linköping Univ., Sweden, 1990.
[16] A. N. Netravali and J. Salz, "Algorithms for estimation of three-dimensional motion,"AT&T Techn. J., vol. 64, no. 2, Feb. 1985.
[17] B. Welsh, "Model-based coding of images," Ph.D. dissertation, British Telecom Res. Lab., Jan. 1991.
[18] K. Aizawaet al., "Model-based synthesis image coding system--Modeling a person's face and synthesis of facial expressions," inProc. GLOBECOM-87, Nov. 1987, pp. 45-49.
[19] H. G. Musmann, M. Hotter, and J. Ostermann, "Object-oriented analysis-synthesis coding of moving images,"Image Commun., vol. 1, no. 2, pp. 117-138, Oct. 1989.
[20] J. Yau and N. Duffy, "A texture mapping approach to 3D facial image synthesis,"Comput. Graphics Forum, no. 7, pp. 129-134, 1988.
[21] M. Rydfalk, "CANDIDE: A Parameterized face," Dep. Elec. Eng. Rep. LiTH-ISY-I-0866, Linköping Univ., Oct. 1987.
[22] C.-H. Hjortsjö, "Human face and the mimical language,"Studentlitteratur, Sweden, 1969.
[23] P. Ekman and W. Friesen,Facial Action Coding System. Palo Alto, CA: Consulting Psychologists, 1977.
[24] M. Kaneko, A. Koike, and Y. Hatori, "Coding of facial image sequence based on a 3D model of the head and motion detection,"J. Visual Commun. Image Represent., vol. 2, no. 1, pp. 39-54, Mar. 1991.
[25] D. Terzopoulos and K. Waters, "Analysis of dynamic facial images using physical and anatomical models," inProc. Third Int. Conf. Comput. Vision(Osaka, Japan), 1990, pp. 306-331.
[26] D. E. Pearson, "Texture-mapping in model-based image coding,"Image Commun., vol. 2, no. 4, pp. 377-395, Dec. 1990.
[27] T. S. Huang, "Modeling, analysis and visualization of nonrigid object motion," inInt. Conf. Patt. Recognition, June 1990.
[28] D. Terzopoulos, A. Witkin, and M. Kass, "Constraints on deformable models: Recovering 3D shape and nonrigid motion,"Artificial Intell., vol. 36, pp. 91-123, 1988.
[29] S. Chen and M. Penna, "Shape and motion of nonrigid bodies,"Comput. Vision Graphics Image Processing, vol. 36, pp. 175-207, 1986.
[30] D. Shulman and J. Y. Aloimonos, "Nonrigid motion interpretation: A regularized approach,"Proc. Roy. Soc. London, vol. B233, pp. 217-234, 1988.
[31] A. Pentland and B. Horowitz, "Recovery of nonrigid motion and structure,"IEEE Trans. Patt. Anal. Machine Intell., vol. 13, no. 7, pp. 730-742, July 1991.
[32] H. Li, P. Roivainen, and R. Forchheimer, "3D motion estimation in model-based facial image coding," Dept. Elec. Eng. Rep. LiTH-ISY-I- 1278, Linköping Univ., Oct. 1991.
[33] F. Kappei and C.-E. Liedtke, "Modelling of a natural 3D scene consisting of moving objects from a sequence of monocular TV images,"Proc. SPIE, vol. 860, pp. 126, 1987.
[34] H. Busch, "Subdividing nonrigid 3D objects into quasi rigid parts," inProc. IEE 3rd Int. Conf. Image Processing Applications(Warwick, UK), 1989.

Index Terms:
parameter estimation; 3-D motion estimation; model-based facial image coding; affine nonrigid motion model; facial shape; facial expression; two-view motion parameters; computer vision; computer graphics; synthesis techniques; computational complexity; estimation accuracy; image coding; motion estimation; parameter estimation
Citation:
H. Li, P. Roivainen, R. Forcheimer, "3-D Motion Estimation in Model-Based Facial Image Coding," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 15, no. 6, pp. 545-555, June 1993, doi:10.1109/34.216724
Usage of this product signifies your acceptance of the Terms of Use.