This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
IEEE International Workshop on Analysis and Modeling of Faces and Gestures
Boosted Audio-Visual HMM for Speech Reading
Nice, France
October 17-October 17
ISBN: 0-7695-2010-3
Pei Yin, Georgia Institute of Technology
Irfan Essa, Georgia Institute of Technology
James M. Rehg, Georgia Institute of Technology
We propose a new approach for combining acoustic and visual measurements to aid in recognizing lip shapes of a person speaking. Our method relies on computing the maximum likelihoods of (a) HMM used to model phonemes from the acoustic signal, and (b) HMM used to model visual features motions from video. One significant addition in this work is the dynamic analysis with features selected by Ad-aBoost, on the basis of their discriminant ability. This form of integration, leading to boosted HMM, permits AdaBoost to find the best features first, and then uses HMM to exploit dynamic information inherent in the signal.
Citation:
Pei Yin, Irfan Essa, James M. Rehg, "Boosted Audio-Visual HMM for Speech Reading," amfg, pp.68, IEEE International Workshop on Analysis and Modeling of Faces and Gestures, 2003
Usage of this product signifies your acceptance of the Terms of Use.