loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
18th International Conference on Pattern Recognition (ICPR'06) Volume 3
Automatic Lipreading with Limited Training Data
Hong Kong
August 20-August 24
ISBN: 0-7695-2521-0
S.L. Wang, Shanghai Jiaotong University, Shanghai, CHINA
W.H. Lau, City University of Hong Kong, Kowloon, HONG KONG
S.H. Leung, Chinese University of Hong Kong, Shatin, HONG KONG
Speech recognition solely based on visual information such as the lip shape and its movement is referred to as lipreading. This paper presents an automatic lipreading technique for speaker dependent (SD) and speaker independent (SI) speech recognition tasks. Since the visual features are derived according to the frame rate of the video sequence, spline representation is then employed to translate the discrete-time sampled visual features into continuous domain. The spline coefficients in the same word class are constrained to have similar expression and can be estimated from the training data by the EM algorithm. In addition, an adaptive multi-model approach is proposed to overcome the variation caused by different speaking style in speaker-independent recognition task. The experiments are carried out to recognize the ten English digits and an accuracy of 96% for speaker dependent recognition and 88% for speaker independent recognition have been achieved, which shows the superiority of our approach compared with other classifiers investigated.
Citation:
S.L. Wang, W.H. Lau, S.H. Leung, "Automatic Lipreading with Limited Training Data," icpr, vol. 3, pp.881-884, 18th International Conference on Pattern Recognition (ICPR'06) Volume 3, 2006
Usage of this product signifies your acceptance of the Terms of Use.