loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Fourth IEEE International Conference on Multimodal Interfaces (ICMI'02)
Prosody Based Co-analysis for Continuous Recognition of Coverbal Gestures
Pittsburgh, Pennsylvania
October 14-October 16
ISBN: 0-7695-1834-6
Sanshzar Kettebekov, Pennsylvania State University
Mohammed Yeasin, Pennsylvania State University
Rajeev Sharma, Pennsylvania State University
Although recognition of natural speech and gestures have been studied extensively, previous attempts of combining them in a unified framework to boost classification were mostly semantically motivated, e.g., keyword-gesture co-occurrence. Such formulations inherit the complexity of natural language processing. This paper presents a Bayesian formulation that uses a phenomenon of gesture and speech articulation for improving accuracy of automatic recognition of continuous coverbal gestures. The prosodic features from the speech signal were co-analyzed with the visual signal to learn the prior probability of co-occurrence of the prominent spoken segments with the particular kinematical phases of gestures. It was found that the above co-analysis helps in detecting and disambiguating small hand movements, which subsequently improves the rate of continuous gesture recognition. The efficacy of the proposed approach was demonstrated on a large database collected from the weather channel broadcast. This formulation opens new avenues for bottom-up frameworks of multimodal integration.
Index Terms:
Multimodal fusion, gesture recognition, gesture speech co-occurrence, prominence, prosody
Citation:
Sanshzar Kettebekov, Mohammed Yeasin, Rajeev Sharma, "Prosody Based Co-analysis for Continuous Recognition of Coverbal Gestures," icmi, pp.161, Fourth IEEE International Conference on Multimodal Interfaces (ICMI'02), 2002
Usage of this product signifies your acceptance of the Terms of Use.