This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
An HMM-Based Threshold Model Approach for Gesture Recognition
October 1999 (vol. 21 no. 10)
pp. 961-973

Abstract—The task of automatic gesture recognition is highly challenging due to the presence of unpredictable and ambiguous nongesture hand motions. In this paper, a new method is developed using the Hidden Markov Model based technique. To handle nongesture patterns, we introduce the concept of a threshold model that calculates the likelihood threshold of an input pattern and provides a confirmation mechanism for the provisionally matched gesture patterns. The threshold model is a weak model for all trained gestures in the sense that its likelihood is smaller than that of the dedicated gesture model for a given gesture. Consequently, the likelihood can be used as an adaptive threshold for selecting proper gesture model. It has, however, a large number of states and needs to be reduced because the threshold model is constructed by collecting the states of all gesture models in the system. To overcome this problem, the states with similar probability distributions are merged, utilizing the relative entropy measure. Experimental results show that the proposed method can successfully extract trained gestures from continuous hand motion with 93.14 percent reliability.

[1] C. Maggioni, “A Novel Gestural Input Device for Virtual Reality,” Proc. IEEE Virtual Reality Ann. Int'l Symp., pp. 118-124, Seattle, Wash., 1993.
[2] K. Vaananen and K. Boehm, “Gesture Driven Interaction as a Human Factor in Virtual Environments—An Approach with Neural Networks,” Virtual Reality Systems, R. Earnshaw, M. Gigante, H. Jones, eds., chapter 7, pp. 93-106. Academic Press, 1993.
[3] J. Davis and M. Shah, “Visual Gesture Recognition,” IEE Proc. Visualization and Image Signal Processing, vol. 141, no. 2, pp. 101-106, Apr. 1994.
[4] C. Cedras and M. Shah, “Motion Based Recognition: A Survey,” Image and Vision Computing, vol. 13, no. 2, pp. 129-155, Mar. 1995.
[5] M. Shah and R. Jain, Motion-Based Recognition. Kluwer Academic, 1997.
[6] F. Quek, “Toward a Vision-Based Hand Gesture Interface,” Proc. Virtual Reality Software and Technology Conf., pp. 17-31, Singapore, 1994.
[7] R. Kjeldsen and J. Kender, “Visual Hand Gesture Recognition for Window System Control,” Proc. Int'l Workshop Automatic Face- and Gesture-Recognition, pp. 184-188, Zurich, Switzerland, 1995.
[8] T. Starner and A. Pentland, “Real-Time American Sign Language Recognition from Video Using Hidden Markov Models,” Technical Report TR-375, MIT's Media Lab., 1995.
[9] R.C. Rose, “Discriminant Wordspotting Techniques for Rejection Non-Vocabulary Utterances in Unconstrained Speech,” Proc. IEEE Int'l Conf. Acoustics, Speech, and Signal Processing, vol. II, pp. 105-108, San Francisco, 1992.
[10] K. Takahashi, S. Seki, and R. Oka, “Spotting Recognition of Human Gestures from Motion Images,” Technical Report IE92-134, The Inst. of Electronics, Information, and Comm. Engineers, Japan, pp. 9-16, 1992 (in Japanese).
[11] T. Baudel and M. Beaudouin-Lafon, “CHARADE: Remote Control of Objects Using Free-Hand Gestures,” Comm. ACM, vol. 36, no. 7, pp. 28-35, 1993.
[12] X.D. Huang, Y. Ariki, and M.A. Jack, Hidden Markov Models for Speech Recognition. Edinburgh: Edinburgh Univ. Press, 1990.
[13] L.D. Wilcox and M.A. Bush, “Training and Search Algorithms for an Interactive Wordspotting System,” Proc. Int'l Conf. Acoustics, Speech, and Signal Processing, vol. II, pp. 97-100, San Francisco, 1992.
[14] T.M. Cover and J.A. Thomas, Elements of Information Theory. John Wiley&Sons, 1991.
[15] L.R. Rabiner, “Tutorial on Hidden Markov Model and Selected Applications in Speech Recognition,” Proc. IEEE, vol. 77, no. 2, pp. 257-285, 1989.
[16] S.H. Lee and J.H. Kim, “Augmenting the Discrimination Power of HMM by NN for On-Line Cursive Script Recognition,” Applied Intelligence, vol. 7, no. 4, pp. 304-314, 1997.
[17] B.K. Sin and J.H. Kim, “Ligature Modeling for Online Cursive Script Recognition,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 19, no. 6, pp. 623-633, June 1997.
[18] J. Yamato, H. Ohya, and K. Ishii, “Recognizing Human Action in Time-Sequential Images Using Hidden Markov Model,” Proc. 1992 IEEE Conf. Computer Vision and Pattern Recognition, pp. 379-385, 1992.
[19] A.J. Viterbi, “Error Bounds for Convolution Codes and an Asymptotically Optimum Decoding Algorithm,” IEEE Trans. Information Theory, vol. 13, pp. 260-269, 1967.
[20] S. Kullback, Information Theory and Statistics. New York: Dover Publications, 1968.
[21] M. Hwang, X. Huang, and F.A. Alleva, “Predicting Unseen Triphones with Senones,” IEEE Trans. Speech and Audio Processing, vol. 4, no. 6, pp. 412-419, 1996.
[22] I.J. Ko and H.I. Choi, “Hand Region Detection by Region Extraction and Tracking,” Proc. Korea Information Science Soc. Spring Conf., pp. 239-242, Taegu, Korea, 1996 (in Korean).

Index Terms:
Hand gesture, gesture spotting, Hidden Markov Model, segmentation, pattern recognition, relative entropy, state reduction, threshold model.
Citation:
Hyeon-Kyu Lee, Jin H. Kim, "An HMM-Based Threshold Model Approach for Gesture Recognition," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 21, no. 10, pp. 961-973, Oct. 1999, doi:10.1109/34.799904
Usage of this product signifies your acceptance of the Terms of Use.