|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
| ASCII Text | x | ||
| Chung-Hsien Wu, Yu-Hsien Chiu, Kung-Wei Cheng, "Error-Tolerant Sign Retrieval Using Visual Features and Maximum A Posteriori Estimation," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 26, no. 4, pp. 495-508, April, 2004. | |||
| BibTex | x | ||
| @article{ 10.1109/TPAMI.2004.1265864, author = {Chung-Hsien Wu and Yu-Hsien Chiu and Kung-Wei Cheng}, title = {Error-Tolerant Sign Retrieval Using Visual Features and Maximum A Posteriori Estimation}, journal ={IEEE Transactions on Pattern Analysis and Machine Intelligence}, volume = {26}, number = {4}, issn = {0162-8828}, year = {2004}, pages = {495-508}, doi = {http://doi.ieeecomputersociety.org/10.1109/TPAMI.2004.1265864}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - JOUR JO - IEEE Transactions on Pattern Analysis and Machine Intelligence TI - Error-Tolerant Sign Retrieval Using Visual Features and Maximum A Posteriori Estimation IS - 4 SN - 0162-8828 SP495 EP508 EPD - 495-508 A1 - Chung-Hsien Wu, A1 - Yu-Hsien Chiu, A1 - Kung-Wei Cheng, PY - 2004 KW - Taiwanese Sign Language KW - alternative and augmentative communication KW - error tolerant retrieval KW - gesture feature. VL - 26 JA - IEEE Transactions on Pattern Analysis and Machine Intelligence ER - | |||
Abstract—This paper proposes an efficient error-tolerant approach to retrieving sign words from a Taiwanese Sign Language (TSL) database. This database is tagged with visual gesture features and organized as a multilist code tree. These features are defined in terms of the visual characteristics of sign gestures by which they are indexed for sign retrieval and displayed using an anthropomorphic interface. The maximum a posteriori estimation is exploited to retrieve the most likely sign word given the input feature sequence. An error-tolerant mechanism based on mutual information criterion is proposed to retrieve a sign word of interest efficiently and robustly. A user-friendly anthropomorphic interface is also developed to assist learning TSL. Several experiments were performed in an educational environment to investigate the system's retrieval accuracy. Our proposed approach outperformed a dynamic programming algorithm in its task and shows tolerance to user input errors.
[1] L.L. Lloyd, D.R. Fuller, and H.H. Arvidson, Augmentative and Alternative Communication: A Handbook of Principles and Practices. Allyn and Bacon, Inc., 1997.
[2] P. Demasco and K.F. McCoy, Generating Text from Compressed Input: An Intelligent Interface for People with Server Motor Impairments Comm. ACM, vol. 35, no. 5, pp. 68-78, 1992.
[3] A.L. Swiffin, J.L. Arnott, and A.F. Newell, Adaptive and Predictive Techniques in a Communication Prosthesis Augmentative and Alternative Comm., vol. 3, no. 4, pp. 181-191, 1987.
[4] A.M. Cook and S.M. Hussey, Assistive Technologies: Principles and Practice. St. Louis, MO: Mosby-Year Book, 1995.
[5] S.K. Chang, G. Costagliola, S. Orefice, et al., A Methodology for Iconic Language Design with Application to Augmentative Communication Proc. 1992 IEEE Workshop Visual Language, pp. 110-116, 1992.
[6] J.G. Webster, A.M. Cook, W.J. Tompkins, and G.C. Vanderheiden, Electronic Devices for Rehabilitation. John Wiley&Sons, 1985.
[7] F. Alonso, A. de Antonio, J.L. Fuertes, et al., Teaching Communication Skills to Hearing-Impaired Children IEEE Multimedia, pp. 55-67, 1995.
[8] R.C. Simpson, and H.H. Koester, Adaptive One-Switch Row-Column Scanning IEEE Trans. Rehabilitation Eng., vol. 7, no. 4, pp. 464-473, 1999.
[9] J. Darragh and I. Witten, The Reactive Keyboard. Cambridge Univ. Press, 1992.
[10] R. Baeza-Yates and B. Ribeiro-Neto, Modern Information Retrieval. ACM Press, 1999.
[11] S. Berretti, A. Del Bimbo, and P. Pala, “Retrieval by Shape Similarity with Perceptual Distance and Effective Indexing,” IEEE Trans. Multimedia, vol. 2, no. 4, pp. 225-239, 2000.
[12] W.H. Smith, Morphological Characteristics of Verbs in Taiwan Sign Language PhD dissertation, Indiana Univ., 1989.
[13] M.-C. Su, Y.-X. Zhao, H. Huang, and H.-F. Chen, A Fuzzy Rule-Based Approach to Recognizing 3-D Arm Movements IEEE Trans. Neural Systems and Rehabilitation Eng., vol. 9, no. 2, 2001.
[14] R. Liang, Continuous Gesture Recognition System for Taiwanese Sign Language PhD dissertation, Nat'l Taiwan Univ., Taiwan, 1997.
[15] C. Valli and C. Lucas, Linguistics of American Sign Language: An Introduction. Gallaudet Univ. Press, 1995.
[16] S.K. Liddell and R.E. Johnson, American Sign Language: The Phonological Base Sign Language Studies, vol. 64, pp. 195-277, 1989.
[17] C. Vogler and D. Metaxas, Toward Scalability in ASL Recognition: Breaking Down Signs into Phonemes Lecture Notes in Artificial Intelligence, vol. 1739, pp. 211-224, 1999.
[18] C. Vogler and D. Metaxas, A Framework for Recognizing the Simultaneous Aspects of American Sign Language Computer Vision and Image Understanding, no. 81, pp. 358-384, 2001.
[19] T. Starner, J. Weaver, and A. Pentland, "Real-Time American Sign Language Recognition Using Desk and Wearable Computer-Based Video," Pattern Analysis and Machine Vision, Dec. 1998, pp. 1371-1375.
[20] W. Sandler, Phonological Features and Feature Classes: The Case of Movements in Sign Language Lingua 98, pp. 197-220, 1996.
[21] W.H. Smith and L.-F. Ting, Shou Neng Sheng Chyau (Your Hands Can Become a Bridge), vols. 1 and 2, Taipei, R.O.C.: Deaf Sign Language Research Assoc., 1997.
[22] Ministry of Education, Division of Special Education, Changyong Cihui Shouyu Huace (Sign Album of Common Words), vol. 1, Taipei: Ministry of Education, 2000.
[23] G.H. Gonnet, Handbook of Algorithms and Data Structures. Addison-Wesley Publishing Company, 1984.
[24] D.J. Hand, Discrimination and Classification, pp. 100-101, John Wiley&Sons, 1989.
[25] S.M. Ross, Introduction to Probability Models. Academic Press, Inc., 1993.
[26] M.H. DeGroot, Optimal Statistical Decisions. McGraw-Hill Publishing, 1970.
[27] Z.G. Dong and L.K. Teng, Interpolation of n-Gram and Mutual-Information Based Trigger Pair Language Models for Mandarin Speech Recognition Computer Speech and Language, vol. 13, pp. 125-141, 1999.
[28] T.H. Cormen, C.E. Leiserson, and R.L. Rivest, Introduction to Algorithms. The MIT Press, 1994.
[29] K. Oflazer, “Error-Tolerant Retrieval of Trees,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 19, no. 12, Dec. 1997.
[30] L. Rabiner and B. Juang, Fundamentals of Speech Recognition. Prentice Hall, 1993.
[31] I.S. MacKenzie and W. Buxton, A Tool for Rapid Evaluation of Input Devices Using Fitt's Law Models SIGCHI Bull., vol. 25, no. 3, pp. 58-63, 1993.
[32] G.K. Kanji, 100 Statistical Tests. SAGE Publications Ltd., 1999.
[33] S. Theodoridis and K. Koutroumbas, Pattern Recognition. Academic Press, 1999.
[34] J. Ann, A Linguistic Investigation of the Relationship between Physiology and Handshape PhD dissertation, Univ. of Arizona, 1993.
[35] M. Mohri, Finite-State Transducers in Language and Speech Processing Assoc. for Computational Linguistics, vol. 23, pp. 1-42, 1997.

