Fourth IEEE International Conference on Multimodal Interfaces (ICMI'02)
A Real-Time Framework for Natural Multimodal Interaction with Large Screen Displays
Pittsburgh, Pennsylvania
October 14-October 16
ISBN: 0-7695-1834-6
This paper presents a framework for designing a natural multimodal human computer interaction (HCI) system. The core of the proposed framework is a principled method for combining information derived from audio and visual cues. To achieve natural interaction, both audio and visual modalities are fused along with feedback through a large screen display. Careful design along with due considerations of possible aspects of a systems interaction cycle and integration has resulted in a successful system. The performance of the proposed framework has been validated through the development of several prototype systems as well as commercial applications for the retail and entertainment industry. To assess the impact of these multimodal systems (MMS), informal studies have been conducted. It was found that the system performed according to its specifications in 95% of the cases and that users showed ad-hoc proficiency, indicating natural acceptance of such systems.
Index Terms:
Continuous Gesture Recognition, Multimodal HCI, Speech-Gesture Co-Analysis, Visual Tracking, Real-Time System
Citation:
N. Krahnstoever, S. Kettebekov, M. Yeasin, R. Sharma, "A Real-Time Framework for Natural Multimodal Interaction with Large Screen Displays," icmi, pp.349, Fourth IEEE International Conference on Multimodal Interfaces (ICMI'02), 2002