The Community for Technology Leaders
IEEE International Conference on Multimedia and Expo, 2001. ICME 2001. (2001)
Tokyo, Japan
Aug. 22, 2001 to Aug. 25, 2001
ISBN: 0-7695-1198-8
pp: 233
I. Haritaoglu , IBM Almaden Research, San Jose, CA 95120, USA
A. Cozzi , IBM Almaden Research, San Jose, CA 95120, USA
D. Koons , IBM Almaden Research, San Jose, CA 95120, USA
M. Flickner1 , IBM Almaden Research, San Jose, CA 95120, USA
D. Zotkin , Computer Vision Laboratory University of Maryland, College Park, MD 20742, USA
R. Duraiswami , Computer Vision Laboratory University of Maryland, College Park, MD 20742, USA
Y. Yacoob , Computer Vision Laboratory University of Maryland, College Park, MD 20742, USA
ABSTRACT
We describe a multimodal attentive environment system that performs joint audio-visual information processing to enable it to interact intelligently with people. It integrates real-time video and audio processing techniques to detect and track multiple persons in the scene. Speech recognition and eye contact are used to develop a natural human-like communication interface with participants. We have implemented the system as a visually interactive toy robot (VTOYS) and demonstrated it successfully to many people belonging to different age classes. This allows us to explore novel ways of human-machine interactions and novel interfaces-specifically, the new possibilities of the human- machine interaction for the case of the machine having a limited environment perception ability.
INDEX TERMS
null
CITATION

A. Cozzi et al., "ATTENTIVE TOYS," IEEE International Conference on Multimedia and Expo, 2001. ICME 2001.(ICME), Tokyo, Japan, 2001, pp. 233.
doi:10.1109/ICME.2001.1237872
87 ms
(Ver 3.3 (11022016))