Fourth IEEE International Conference on Multimodal Interfaces (ICMI'02)
Evaluating Integrated Speech- and Image Understanding
Pittsburgh, Pennsylvania
October 14-October 16
ISBN: 0-7695-1834-6
The capability to coordinate and interrelate speech and vision is a virtual prerequisite for adaptive, cooperative, and flexible interaction among people. It is therefore to assume that human-machine interaction, too, would benefit from intelligent interfaces for integrated speech and image processing. In this paper, we first sketch an interactive system that integrates automatic speech processing with image understanding. Then, we concentrate on performance assessment which we believe is an emerging key issue in multimodal interaction. We explain the benefit of time scale analysis and usability studies and evaluate our system accordingly.
Citation:
C. Bauckhage, J. Fritsch, K. J. Rohlfing, S. Wachsmuth, G. Sagerer, "Evaluating Integrated Speech- and Image Understanding," icmi, pp.9, Fourth IEEE International Conference on Multimodal Interfaces (ICMI'02), 2002