Fourth IEEE International Conference on Multimodal Interfaces (ICMI'02)
A Video Based Interface to Textual Information for the Visually Impaired
Pittsburgh, Pennsylvania
October 14-October 16
ISBN: 0-7695-1834-6
We describe the development of an interface to textual information for the visually impaired that uses video, image processing, optical-character-recognition (OCR) and text-to-speech (TTS). The video provides a sequence of low resolution images in which text must be detected, rectified and converted into high resolution rectangular blocks that are capable of being analyzed via off-the-shelf OCR. To achieve this, various problems related to feature detection, mosaicing, auto-focus, zoom, and systems integration were solved in the development of the system, and these are described.
Citation:
Ali Zandifar, Ramani Duraiswami, Antoine Chahine, Larry S. Davis, "A Video Based Interface to Textual Information for the Visually Impaired," icmi, pp.325, Fourth IEEE International Conference on Multimodal Interfaces (ICMI'02), 2002