loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Fourth IEEE International Conference on Multimodal Interfaces (ICMI'02)
A Video Based Interface to Textual Information for the Visually Impaired
Pittsburgh, Pennsylvania
October 14-October 16
ISBN: 0-7695-1834-6
Ali Zandifar, University of Maryland at College Park
Ramani Duraiswami, University of Maryland at College Park
Antoine Chahine, University of Maryland at College Park
Larry S. Davis, University of Maryland at College Park
We describe the development of an interface to textual information for the visually impaired that uses video, image processing, optical-character-recognition (OCR) and text-to-speech (TTS). The video provides a sequence of low resolution images in which text must be detected, rectified and converted into high resolution rectangular blocks that are capable of being analyzed via off-the-shelf OCR. To achieve this, various problems related to feature detection, mosaicing, auto-focus, zoom, and systems integration were solved in the development of the system, and these are described.
Citation:
Ali Zandifar, Ramani Duraiswami, Antoine Chahine, Larry S. Davis, "A Video Based Interface to Textual Information for the Visually Impaired," icmi, pp.325, Fourth IEEE International Conference on Multimodal Interfaces (ICMI'02), 2002
Usage of this product signifies your acceptance of the Terms of Use.