16th International Conference on Pattern Recognition (ICPR'02) - Volume 2
Text Localization, Enhancement and Binarization in Multimedia Documents
Quebec City, QC, Canada
August 11-15, 2002
ISBN: 0-7695-1695-X
Christian Wolf, INSA de Lyon
Jean-Michel Jolion, INSA de Lyon
Françoise Chassaing, France Télécom R&D
The systems currently available for content-based image and video retrieval work without semantic knowledge, i.e. they use image processing methods to extract low-level features of the data. The similarity obtained by these approaches does not always correspond to the similarity a human user would expect. One way to include more semantic knowledge in the indexing process is to use the text contained in the images and video sequences. This text is rich in information yet easy to use, e.g. through keyword-based queries. In this paper we present an algorithm that localizes artificial text in images and videos, using a measure of accumulated gradients and morphological post-processing to detect the text. The quality of the localized text is improved by robust multiple-frame integration. A new technique for the binarization of the text boxes is proposed. Finally, detection and OCR results obtained with a commercial OCR system are presented.
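The detection idea named in the abstract (accumulated gradients followed by morphological post-processing) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the window size, the relative threshold, and the use of a simple horizontal closing are all assumptions chosen for illustration, and the abstract does not specify any of them.

```python
import numpy as np

def _row_filter(img, width):
    # Sum each row over a centered horizontal window (zero-padded borders).
    kernel = np.ones(width)
    return np.apply_along_axis(
        lambda r: np.convolve(r, kernel, mode="same"), 1, img.astype(np.float64)
    )

def accumulated_gradients(gray, window=9):
    # Absolute horizontal gradient (forward difference), accumulated per row.
    # `window` is a hypothetical parameter, not taken from the paper.
    gray = gray.astype(np.float64)
    grad = np.zeros_like(gray)
    grad[:, 1:] = np.abs(np.diff(gray, axis=1))
    return _row_filter(grad, window)

def close_horizontal(mask, width=5):
    # Morphological closing with a 1 x width structuring element:
    # dilation (any pixel set in the window) followed by erosion (all set).
    dilated = _row_filter(mask, width) > 0
    return _row_filter(dilated, width) >= width - 0.5

def detect_text_mask(gray, window=9, rel_threshold=0.5, close_width=5):
    # Threshold the accumulated gradients relative to their maximum
    # (an assumed, simplistic rule), then clean the mask by closing.
    acc = accumulated_gradients(gray, window)
    mask = acc > rel_threshold * acc.max()
    return close_horizontal(mask, close_width)

def bounding_box(mask):
    # Return (top, bottom, left, right) of the set pixels, or None if empty.
    rows, cols = np.nonzero(mask)
    if rows.size == 0:
        return None
    return int(rows.min()), int(rows.max()), int(cols.min()), int(cols.max())
```

Text regions tend to contain many closely spaced vertical strokes, so summing horizontal gradient magnitudes over a window gives a high response there; the closing step then merges nearby responses into a connected region from which a box can be read off. A real system would need a more careful threshold and additional geometric constraints.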
Citation:
Christian Wolf, Jean-Michel Jolion, Françoise Chassaing, "Text Localization, Enhancement and Binarization in Multimedia Documents," icpr, vol. 2, pp.21037, 16th International Conference on Pattern Recognition (ICPR'02) - Volume 2, 2002