loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Fourth International Conference Document Analysis and Recognition (ICDAR'97)
Adaptive Document Binarization
Ulm, GERMANY
August 18-August 20
ISBN: 0-8186-7898-4
Jaakko Sauvola, Machine Vision and Media Processing Group Infotech Oulu, University of Oulu
Tapio Seppänen, Machine Vision and Media Processing Group Infotech Oulu, University of Oulu
Sami Haapakoski, Machine Vision and Media Processing Group Infotech Oulu, University of Oulu
Matti Pietikäinen, Machine Vision and Media Processing Group Infotech Oulu, University of Oulu
A new method is presented for adaptive document image binarization, where the page is considered as a collection of subcomponents such as text, background and picture. The problems caused by noise, illumination and many source type related degradations are addressed. The algo= rithm uses document characteristics to determine (surface) attributes, often used in document segmentation. Using characteristics analysis, two new algorithms are applied to determine a local threshold for each pixel. An algorithm based on soft decision control is used for thresholding background and picture regions. An approach utilizing local mean and variance of gray values is applied to textual regions. Tests were performed with images including different types of document components and degradations. The results show that the method adapts and performs well in each case.
Index Terms:
Adaptive binarization, soft decision, histogram, document segmentation, document analysis, document understanding.
Citation:
Jaakko Sauvola, Tapio Seppänen, Sami Haapakoski, Matti Pietikäinen, "Adaptive Document Binarization," icdar, pp.147, Fourth International Conference Document Analysis and Recognition (ICDAR'97), 1997
Usage of this product signifies your acceptance of the Terms of Use.