loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Seventh International Conference on Document Analysis and Recognition (ICDAR'03) - Volume 1
Training on Severely Degraded Text-Line Images
Edinburgh, Scotland
August 03-August 06
ISBN: 0-7695-1960-1
Prateek Sarkar, Palo Alto Research Center
Henry S. Baird, Palo Alto Research Center
Xiaohu Zhang, Stanford University
We show that document image decoding (DID) supervised training algorithms, as a result of recent refinements, achieve high accuracy with low manual effort even under conditions of severe image degradation in both training and test data. We describe improvements in DID training of character template, set-width, and channel (noise) models. Large-scale experimental trials, using synthetically degraded images of text, have established two new and practically important advantages of DID algorithms: 1. high accuracy ( > 99% chraracters correct) in decoding using models trained on even severely degraded images from the same distribution; and 2. greatly improved accuracy ( < 1/10 the error rate) across a wide range of image degradations compared to untrained (idealized) models.
Citation:
Prateek Sarkar, Henry S. Baird, Xiaohu Zhang, "Training on Severely Degraded Text-Line Images," icdar, vol. 1, pp.38, Seventh International Conference on Document Analysis and Recognition (ICDAR'03) - Volume 1, 2003
Usage of this product signifies your acceptance of the Terms of Use.