Ninth International Conference on Document Analysis and Recognition (ICDAR 2007) Vol 1
Exploiting Fisher Kernels in Decoding Severely Noisy Document Images
Curitiba, Parana, Brazil
September 23-September 26
ISBN: 0-7695-2822-8
J. Chen, Palo Alto Research Center, 3333 Coyote Hill Road, Palo Alto, CA 94304-1314, USA
Y. Wang, Palo Alto Research Center, 3333 Coyote Hill Road, Palo Alto, CA 94304-1314, USA
Decoding noisy document images is commonly needed in applications such as enterprise content management. Available OCR solutions are still unsatisfactory, especially on noisy images, and re-trainable systems require difficult and tedious preparation of training examples. Motivated by this challenging real-world application, we propose a novel solution that combines generative template models with discriminative classifiers via an RBF Fisher kernel derived from a generative model. We show that the new approach is highly accurate in decoding noisy document images and makes the system more generalizable to variations in font and degradation, and hence significantly reduces the burden of training example preparation. We also show that, because it weights the pixel features by their relevance, the RBF Fisher kernel is more robust and leads to smaller, faster models through dimensionality reduction.
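As a rough illustration of how an RBF Fisher kernel can weight pixel features, the sketch below assumes a very simple generative template: each pixel of a character is an independent Bernoulli variable with "ink" probability p[i]. This model, the function names (`fisher_score`, `rbf_fisher_kernel`), and the parameters `gamma` and `eps` are assumptions made for the example; the abstract does not specify the template model actually used. Under this assumption, the Fisher score normalized by the diagonal Fisher information is (x[i] - p[i]) / sqrt(p[i] (1 - p[i])), so pixels about which the template is confident contribute more to the kernel distance than uncertain ones.

```python
import numpy as np

def fisher_score(x, p, eps=1e-3):
    """Normalized Fisher score of a binary image x under an assumed
    independent-Bernoulli pixel template with ink probabilities p."""
    p = np.clip(p, eps, 1.0 - eps)  # keep the variance away from zero
    # Gradient of the log-likelihood w.r.t. p[i] is (x - p) / (p (1 - p));
    # scaling by the inverse square root of the diagonal Fisher
    # information 1 / (p (1 - p)) gives the expression below.
    return (x - p) / np.sqrt(p * (1.0 - p))

def rbf_fisher_kernel(x, y, p, gamma=0.1):
    """RBF kernel evaluated on normalized Fisher scores (illustrative)."""
    d = fisher_score(x, p) - fisher_score(y, p)
    return float(np.exp(-gamma * np.dot(d, d)))

# Hypothetical 4-pixel template: pixels 0 and 2 are confident, 1 and 3 are not.
p = np.array([0.95, 0.5, 0.05, 0.5])
a = np.array([1.0, 1.0, 0.0, 0.0])
b = np.array([1.0, 0.0, 0.0, 1.0])  # differs from a only at uncertain pixels
c = np.array([0.0, 1.0, 1.0, 0.0])  # differs from a only at confident pixels

k_ab = rbf_fisher_kernel(a, b, p)
k_ac = rbf_fisher_kernel(a, c, p)
```

Here `a` and `b` disagree on the same number of pixels as `a` and `c`, yet `k_ab` is larger than `k_ac`, because disagreement on pixels where the template is confident is penalized more heavily. A kernel of this kind could then be passed to a kernel classifier such as a support vector machine.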
Citation:
J. Chen, Y. Wang, "Exploiting Fisher Kernels in Decoding Severely Noisy Document Images," Ninth International Conference on Document Analysis and Recognition (ICDAR 2007), vol. 1, pp. 417-421, 2007