loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
17th International Conference on Pattern Recognition (ICPR'04) - Volume 2
Adaptive Word Style Classification Using a Gaussian Mixture Model
Cambridge UK
August 23-August 26
ISBN: 0-7695-2128-2
Huanfeng Ma, University of Maryland, College Park
David Doermann, University of Maryland, College Park
In this paper, we present a new approach to detect bold and italic words in scanned documents. Under the assumption that OCR results are available, features used for classification are selected automatically using feature selection. For each scanned page, a Gaussian Mixture Model is constructed for characters with the same character code, and word styles are determined using a weighted majority vote. We applied this method to a variety of documents and compared the results with current commercial OCR software that provides style information. The experimental results show that our method performs better.
Citation:
Huanfeng Ma, David Doermann, "Adaptive Word Style Classification Using a Gaussian Mixture Model," icpr, vol. 2, pp.606-609, 17th International Conference on Pattern Recognition (ICPR'04) - Volume 2, 2004
Usage of this product signifies your acceptance of the Terms of Use.