loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Fourth International Conference Document Analysis and Recognition (ICDAR'97)
Omnifont and Unlimited-Vocabulary OCR for English and Arabic
Ulm, GERMANY
August 18-August 20
ISBN: 0-8186-7898-4
Issam Bazzi, BBN Corporati
Chris LaPre, BBN Corporati
John Makhoul, BBN Corporati
Chris Raphael, BBN Corporati
Richard Schwartz, BBN Corporati
on We present a set of techniques for omnifont, unlimited-vocabulary OCR, within the context of a system based on Hidden Markov Models (HMM). First, we address the issue of how to perform OCR on omnifont and multi-style data, such as plain and italic, without the need to have a separate model for each style. The amount of training data from each style, which is used to train a single model, becomes an important issue in the face of the conditional independence assumption inherent in the use of HMMs. We demonstrate mathematically and empirically how to allocate training data among the different styles to alleviate this problem. Second, we show how to use a word-based HMM system to perform character recognition with unlimited vocabulary. The method includes the use of a trigram language model on character sequences. Using all these techniques, we have achieved character error rates of 1.1% on data from the University of Washington English Document Image Database and 3.3% on data from the DARPA Arabic OCR Corpus.
Index Terms:
character recognition, OCR, speech recognition, Hidden Markov Models.
Citation:
Issam Bazzi, Chris LaPre, John Makhoul, Chris Raphael, Richard Schwartz, "Omnifont and Unlimited-Vocabulary OCR for English and Arabic," icdar, pp.842, Fourth International Conference Document Analysis and Recognition (ICDAR'97), 1997
Usage of this product signifies your acceptance of the Terms of Use.