Third International Conference on Document Analysis and Recognition (ICDAR'95) - Volume 1
Visual inter-word relations and their use in OCR postprocessing
Montréal, Canada
August 14-August 15
ISBN: 0-8186-7128-9
A technique is presented that uses visual relationships between word images in a document to improve the recognition of the text it contains. Conventional optical character recognition (OCR) techniques usually discard these relationships. The visual relations are defined as the equivalences that exist between images of the same word or between portions of word images. An algorithm is presented that calculates these relationships in a document. The resulting clusters of equivalent images are integrated with the recognition results provided by an OCR system. Inconsistencies in OCR results between equivalent images are identified and used to improve recognition performance. Experimental results are presented in which the input is provided directly from a commercial OCR system.
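The integration step described above can be illustrated with a minimal sketch. It is not the authors' algorithm: the abstract does not specify how clusters are computed or how inconsistencies are resolved. The sketch assumes the clusters of equivalent word images are already available as one cluster id per word, and it resolves disagreement by a simple majority vote, which is an assumption made here for illustration. The names `postprocess`, `ocr_words`, and `cluster_ids` are hypothetical.

```python
from collections import Counter, defaultdict


def postprocess(ocr_words, cluster_ids):
    """Return OCR results corrected by majority vote within each cluster.

    ocr_words   -- OCR output string for each word image, in reading order
    cluster_ids -- hypothetical cluster label for each word image; images
                   sharing a label are assumed to be visually equivalent
    """
    # Group word positions by the cluster their image belongs to.
    clusters = defaultdict(list)
    for idx, cid in enumerate(cluster_ids):
        clusters[cid].append(idx)

    corrected = list(ocr_words)
    for members in clusters.values():
        votes = Counter(ocr_words[i] for i in members)
        if len(votes) < 2:
            continue  # OCR results agree; nothing to fix
        (top, top_n), (_, second_n) = votes.most_common(2)
        if top_n == second_n:
            continue  # tie: no clear majority, leave results unchanged
        # Assumption: the most frequent reading is the correct one.
        for i in members:
            corrected[i] = top
    return corrected


if __name__ == "__main__":
    words = ["recognition", "recogmtion", "recognition", "the", "tbe"]
    ids = [0, 0, 0, 1, 1]
    print(postprocess(words, ids))
```

In this usage example, the three images in cluster 0 disagree two to one, so the minority reading is replaced. The two images in cluster 1 are tied, so both readings are left untouched rather than guessed.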
Index Terms:
optical character recognition; document image processing; inter-word relations; OCR postprocessing; character recognition; recognition performance; equivalent images; word images; document
Citation:
T. Hong, J.J. Hull, "Visual inter-word relations and their use in OCR postprocessing," icdar, vol. 1, pp.442, Third International Conference on Document Analysis and Recognition (ICDAR'95) - Volume 1, 1995