Fourth International Conference Document Analysis and Recognition (ICDAR'97) Confidence computation improvement in an optical field reading system Ulm, GERMANY August 18-August 20 ISBN: 0-8186-7898-4
An expression in closed form is derived for the recognition error vs. rejection rate of optical character or word recognition systems. This expression allows to define a lower bound for the error rate of any recognition system employing a rejection process based on the definition of a confidence threshold. This relation has also proved to be useful to make a quantitative comparison between two confidence computation methods implemented in a system for reading USA Census '90 hand-written forms. The newly proposed method is based upon a confidence model integrating single-character confidence levels, digram statistics and other information from the dictionary matching phase. At a 50% rejection rate, the field error rate calculated using the new confidence computation algorithm decreased from 47.7% to 44.6%, which represents a considerable improvement, given a theoretical lower bound of 40.8% on the error rate.
Index Terms:
optical character recognition; confidence computation algorithm; optical field reading system; closed form expression; recognition error; rejection rate; optical character recognition systems; word recognition systems; error rate lower bound; confidence threshold; US Census hand-written forms; single-character confidence levels; digram statistics; dictionary matching phase; field error rate
Citation:
A. Benedetti, Z.M. Kovacs-V., "Confidence computation improvement in an optical field reading system," icdar, pp.836, Fourth International Conference Document Analysis and Recognition (ICDAR'97), 1997 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||