Fourth International Conference Document Analysis and Recognition (ICDAR'97)
Matching form lines based on a heuristic search
Ulm, GERMANY, August 18-August 20
ISBN: 0-8186-7898-4
A major problem in form reading applications is that form fields cannot be located exactly because of nonlinear distortions in the form images. Such distortions appear, for example, on photocopied forms or on forms transmitted by fax. One way to solve this problem is to locate the form fields from the positions of the form lines. This paper describes a new method for finding pairs of corresponding form lines on a reference form and a filled form. The advantage of this method is that the corresponding line pairs can be used to map any pixel between the filled form and the reference form without any assumption about the kind of distortion. The core of the method is an algorithm based on the A*-search algorithm. Two sets of horizontal or vertical lines, one from the reference form and one from the filled form, are searched for pairs of corresponding lines. The algorithm's run time is low, and nonlinear distortions of the form images hardly influence its results. With increasing complexity (i.e., an increasing number of lines or decreasing image quality), the number of rejected form lines grows, but the error rate stays low.
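The abstract does not give the paper's cost function or heuristic, so the following is only a minimal sketch of how an A*-style search over two ordered sets of line positions could look. Everything specific here is an assumption made for illustration: the function name `match_lines`, the `skip_cost` penalty for rejecting a line, the use of the absolute position difference as the matching cost, and the heuristic based on the surplus of remaining lines. It is not the authors' algorithm.

```python
import heapq


def match_lines(ref, filled, skip_cost=10.0):
    """Order-preserving matching of two sorted lists of line positions.

    Hypothetical cost model (an assumption, not taken from the paper):
    matching ref[i] with filled[j] costs |ref[i] - filled[j]|, and
    rejecting a line on either form costs skip_cost.
    Returns (pairs, total_cost) with pairs as (ref_index, filled_index).
    """
    n, m = len(ref), len(filled)

    def heuristic(i, j):
        # Admissible lower bound: a match consumes one line from each
        # set, so any surplus of remaining lines must be rejected.
        return skip_cost * abs((n - i) - (m - j))

    start, goal = (0, 0), (n, m)
    best = {start: 0.0}   # cheapest known cost to reach each state
    parent = {}           # state -> (previous state, matched pair or None)
    heap = [(heuristic(0, 0), 0.0, start)]

    while heap:
        _, g, (i, j) = heapq.heappop(heap)
        if g > best.get((i, j), float("inf")):
            continue  # stale queue entry
        if (i, j) == goal:
            break
        moves = []
        if i < n and j < m:   # match the next line of each form
            moves.append(((i + 1, j + 1), abs(ref[i] - filled[j]), (i, j)))
        if i < n:             # reject the next reference line
            moves.append(((i + 1, j), skip_cost, None))
        if j < m:             # reject the next filled-form line
            moves.append(((i, j + 1), skip_cost, None))
        for nxt, cost, pair in moves:
            new_g = g + cost
            if new_g < best.get(nxt, float("inf")):
                best[nxt] = new_g
                parent[nxt] = ((i, j), pair)
                heapq.heappush(heap, (new_g + heuristic(*nxt), new_g, nxt))

    # Walk back from the goal to collect the matched pairs.
    pairs, state = [], goal
    while state != start:
        state, pair = parent[state]
        if pair is not None:
            pairs.append(pair)
    pairs.reverse()
    return pairs, best[goal]


# Example: the filled form contains one spurious line at position 250.
ref_lines = [100, 200, 300, 400]
filled_lines = [103, 198, 250, 305, 398]
pairs, cost = match_lines(ref_lines, filled_lines)
print(pairs, cost)
```

In this sketch a rejected line is simply one that ends up in no pair. A cost based on absolute positions is sensitive to global shifts, so a practical variant would probably compare local line spacings instead; that choice is left open here.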
Index Terms:
business forms; form line matching; heuristic search; form reading applications; form field location; form image nonlinear distortions; photocopied forms; facsimile transmission; reference form; filled form; corresponding line pairs; pixel mapping; A*-search algorithm; horizontal lines; vertical lines; run time; complexity; image quality; rejected form lines; error rate; form recognition; form identification; form structures
Citation:
U. Bohnacker, J. Schacht, T. Yucel, "Matching form lines based on a heuristic search," icdar, pp.86, Fourth International Conference Document Analysis and Recognition (ICDAR'97), 1997