International Conference on Information Technology: Coding and Computing (ITCC'04) Volume 2
Recognition and Identification of Form Document Layouts
Las Vegas, Nevada
April 05-April 07
ISBN: 0-7695-2108-8
Kai Luo, University of Nevada, Las Vegas
In this paper, a hierarchical tree representation is introduced to represent the logical structure of a form document. Different forms might have the same logical structure, so the representation will be ambiguous. In this paper, an improvement is proposed to solve the ambiguity problem by using the physical information of the blocks. A pixel tracing approach is used to extract form layout structures from form documents. Compared with hough transform, it requires less computation. This algorithm has been tested on 50 different table forms. The algorithm applies to table form documents.
Index Terms:
table form document, form layout extraction, pixel tracing, hierarchical tree
Citation:
Kai Luo, Shahram Latifi, Kazem Taghva, Emma Regentova, "Recognition and Identification of Form Document Layouts," itcc, vol. 2, pp.352, International Conference on Information Technology: Coding and Computing (ITCC'04) Volume 2, 2004