The Community for Technology Leaders
Green Image
<p><it>Abstract</it>—Many approaches have reported that knowledge-based layout recognition methods are very successful to classify the meaningful data from document images automatically. However, these approaches are applicable to only the same kind of documents because they are based on the paradigm that specifies the structure definition information in advance so as to be able to analyze a particular class of documents intelligently. In this paper, we propose a method to recognize the layout structures of multi-kinds of table-form document images. For this purpose, we introduce a classification tree to manage the relationships among different classes of layout structures. Our recognition system has two modes: layout knowledge acquisition and layout structure recognition. In the layout knowledge acquisition mode, table-form document images are distinguished according to this classification tree and then the structure description trees which specify the logical structures of table-form documents are generated automatically. While, in the layout structure recognition mode, individual item fields in the table-form document images are extracted and classified successfully by searching the classification tree and interpreting the structure description tree.</p>
Recognition paradigm for multi-kinds of table-form documents, automatic acquisition of layout knowledge, recognition of document classes, recognition of layout structures, classification tree, structure description tree.

T. Watanabe, N. Sugie and Q. Luo, "Layout Recognition of Multi-Kinds of Table-Form Documents," in IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 17, no. , pp. 432-445, 1995.
85 ms
(Ver 3.3 (11022016))