Fourth International Conference Document Analysis and Recognition (ICDAR'97)
Supporting Information Extraction from Printed Documents by Lexico-Semantic Pattern Matching
Ulm, GERMANY
August 18-August 20
ISBN: 0-8186-7898-4
Document analysis and understanding (DAU) systems aim not only at the recognition of text and document structures but also at the extraction of relevant information out of a scanned document. Depending on the class of a document, information to be extracted may be defined in advance in syntactic structures as well as in semantic structures. In this paper we present a system for detecting such information and transforming it into a semantic representation. The basic component is a pattern matcher which incorporates geometric positions to detect phrases in the document. By defining a Levenshtein distance, the component reacts more generously in order to be error- tolerant against OCR failures.
Citation:
Claudia Wenzel, "Supporting Information Extraction from Printed Documents by Lexico-Semantic Pattern Matching," icdar, pp.732, Fourth International Conference Document Analysis and Recognition (ICDAR'97), 1997
Usage of this product signifies your acceptance of the
Terms of Use.
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||