Fourth International Conference Document Analysis and Recognition (ICDAR'97)
Supporting Information Extraction from Printed Documents by Lexico-Semantic Pattern Matching
Ulm, GERMANY
August 18-20, 1997
ISBN: 0-8186-7898-4
Claudia Wenzel, German Research Center for Artificial Intelligence DFKI GmbH
Document analysis and understanding (DAU) systems aim not only to recognize text and document structures but also to extract relevant information from a scanned document. Depending on the class of a document, the information to be extracted may be defined in advance in terms of both syntactic and semantic structures. In this paper we present a system for detecting such information and transforming it into a semantic representation. The basic component is a pattern matcher which incorporates geometric positions to detect phrases in the document. By using a Levenshtein distance, the component matches more leniently and is thus error-tolerant with respect to OCR failures.
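The idea of combining geometric positions with a Levenshtein distance can be illustrated with a minimal sketch. It is not the system described in the paper: the names `Word`, `Region`, and `find_keyword`, the coordinate convention, and the threshold `max_dist` are all assumptions introduced here for illustration only.

```python
from dataclasses import dataclass


def levenshtein(a: str, b: str) -> int:
    """Classic edit distance (insertions, deletions, substitutions)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]


@dataclass
class Word:
    """Hypothetical OCR output: recognized text plus page position."""
    text: str
    x: float
    y: float


@dataclass
class Region:
    """Hypothetical rectangular page area in which a phrase is expected."""
    x_min: float
    y_min: float
    x_max: float
    y_max: float

    def contains(self, w: Word) -> bool:
        return self.x_min <= w.x <= self.x_max and self.y_min <= w.y <= self.y_max


def find_keyword(words, keyword, region, max_dist=1):
    """Return words inside `region` whose text is within `max_dist` edits of `keyword`."""
    target = keyword.lower()
    return [w for w in words
            if region.contains(w) and levenshtein(w.text.lower(), target) <= max_dist]


# Illustrative data: "lnvoice" simulates an OCR confusion of "I" with "l".
words = [Word("lnvoice", 50, 20), Word("Invoice", 400, 700), Word("Total", 60, 25)]
header = Region(0, 0, 200, 100)
hits = find_keyword(words, "Invoice", header, max_dist=1)
```

Here the misrecognized word in the header region is accepted because it lies within one edit of the keyword, while the correctly spelled word elsewhere on the page is rejected on position alone. A larger `max_dist` would tolerate more OCR errors at the cost of more false matches.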
Citation:
Claudia Wenzel, "Supporting Information Extraction from Printed Documents by Lexico-Semantic Pattern Matching," icdar, pp.732, Fourth International Conference Document Analysis and Recognition (ICDAR'97), 1997