Fourth International Conference Document Analysis and Recognition (ICDAR'97)
A Generic System for Processing Invoices
Ulm, GERMANY
August 18-August 20
ISBN: 0-8186-7898-4
This paper presents a generic system which automatically extracts the requested items from invoices with arbitrary form layout in arbitrary domains. The system consists of two components, an OCR tool which need not be adapted to the current domain and an information extraction component FRESCO which contains the knowledge about the domain. The automation rate for interpreting invoices in the domain of health insurance is above 50 % with an error rate below 1 % with respect to the items to be extracted.