Seventh International Conference on Document Analysis and Recognition (ICDAR'03) - Volume 1
Proper Names Extraction from Fax Images Combining Textual and Image Features
Edinburgh, Scotland
August 03-August 06
ISBN: 0-7695-1960-1
In the frame of a Unified Messaging System, a crucial task of the system is to provide the user with key information on every message received, like keywords reflecting the object of the message, or the name of the sender. However, in the case of facsimiles, this information is not as easy to detect as in the case of e-mails, since no standard headers are defined. The aim of the present work is to identify and extract a specific information (the name of the sender) from a fax cover page. For this purpose, methods based on image document analysis (OCR recognition, physical blocks selection), and text analysis methods (optimised dictionary lookup, local grammar rules), are implemented to work in parallel. The fusion of their results brings a more accurate guess than any of the methods would achieve separately.
Citation:
Laurence Likforman-Sulem, Pascal Vaillant, Fran?ois Yvon, "Proper Names Extraction from Fax Images Combining Textual and Image Features," icdar, vol. 1, pp.545, Seventh International Conference on Document Analysis and Recognition (ICDAR'03) - Volume 1, 2003