This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Ninth International Workshop on Frontiers in Handwriting Recognition (IWFHR'04)
Document Retrieval System Tolerant of Segmentation Errors of Document Images
Kokubunji, Tokyo, Japan
October 26-October 29
ISBN: 0-7695-2187-8
Takeshi Nagasaki, Hitachi, Ltd.
Toshikazu Takahashi, Hitachi, Ltd.
Katsumi Marukawa, Hitachi, Ltd.
This paper describes a new document retrieval method that is tolerant of OCR segmentation errors in document images. To overcome the segmentation and recognition errors that most OCR-based retrieval systems suffer from, the proposed method consists of two processing phases. First, the OCR engine first generates multiple character-segmentation and recognition hypotheses. Then the retrieval engine extracts keywords from the recognition hypotheses by using lexicon-driven dynamic programming (DP) matching. We have applied this method to both handwritten and printed document images and have demonstrated its effectiveness in reducing false drops and false alarms.
Citation:
Takeshi Nagasaki, Toshikazu Takahashi, Katsumi Marukawa, "Document Retrieval System Tolerant of Segmentation Errors of Document Images," iwfhr, pp.280-285, Ninth International Workshop on Frontiers in Handwriting Recognition (IWFHR'04), 2004
Usage of this product signifies your acceptance of the Terms of Use.