CSDL Home I ICDAR 2003 Proceedings Seventh International Conference on Document Analysis and Recognition
Aug. 6, 2003 to Aug. 6, 2003
Poh-Kok Loo , Singapore Polytechnic
Chew-Lim Tan , National University of Singapore
Compared to binary images that most text extraction methods work on, gray scale images provide much more information for the extraction task. On the other hand complication also arises in determining the subject textual content from its background region (ie. thresholding) before the actual text extraction process can begin. Differing from the usual sequence of processes where document images are binarized before the actual text extraction, this paper proposes a new method by first segmenting individual subject areas with the help of an irregular pyramid to be followed by the binarization process. This permits the focus of attention only on the appropriate subject areas for the binarization process before text recognition. Our method overcomes the difficulty in global binarization to find a single value to fit all. It also avoids the common problem in most local thresholding technique of finding a suitable window size. As shown in our experimented result, our method performed well in both text segmentation and binarization by varying the sequence of processing.
Poh-Kok Loo, Chew-Lim Tan, "Using Irregular Pyramid for Text Segmentation and Binarization of Gray Scale Images", ICDAR, 2003, Proceedings Seventh International Conference on Document Analysis and Recognition, Proceedings Seventh International Conference on Document Analysis and Recognition 2003, pp. 594, doi:10.1109/ICDAR.2003.1227733