Seventh International Conference on Document Analysis and Recognition (ICDAR'03) - Volume 1
Using Irregular Pyramid for Text Segmentation and Binarization of Gray Scale Images
Edinburgh, Scotland
August 03-August 06
ISBN: 0-7695-1960-1
Compared with the binary images that most text extraction methods operate on, gray scale images provide much more information for the extraction task. On the other hand, they complicate the separation of the textual content from its background region (i.e., thresholding), which must happen before the actual text extraction can begin. In contrast to the usual sequence, in which document images are binarized before text extraction, this paper proposes a new method that first segments the individual subject areas with the help of an irregular pyramid and then binarizes them. This focuses the binarization process only on the appropriate subject areas before text recognition. Our method overcomes the difficulty global binarization has in finding a single threshold value that fits the whole image. It also avoids the problem, common to most local thresholding techniques, of finding a suitable window size. As our experimental results show, changing the sequence of processing allows our method to perform well in both text segmentation and binarization.
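The difference between a single global threshold and per-region thresholds can be illustrated with a minimal sketch. It is not the method of the paper: the regions are hard-coded bounding boxes standing in for the output of a segmentation step (no irregular pyramid is built), the midpoint threshold rule is an arbitrary choice, text is assumed to be darker than its background, and all names (`midpoint_threshold`, `binarize_global`, `binarize_by_region`, `IMAGE`, `REGIONS`) are hypothetical.

```python
# Illustrative sketch only. The regions are hard-coded boxes standing in for the
# result of a segmentation step; this is NOT the paper's irregular pyramid.
# Assumption: text pixels are darker than the background of their own region.

def midpoint_threshold(pixels):
    """Return the value halfway between the darkest and brightest pixel."""
    return (min(pixels) + max(pixels)) / 2


def binarize_global(image):
    """Binarize with one threshold computed over the whole image (1 = text)."""
    t = midpoint_threshold([p for row in image for p in row])
    return [[1 if p < t else 0 for p in row] for row in image]


def binarize_by_region(image, regions):
    """Binarize each region with its own threshold; pixels outside any region stay 0."""
    out = [[0] * len(row) for row in image]
    # Each region is a half-open box (top, left, bottom, right): a hypothetical format.
    for top, left, bottom, right in regions:
        block = [image[r][c] for r in range(top, bottom) for c in range(left, right)]
        t = midpoint_threshold(block)
        for r in range(top, bottom):
            for c in range(left, right):
                out[r][c] = 1 if image[r][c] < t else 0
    return out


# Left half: text at gray level 20 on a background of 100.
# Right half: text at gray level 150 on a background of 240.
IMAGE = [
    [20, 100, 150, 240],
    [100, 20, 240, 150],
]
REGIONS = [(0, 0, 2, 2), (0, 2, 2, 4)]

global_result = binarize_global(IMAGE)
regional_result = binarize_by_region(IMAGE, REGIONS)
```

In this toy image the global threshold (130) marks the whole left half as text and misses the text in the right half, while the per-region thresholds (60 and 195) recover the text pixels in both halves without any window size having to be chosen.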
Citation:
Poh-Kok Loo, Chew-Lim Tan, "Using Irregular Pyramid for Text Segmentation and Binarization of Gray Scale Images," icdar, vol. 1, pp.594, Seventh International Conference on Document Analysis and Recognition (ICDAR'03) - Volume 1, 2003