The Community for Technology Leaders
Proceedings of 1994 28th Asilomar Conference on Signals, Systems and Computers (1994)
Pacific Grove, CA, USA
Oct. 31, 1994 to Nov. 2, 1994
ISSN: 1058-6393
ISBN: 0-8186-6405-3
pp: 1356-1361
N. Chaddha , Comput. Syst. Lab., Stanford Univ., CA, USA
R. Sharma , Comput. Syst. Lab., Stanford Univ., CA, USA
A. Agrawal , Comput. Syst. Lab., Stanford Univ., CA, USA
A. Gupta , Comput. Syst. Lab., Stanford Univ., CA, USA
ABSTRACT
Block based algorithms have found widespread use in image and video compression. However, popular algorithms such as JPEG, which are very effective in compressing continuous tone images, do not perform well with mixed-mode images which have a substantial text component. With a growing number of applications where such images occur, e.g., color facsimile, digital libraries and educational videos, there are advantages in being able to classify each block as being text or continuous tone. With such a classification, different compression parameters or even algorithms may be employed for the two kinds of data to obtain high compression with minimal loss in visual quality. In this paper we analyze and compare four methods for block classification in mixed mode images, namely variance, absolute-deviation, edge, and DCT based methods. Our evaluation of each scheme is based on the accuracy of segmentation, robustness across different types of images and sensitivity to the threshold used for segmentation. Our results show that DCT based segmentation offers the best accuracy and robustness. Another advantage of DCT is that it is compatible with standards like JPEG, MPEG and H.261.<>
INDEX TERMS
image segmentation, data compression, image coding, discrete cosine transforms, image classification, transform coding, edge detection, visual databases
CITATION

N. Chaddha, R. Sharma, A. Agrawal and A. Gupta, "Text segmentation in mixed-mode images," Proceedings of 1994 28th Asilomar Conference on Signals, Systems and Computers(ACSSC), Pacific Grove, CA, USA, 1995, pp. 1356-1361.
doi:10.1109/ACSSC.1994.471679
96 ms
(Ver 3.3 (11022016))