loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
16th International Conference on Pattern Recognition (ICPR'02) - Volume 3
Hierarchical Content Classification and Script Determination for Automatic Document Image Processing
Quebec City, QC, Canada
August 11-August 15
ISBN: 0-7695-1695-X
Qing Wang, Northwestern Polytechnical University and Hong Kong Polytechnical University
Zheru Chi, Hong Kong Polytechnical University
Rongchun Zhao, Northwestern Polytechnical University
Page segmentation and image content classification plays an important role in automatic document image processing with apllications to mixed-type document image compression, form and check reading, and automatic mail sorting. In this paper, we propose an enhanced backgroung-thinning based page segmentation algorithm to process document images rapidly and eliminate some small regions embedded in other regions. We then present a hierarchical approach, which combines cross correlation measure, kolmogorov complexity measure, and a neural network, to classify sub-images into halftones and texts. The approach also achieves high accuracy in text determination using a three-layer feed-forward network, where text region can be classified into Chinese or alphabetic character. Experimental results on a number of mixed-type document images show the efficiency and effectiveness of our approach.
Citation:
Qing Wang, Zheru Chi, Rongchun Zhao, "Hierarchical Content Classification and Script Determination for Automatic Document Image Processing," icpr, vol. 3, pp.30077, 16th International Conference on Pattern Recognition (ICPR'02) - Volume 3, 2002
Usage of this product signifies your acceptance of the Terms of Use.