Fourth International Conference Document Analysis and Recognition (ICDAR'97) Speeding-up Chinese Character Recognition in an Automatic Document Reading System Ulm, GERMANY August 18-August 20 ISBN: 0-8186-7898-4
In this paper, we present two techniques for speeding up character recognition. Our character recognition system, including the candidate- cluster selection and detail-matching modules, is implemented using two statistical features: crossing-counts and contour-direction counts. In the training stage, we divide characters into different clusters by using reference characters. To keep very high recognition rate, the candidate- cluster selection module selects the top 60 clusters with minimal distances from among 300 predefined clusters. To further speed-up the recognition speed, we use a modified branch-and-bound algorithm in the detail-matching module.In the automatic document-reading system, characters and punctuation marks are first extracted from printed document images and sorted according to their positions and the document orientation. The system then recognizes all printed Chinese characters between pairs of punctuation marks. The results are then spoken aloud by a speech- synthesis system. The character recognition system and the text-to- speech synthesis system are integrated in the Windows-based document reading system, which provides a user friendly environment.
Index Terms:
crossing-count features, contour-direction features, candidate-cluster selection, branch-and-bound method, text-to-speech technique, automatic document-reading system.
Citation:
Yi-Hong Tseng, Chi-Chang Kuo, Hsi-Jian Lee, "Speeding-up Chinese Character Recognition in an Automatic Document Reading System," icdar, pp.629, Fourth International Conference Document Analysis and Recognition (ICDAR'97), 1997 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||