This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
A New Methodology for Gray-Scale Character Segmentation and Recognition
October 1996 (vol. 18 no. 10)
pp. 1045-1050

Abstract—Generally speaking, through the binarization of gray-scale images, useful information for the segmentation of touched or overlapped characters may be lost in many cases. If we analyze gray-scale images, however, specific topographic features and the variation of intensities can be observed in the character boundaries. We believe that such kinds of clues obtained from gray-scale images may work for efficient character segmentation and recognition. In this paper, we propose a new methodology for character segmentation and recognition which makes the best use of the characteristics of gray-scale images. In the proposed methodology, the character segmentation regions are determined by using projection profiles and topographic features extracted from the gray-scale images. Then a nonlinear character segmentation path in each character segmentation region is found by using multi-stage graph search algorithm. Finally, in order to confirm the nonlinear character segmentation paths and recognition results, recognition-based segmentation method is adopted. Through the experiments with various kinds of printed documents, it is convinced that the proposed methodology is very effective for the segmentation and recognition of touched and overlapped characters.

[1] Y. Lu, "Machine Printed Character Segmentation—An Overview," Pattern Recognition, vol. 28, no. 1, pp. 67-80, Jan. 1995.
[2] S. Kahan,T. Pavlidis,, and H. S. Baird,“On the Recognition of Printed Characters of Any Font and Size,” IEEE-PAMI, vol. 9, no. 2, pp. 274-288, 1987.
[3] A. Ariyoshi, "A Character Segmentation Method for Japanese Documents Coping with Touching Character Problems," Proc. 11th Int'l Conf. Pattern Recognition,The Hague, Netherlands, pp. 313-316, Aug. 1992.
[4] S. Liang, M. Shridhar, and M. Ahmadi, "Segmentation of Touching Characters in Printed Document Recognition," Pattern Recognition, vol. 27, no. 6, pp. 825-840, June 1994.
[5] S. Tsujimoto and H. Asada, "Major Components of a Complete Text Reading System," Proceedings IEEE, vol. 80, no. 7, pp. 1,133-1,149, July 1992.
[6] T. Bayer and U. Kresel, "Cut Classification for Segmentation," Proc. Second Int'l Conf. Document Analysis and Recognition,Tsukuba, Japan, pp. 565-568, Oct. 1993.
[7] L. Wang and T. Pavlidis,“Direct gray scale extraction of features for character recognition,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 15, no. 10, pp. 1,053-1,067, Oct. 1993.
[8] J. Rocha and T. Pavlidis, "A Solution to the Problem of Touching and Broken Characters," Proc. Second Int'l Conf. Document Analysis and Recognition,Tsukuba, Japan, pp. 602-605, Oct. 1993.
[9] J. Wang and J. Jean, "Segmentation of Merged Characters by Neural Networks and Shortest Path," Pattern Recognition, vol. 27, no. 25, pp. 649-658, May 1994.
[10] S.-W. Lee and Y.-J. Kim, "Direct Extraction of Topographic Features for Gray Scale Character Recognition," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 17, no. 7, pp. 724-729, July 1995.
[11] T.H. Horowitz and S. Sahni, Fundamentals of Computer Algorithms. Computer Science Press, 1978.
[12] S.-W. Lee and J.-S. Kim, "Multilingual, Multifont, and Multisize Large-set Character Recognition Using Self-Organizing Neural Network," Proc.Third Int'l Conf. Document Analysis and Recognition,Montreal, pp. 28-33, Aug. 1995.
[13] D.-J. Lee and S.-W. Lee, "Character Segmentation and Recognition in Korean Document Mixed with Alphanumeric Characters," Proc. Fall National Conf. Korea Information Science Soc.,Seoul, Korea, pp. 403-406, Oct. 1994, (in Korean).
[14] Hangul and Computer Co. Ltd., Hangul Wordprocessor: Reference Manual, Version 3.0b, Seoul, Korea, 1995.

Index Terms:
Character segmentation and recognition, topographic feature, gray-scale character recognition, multistage graph search, recognition-based segmentation.
Citation:
Seong-Whan Lee, Dong-June Lee, Hee-Seon Park, "A New Methodology for Gray-Scale Character Segmentation and Recognition," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 18, no. 10, pp. 1045-1050, Oct. 1996, doi:10.1109/34.541415
Usage of this product signifies your acceptance of the Terms of Use.