Machine Printed Text and Handwriting Identification in Noisy Document Images
March 2004 (vol. 26 no. 3)
pp. 337-353

Abstract—In this paper, we address the problem of identifying text in noisy document images. We focus on segmenting and distinguishing handwriting from machine printed text because 1) handwriting in a document often indicates corrections, additions, or other supplemental information that should be treated differently from the main content, and 2) the segmentation and recognition techniques required for machine printed and handwritten text are significantly different. A novel aspect of our approach is that we treat noise as a separate class and model noise based on selected features. Trained Fisher classifiers are used to distinguish machine printed text and handwriting from noise, and we further exploit context to refine the classification. A Markov Random Field (MRF)-based approach is used to model the geometrical structure of the printed text, handwriting, and noise to rectify misclassifications. Experimental results show that our approach is robust and can significantly improve page segmentation in noisy document collections.
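
As a rough illustration of the two stages named in the abstract, the Python sketch below fits a two-class Fisher linear discriminant on 2-D feature vectors and then relabels each block by a vote over its immediate neighbors. It is not the authors' implementation. The paper works with three classes (machine printed text, handwriting, noise) and an MRF model, whereas this sketch assumes two classes and swaps in a plain neighbor majority vote as a crude stand-in for contextual refinement. All function names and the 2-D feature layout are hypothetical.

```python
# Illustrative sketch only: two-class Fisher discriminant on 2-D features,
# plus a neighbor majority vote standing in for MRF-based refinement.
# Names, feature dimensions, and the voting rule are assumptions.

def mean(vectors):
    """Component-wise mean of a list of 2-D vectors."""
    n = len(vectors)
    return [sum(v[i] for v in vectors) / n for i in range(2)]

def scatter(vectors, m):
    """2x2 scatter matrix: sum over samples of (x - m)(x - m)^T."""
    s = [[0.0, 0.0], [0.0, 0.0]]
    for v in vectors:
        d = [v[0] - m[0], v[1] - m[1]]
        for i in range(2):
            for j in range(2):
                s[i][j] += d[i] * d[j]
    return s

def fit_fisher(class0, class1):
    """Return (w, threshold) with w = Sw^-1 (m1 - m0)."""
    m0, m1 = mean(class0), mean(class1)
    s0, s1 = scatter(class0, m0), scatter(class1, m1)
    # Within-class scatter is the sum of the per-class scatter matrices.
    sw = [[s0[i][j] + s1[i][j] for j in range(2)] for i in range(2)]
    det = sw[0][0] * sw[1][1] - sw[0][1] * sw[1][0]
    if det == 0:
        raise ValueError("within-class scatter matrix is singular")
    inv = [[sw[1][1] / det, -sw[0][1] / det],
           [-sw[1][0] / det, sw[0][0] / det]]
    diff = [m1[0] - m0[0], m1[1] - m0[1]]
    w = [inv[0][0] * diff[0] + inv[0][1] * diff[1],
         inv[1][0] * diff[0] + inv[1][1] * diff[1]]
    # Decision threshold: midpoint of the two projected class means.
    p0 = w[0] * m0[0] + w[1] * m0[1]
    p1 = w[0] * m1[0] + w[1] * m1[1]
    return w, 0.5 * (p0 + p1)

def predict(w, threshold, x):
    """Label 1 if the projection of x exceeds the threshold, else 0."""
    return 1 if w[0] * x[0] + w[1] * x[1] > threshold else 0

def refine(labels):
    """Relabel each item by majority vote over itself and its adjacent
    neighbors in a 1-D sequence; ties keep the current label."""
    refined = []
    for i, current in enumerate(labels):
        window = labels[max(0, i - 1): i + 2]
        counts = {}
        for lab in window:
            counts[lab] = counts.get(lab, 0) + 1
        best = max(counts.values())
        if counts[current] == best:
            refined.append(current)
        else:
            refined.append(max(counts, key=counts.get))
    return refined
```

In this sketch, `fit_fisher` would be trained once on labeled feature vectors, `predict` applied to each block, and `refine` run over the predicted labels of blocks in reading order so that an isolated label surrounded by a different class is overturned.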


Index Terms:
Text identification, handwriting identification, Markov random field, postprocessing, noisy document image enhancement, document analysis.
Citation:
Yefeng Zheng, Huiping Li, David Doermann, "Machine Printed Text and Handwriting Identification in Noisy Document Images," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 26, no. 3, pp. 337-353, Mar. 2004, doi:10.1109/TPAMI.2004.1262324