Issue No.07 - July (2009 vol.31)
pp: 1184-1194
Huaigu Cao, University at Buffalo, Amherst
Venu Govindaraju, University at Buffalo, Amherst
This paper presents a statistical approach to the preprocessing of degraded handwritten forms including the steps of binarization and form line removal. The degraded image is modeled by a Markov Random Field (MRF) where the hidden-layer prior probability is learned from a training set of high-quality binarized images and the observation probability density is learned on-the-fly from the gray-level histogram of the input image. We have modified the MRF model to drop the preprinted ruling lines from the image. We use the patch-based topology of the MRF and Belief Propagation (BP) for efficiency in processing. To further improve the processing speed, we prune unlikely solutions from the search space while solving the MRF. Experimental results show higher accuracy on two data sets of degraded handwritten images than previously used methods.
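The idea of combining a gray-level observation model with a spatial prior can be illustrated with a much simpler model than the one the abstract describes. The sketch below is not the authors' method. It assumes a pixel-level Ising smoothness prior in place of the learned patch-based prior, a two-Gaussian observation model fitted by splitting the gray levels at their global mean, and iterated conditional modes (ICM) in place of Belief Propagation. It does not attempt form line removal. The function names and the parameters `beta` and `sweeps` are hypothetical.

```python
import math


def fit_two_classes(pixels):
    """Estimate (mean, std) for ink and background from the input image itself.

    Assumption: the classes are separated by splitting at the global mean,
    a crude stand-in for fitting the gray-level histogram.
    """
    t = sum(pixels) / len(pixels)
    dark = [p for p in pixels if p <= t]
    light = [p for p in pixels if p > t]

    def stats(values):
        m = sum(values) / len(values)
        s = math.sqrt(sum((v - m) ** 2 for v in values) / len(values))
        return m, max(s, 1.0)  # floor the std to avoid division by zero

    # Fall back to all pixels if one side of the split is empty.
    return stats(dark or pixels), stats(light or pixels)


def neg_log_gauss(x, mean, std):
    """Negative log-likelihood of x under a Gaussian, up to a constant."""
    return 0.5 * ((x - mean) / std) ** 2 + math.log(std)


def binarize_mrf(img, beta=1.5, sweeps=5):
    """Label each pixel 0 (ink) or 1 (background) by ICM on an Ising MRF.

    img is a list of rows of gray levels; beta weights the penalty paid
    for each 4-connected neighbor that carries a different label.
    """
    h, w = len(img), len(img[0])
    params = fit_two_classes([p for row in img for p in row])
    # Data cost of assigning label k to pixel (y, x).
    cost = [[[neg_log_gauss(img[y][x], *params[k]) for k in (0, 1)]
             for x in range(w)] for y in range(h)]
    # Start from the maximum-likelihood labeling (no spatial prior).
    labels = [[0 if cost[y][x][0] <= cost[y][x][1] else 1 for x in range(w)]
              for y in range(h)]
    for _ in range(sweeps):
        changed = False
        for y in range(h):
            for x in range(w):
                nbrs = [labels[ny][nx]
                        for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1))
                        if 0 <= ny < h and 0 <= nx < w]
                # Pick the label with the lowest data cost plus smoothness cost.
                best = min((0, 1), key=lambda k: cost[y][x][k]
                           + beta * sum(n != k for n in nbrs))
                if best != labels[y][x]:
                    labels[y][x] = best
                    changed = True
        if not changed:
            break  # converged to a local minimum
    return labels


if __name__ == "__main__":
    page = [[200, 190, 50, 210, 200] for _ in range(5)]  # one dark stroke
    for row in binarize_mrf(page):
        print(row)
```

ICM only reaches a local minimum of the energy and is used here purely because it is short. A larger `beta` smooths away isolated specks more aggressively but can also erode thin strokes.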
Markov random field, image segmentation, document analysis, handwriting recognition.
Huaigu Cao, Venu Govindaraju, "Preprocessing of Low-Quality Handwritten Documents Using Markov Random Fields", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 31, no. 7, pp. 1184-1194, July 2009, doi:10.1109/TPAMI.2008.126