The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.08 - August (2008 vol.30)
pp: 1313-1329
ABSTRACT
Text line segmentation in freestyle handwritten documents remains an open document analysis problem. Curvilinear text lines and small gaps between neighboring text lines present a challenge to algorithms developed for machine printed or hand-printed documents. In this paper, we propose a novel approach based on density estimation and a state-of-the-art image segmentation technique, the level set method. From an input document image, we estimate a probability map, where each element represents the probability that the underlying pixel belongs to a text line. The level set method is then exploited to determine the boundary of neighboring text lines by evolving an initial estimate. Unlike connected component based methods ( [1], [2] for example), the proposed algorithm does not use any script-specific knowledge. Extensive quantitative experiments on freestyle handwritten documents with diverse scripts, such as Arabic, Chinese, Korean, and Hindi, demonstrate that our algorithm consistently outperforms previous methods [1]-[3]. Further experiments show the proposed algorithm is robust to scale change, rotation, and noise.
INDEX TERMS
Document analysis, Handwriting analysis, Document and Text Processing
CITATION
Yi Li, Yefeng Zheng, David Doermann, Stefan Jaeger, Yi Li, "Script-Independent Text Line Segmentation in Freestyle Handwritten Documents", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol.30, no. 8, pp. 1313-1329, August 2008, doi:10.1109/TPAMI.2007.70792
REFERENCES
[1] L. O'Gorman, “The Document Spectrum for Page Layout Analysis,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 15, no. 11, pp. 1162-1173, Nov. 1993.
[2] S. Jaeger, G. Zhu, D. Doermann, K. Chen, and S. Sampat, “DOCLIB: A Software Library for Document Processing,” Proc. SPIE Document Recognition and Retrieval XIII, pp. 63-71, 2006.
[3] G. Nagy, S. Seth, and M. Viswanathan, “A Prototype Document Image Analysis System for Technical Journals,” Computer, vol. 25, no. 7, pp. 10-22, July 1992.
[4] R. Plamondon and S.N. Srihari, “Online and Offline Handwriting Recognition: A Comprehensive Survey,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 22, no. 1, pp. 63-84, Jan. 2000.
[5] Y. Li, Y. Zheng, and D. Doermann, “Detecting Text Lines in Handwritten Documents,” Proc. 18th Int'l Conf. Pattern Recognition, pp. 1030-1033, 2006.
[6] Y. Li, Y. Zheng, D. Doermann, and S. Jaeger, “A New Algorithm for Detecting Text Line in Handwritten Documents,” Proc. 10th Int'l Workshop Frontiers in Handwriting Recognition, pp. 35-40, 2006.
[7] S. Mao, A. Rosenfeld, and T. Kanungo, “Document Structure Analysis Algorithms: A Literature Survey,” Proc. SPIE Document Recognition and Retrieval X, pp. 197-207, 2003.
[8] A.K. Jain and B. Yu, “Document Representation and Its Application to Page Decomposition,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 20, no. 3, pp. 294-308, Mar. 1998.
[9] J. Liang, I.T. Phillips, and R.M. Haralick, “Performance Evaluation of Document Structure Extraction Algorithms,” Computer Vision and Image Understanding, vol. 84, no. 1, pp. 144-159, 2001.
[10] G. Nagy, “Twenty Years of Document Image Analysis in PAMI,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 22, no. 1, pp. 38-62, Jan. 2000.
[11] A. Simon, J.-C. Pret, and A. Johnson, “A Fast Algorithm for Bottom-Up Document Layout Analysis,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 19, no. 3, pp. 273-277, Mar. 1997.
[12] B. Yu and A. Jain, “A Robust and Fast Skew Detection Algorithm for Generic Documents,” Pattern Recognition, vol. 29, no. 10, pp.1599-1629, 1996.
[13] R. Manmatha and J.L. Rothfeder, “A Scale Space Approach for Automatically Segmenting Words from Historical Handwritten Documents,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 27, no. 8, pp. 1212-1225, Aug. 2005.
[14] U.-V. Martin and H. Bunke, “Text Line Segmentation and Word Recognition in a System for General Writer Independent Handwriting Recognition,” Proc. Sixth Int'l Conf. Document Analysis and Recognition, pp. 159-163, 2001.
[15] N. Tripathy and U. Pal, “Handwriting Segmentation of Unconstrained Oriya Text,” Proc. Ninth Int'l Workshop Frontiers in Handwriting Recognition, pp. 306-311, 2004.
[16] U. Pal and P.P. Roy, “Multioriented and Curved Text Lines Extraction from Indian Documents,” IEEE Trans. Systems, Man, and Cybernetics—Part B: Cybernetics, vol. 34, no. 4, pp. 1676-1684, 2004.
[17] A. Zahour, B. Taconet, P. Mercy, and S. Ramdane, “Arabic Handwritten Text-Line Extraction,” Proc. Sixth Int'l Conf. Document Analysis and Recognition, pp. 281-285, 2001.
[18] Y. Zheng, H. Li, and D. Doermann, “A Parallel-Line Detection Algorithm Based on HMM Decoding,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 27, no. 5, pp. 777-792, May 2005.
[19] J. Sauvola, D. Doermann, and M. Pietikainen, “Locally Adaptive Document Skew Detection,” Proc. SPIE Document Recognition and Retrieval IV, pp. 96-108, 1997.
[20] S. Osher and R. Fedkiw, Level Set Methods and Dynamic Implicit Surfaces. Springer, 2003.
[21] E. Parzen, “On Estimation of a Probability Density Function and Mode,” Annals Math. Statistics, vol. 33, no. 3, pp. 1065-1076, 1962.
[22] R.O. Duda, P.E. Hart, and D.G. Stork, Pattern Classification, second ed., pp. 84-97. Wiley-Interscience Publications, 2000.
[23] B. Sumengen, “Variational Image Segmentation and Curve Evolution on Natural Images,” PhD dissertation, Univ. of California, Santa Barbara, Sept. 2004.
[24] W. Niblack, An Introduction to Digital Image Processing, pp. 115-116. Prentice Hall, 1986.
[25] A. Rosenfeld and J. Pfaltz, “Sequential Operations in Digital Picture Processing,” J. ACM, vol. 13, no. 4, pp. 471-494, 1966.
[26] D. Doermann and D. Mihalcik, “Tools and Techniques for Video Performances Evaluation,” Proc. 15th Int'l Conf. Pattern Recognition, pp. 167-170, 2000.
[27] G. Liu and R.M. Haralick, “Optimal Matching Problem in Detection and Recognition Performance Evaluation,” Pattern Recognition, vol. 35, no. 3, pp. 2125-2139, 2002.
[28] S. Mao and T. Kanungo, “Software Architecture of PSET: A Page Segmentation Evaluation Toolkit,” Int'l J. Document Analysis and Recognition, vol. 4, no. 3, pp. 205-217, 2002.
[29] H.S. Baird, “Calibration of Document Image Defect Models,” Proc. Second Symp. Document Analysis and Information Retrieval, pp. 1-16, 1993.
[30] T. Kanungo, R.M. Haralick, and I. Phillips, “Nonlinear Local and Global Document Degradation Models,” Int'l J. Imaging Systems and Technology, vol. 5, no. 4, pp. 220-230, 1994.
[31] M. Cannon, J. Hochberg, and P. Kelly, “Quality Assessment and Restoration of Typewritten Document Images,” Int'l J. Document Analysis and Recognition, vol. 2, nos. 2-3, pp. 80-89, 1999.
[32] J. Sethian, Level Set Methods and Fast Marching Methods, Evolving Interfaces in Computational Geometry, Fluid Mechanics, Computer Vision, and Materials Science. Cambridge Univ. Press, 1999.
12 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool