The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.04 - April (2011 vol.33)
pp: 846-851
Faisal Shafait , German Research Center for Artificial Intelligence, Kaiserslautern
Thomas M. Breuel , Technical University of Kaiserslautern, Kaiserslautern
ABSTRACT
Projection methods have been used in the analysis of bitonal document images for different tasks such as page segmentation and skew correction for more than two decades. However, these algorithms are sensitive to the presence of border noise in document images. Border noise can appear along the page border due to scanning or photocopying. Over the years, several page segmentation algorithms have been proposed in the literature. Some of these algorithms have come into widespread use due to their high accuracy and robustness with respect to border noise. This paper addresses two important questions in this context: 1) Can existing border noise removal algorithms clean up document images to a degree required by projection methods to achieve competitive performance? 2) Can projection methods reach the performance of other state-of-the-art page segmentation algorithms (e.g., Docstrum or Voronoi) for documents where border noise has successfully been removed? We perform extensive experiments on the University of Washington (UW-III) data set with six border noise removal methods. Our results show that although projection methods can achieve the accuracy of other state-of-the-art algorithms on the cleaned document images, existing border noise removal techniques cannot clean up documents captured under a variety of scanning conditions to the degree required to achieve that accuracy.
INDEX TERMS
Document page segmentation, OCR, performance evaluation, border noise removal, document cleanup.
CITATION
Faisal Shafait, Thomas M. Breuel, "The Effect of Border Noise on the Performance of Projection-Based Page Segmentation Methods", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol.33, no. 4, pp. 846-851, April 2011, doi:10.1109/TPAMI.2010.194
REFERENCES
[1] R. Cattoni, T. Coianiz, S. Messelodi, and C.M. Modena, "Geometric Layout Analysis Techniques for Document Image Understanding: A Review," Technical Report 9703-09, http:/citeseer.nj.nec.com/, 1998.
[2] G. Nagy, "Twenty Years of Document Image Analysis in PAMI," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 22, no. 1, pp. 38-62, Jan. 2000.
[3] G. Nagy and S. Seth, "Hierarchical Representation of Optically Scanned Documents," Proc. Seventh Int'l Conf. Pattern Recognition, pp. 347-349, July 1984.
[4] G. Nagy, S. Seth, and M. Viswanathan, "A Prototype Document Image Analysis System for Technical Journals," Computer, vol. 25, no. 7, pp. 10-22, July 1992.
[5] M. Krishnamoorthy, G. Nagy, S. Seth, and M. Viswanathan, "Syntactic Segmentation and Labeling of Digitized Pages from Technical Journals," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 15, no. 7, pp. 737-747, July 1993.
[6] L. O'Gorman, "The Document Spectrum for Page Layout Analysis," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 15, no. 11, pp. 1162-1173, Nov. 1993.
[7] K. Kise, A. Sato, and M. Iwata, "Segmentation of Page Images Using the Area Voronoi Diagram," Computer Vision and Image Understanding, vol. 70, no. 3, pp. 370-382, 1998.
[8] T.M. Breuel, "Two Geometric Algorithms for Layout Analysis," Proc. Workshop Document Analysis Systems, pp. 188-199, Aug. 2002.
[9] S. Mao and T. Kanungo, "Empirical Performance Evaluation Methodology and Its Application to Page Segmentation Algorithms," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 23, no. 3, pp. 242-256, Mar. 2001.
[10] F. Shafait, D. Keysers, and T.M. Breuel, "Performance Evaluation and Benchmarking of Six Page Segmentation Algorithms," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 30, no. 6, pp. 941-954, June 2008.
[11] I. Guyon, R.M. Haralick, J.J. Hull, and I.T. Phillips, "Data Sets for OCR and Document Image Understanding Research," Handbook of Character Recognition and Document Image Analysis, H. Bunke and P. Wang, eds., pp. 779-799, World Scientific, 1997.
[12] K.Y. Wong, R.G. Casey, and F.M. Wahl, "Document Analysis System," IBM J. Research and Development, vol. 26, no. 6, pp. 647-656, 1982.
[13] G. Nagy, S.C. Seth, and M. Viswanathan, "Projection Methods Require Black Border Removal," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 31, no. 4, p. 762, Apr. 2009.
[14] F. Shafait, D. Keysers, and T.M. Breuel, "Response to Projection Methods Require Black Border Removal," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 31, no. 4, pp. 763-764, Apr. 2009.
[15] J. Ha, R. Haralick, and I. Phillips, "Document Page Decomposition by the Bounding-Box Projection Technique," Proc. Int'l Conf. Document Analysis and Recognition, pp. 1119-1122, Aug. 1995.
[16] D. Sylwester and S. Seth, "A Trainable, Single-Pass Algorithm for Column Segmentation," Proc. Int'l Conf. Document Analysis and Recognition, pp. 615-618, Aug. 1995.
[17] F. Shafait, D. Keysers, and T.M. Breuel, "Pixel-Accurate Representation and Evaluation of Page Segmentation in Document Images," Proc. 18th Int'l Conf. Pattern Recognition, pp. 872-875, Aug. 2006.
[18] H. Baird, "The Skew Angle of Printed Documents," Proc. 40th Ann. Conf. and Symp. Hybrid Imaging Systems, pp. 21-24, May 1987.
[19] A.D. Bagdanov and J. Kanai, "Projection Profile Based Skew Estimation Algorithm for JBIG Compressed Images," Proc. Int'l Conf. Document Analysis and Recognition, pp. 401-405, Aug. 1997.
[20] J. Kanai and A.D. Bagdanov, "Projection Profile Based Skew Estimation Algorithm for JBIG Compressed Images," Int'l J. Document Analysis and Recognition, vol. 1, no. 1, pp. 43-51, 1998.
[21] F. Shafait and T.M. Breuel, "A Simple and Effective Approach for Border Noise Removal from Document Images," Proc. 13th IEEE Int'l Multi-Topic Conf., Dec. 2009.
[22] N. Stamatopoulos, B. Gatos, and A. Kesidis, "Automatic Borders Detection of Camera Document Images," Proc. Second Int'l Workshop Camera-Based Document Analysis and Recognition, pp. 71-78, Sept. 2007.
[23] http:/unpaper.berlios.de/, 2010.
[24] F. Shafait, J. van Beusekom, D. Keysers, and T.M. Breuel, "Document Cleanup Using Page Frame Detection," Int'l J. Document Analysis and Recognition, vol. 11, no. 2, pp. 81-96, 2008.
[25] W. Peerawit and A. Kawtrakul, "Marginal Noise Removal from Document Images Using Edge Density," Proc. Fourth Information and Computer Eng. Postgraduate Workshop, Jan. 2004.
[26] K.C. Fan, Y.K. Wang, and T.R. Lay, "Marginal Noise Removal of Document Images," Pattern Recognition, vol. 35, no. 11, pp. 2593-2611, 2002.
[27] T.M. Breuel, "The OCRopus Open Source OCR System," Proc. SPIE Document Recognition and Retrieval XV, pp. 0F1-0F15, Jan. 2008.
[28] F. Shafait, J. van Beusekom, D. Keysers, and T.M. Breuel, "Page Frame Detection for Marginal Noise Removal from Scanned Documents," Proc. Scandinavian Conf. Image Analysis, pp. 651-660, June 2007.
[29] S. Mao and T. Kanungo, "Software Architecture of PSET: A Page Segmentation Evaluation Toolkit," Int'l J. Document Analysis and Recognition, vol. 4, no. 3, pp. 205-217, 2002.
[30] L. Cinque, S. Levialdi, L. Lombardi, and S. Tanimoto, "Segmentation of Page Images Having Artifacts of Photocopying and Scanning," Pattern Recognition, vol. 35, no. 5, pp. 1167-1177, 2002.
[31] B.T. Avila and R.D. Lins, "Efficient Removal of Noisy Borders from Monochromatic Documents," Proc. Int'l Conf. Image Analysis and Recognition, pp. 249-256, Sept. 2004.
5 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool