This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Syntactic Segmentation and Labeling of Digitized Pages from Technical Journals
July 1993 (vol. 15 no. 7)
pp. 737-747

A method for extracting alternating horizontal and vertical projection profiles are from nested sub-blocks of scanned page images of technical documents is discussed. The thresholded profile strings are parsed using the compiler utilities Lex and Yacc. The significant document components are demarcated and identified by the recursive application of block grammars. Backtracking for error recovery and branch and bound for maximum-area labeling are implemented with Unix Shell programs. Results of the segmentation and labeling process are stored in a labeled x-y tree. It is shown that families of technical documents that share the same layout conventions can be readily analyzed. Results from experiments in which more than 20 types of document entities were identified in sample pages from two journals are presented.

[1] R. Ascher, G. Koppelman, M. Miller, G. Nagy, and G. Shelton Jr, "An interactive system for reading unformatted printed text,"IEEE Trans. Comput., vol. C-20, no. 12, pp. 1527-1543, Dec. 1971.
[2] E. Backer and E. S. Gelsema, Eds., inProc. Int. Conf. Patt. Recogn.(The Hague), Sept. 1992.
[3] H. S. Baird, "The skew angle of printed documents," inAdvance Printing Symp. Summaries, SPSE's 40th Ann. Conf. Symp. Hybrid Imaging Syst., May 1987, pp. 21-24.
[4] H. S. Baird, H. Bunke, and K. Yamamoto, Eds.,Structured Image Analysis. New York: Springer Verlag, 1992.
[5] D. P. D'Amato, W. E. Blanz, B. E. Dom, and S. N. Srihari, Eds., inProc. SPIE (Int Soc. Opt. Engr.)--Machine vision Applications Character Recogn. Ind. Inspection(San Jose, CA), Feb. 10-12, 1992, vol. 1661.
[6] A. Dengel and G. Barth, "ANASTASIL: A hybrid knowledge-based system for document layout analysis," inProc. 11th Int. J. Conf. Artificial Intell.(Detroit), Aug. 1989, pp. 1249-1254.
[7] F. Jenkins and J. Kanai, "A keyword-indexed bibliography of character recognition and document analysis," Inform. Sci. Res. Inst., Univ. Nevada, Las Vegas, Mar. 4, 1992.
[8] S. C. Johnson, "Yacc: Yet another compiler-compiler," Comp. Sci. Tech. Rep. 32, Bell Laboratories, Murray Hill, NJ, 1975.
[9] J. Kanai, "Text line extraction using character prototypes," inPreproc. IAPR Workshop Structural Syntactic Patt. Recogn.(Murray Hill, NJ), June 1990, pp. 182-191.
[10] R. Kasturi and L. O'Gorman, Eds.,Machine Vision Applications, vol. 5, no. 3 (special issue on Document Image Analysis Techniques), Summer 1992.
[11] R. Kasturi and L. O'Gorman, Eds.,Machine Vision Applications, vol. 5, no. 3 (special issue on Document Image Analysis Techniques), Summer 1992.
[12] D. Klarner and S. Magliveras, "The number of tilings of a block with blocks,"Euro. J. Combinatorics, vol. 9, pp. 317-330, 1988.
[13] M. E. Lesk, "Lex--A lexical analyzer generator," Comp. Sci. Tech. Rep. 39, Bell Lab., Murray Hill, NJ, 1975.
[14] S. Liebowitz, M. Lipshutz, and C. Weir, "Document structure interpretation by integrating multiple knowledge sources," inProc. Symp. Document Anal. Inform. Retrieval, Mar. 1992, pp. 58-76.
[15] G. Lorette (Ed.), inProc. First Int. Conf. Document Anal. Recogn.(Rennes, France), Oct. 1991.
[16] G. Nagy and S. Seth, "Hierarchical representation of optically scanned documents," inProc. 7th Int. Conf. Patt. Recogn.(Montreal, Canada), 1984, pp. 347-349.
[17] G. Nagy, S. Seth, and S. D. Stoddard, "Document analysis with an expert system, " inPattern Recognition in Practice II(E. S. Gelsema and L. Kanal, Eds.). New York: Elsevier Science, 1986.
[18] G. Nagy, J. Kanai, and M. Krishnamoorthy, "Two complementary techniques for digitized document analysis," inProc. ACM Conf. on Document Processing Systems, 1988, pp. 169-176.
[19] G. Nagy, "Document analysis and optical character recognition," inProgress in Image Analysis and Processing(V. Cantoni, L. P. Cordella, S. Levialdi, and G. Sanniti di Baja, Eds.). Singapore: World Scientific, 1990, pp. 511-529.
[20] G. Nagy and M. Viswanathan, "Dual representation of segmented technical documents," inProc. First Int. Conf. Document Anal. Recogn.(Rennes, France), Oct. 1991, pp. 141-151.
[21] G. Nagy, "Towards a structured-document-image utility," inStructured Image Analysis(H. S. Baird, H. Bunke, and K. Yamamoto, Eds.). New York: Springer Verlag, 1992, pp. 54-69.
[22] G. Nagy, S. Seth, and M. Viswanathan, "A prototype image analysis system for technical journals,"Computer, Vol. 25, no. 7 (special issue on Document Image Analysis Systems), pp. 10-22, July 1992.
[23] T. A. Nartker (Editor), inProc. Symp. Document Anal. Inform. Retrieval(Inform. Sci. Res. Inst., Univ. Nevada, Las Vegas), Mar. 1992.
[24] L. O'Gorman and R. Kasturi (Eds.),Computer, vol. 25, no. 7 (special issue on Document Image Analysis Systems), July 1992.
[25] T. Pavlidis and S. Mori (Eds.),Proc. IEEE, vol. 80, no. 7 (special issue on Optical Character Recognition), July 1992.
[26] S. V. Rice, J. Kanai, and T. A. Nartker, "A report on the accuracy of OCR Devices," Inform. Sci. Res. Inst., Univ. Nevada, Las Vegas, Mar. 4, 1992.
[27] E. Tanaka, "Theoretical aspects of syntactic pattern recognition," in Memo. Graduate Sch. Sci. Technol. (Kobe Univ.), 1992, pp. 111-126, vol. 10-A.
[28] M. Viswanathan and M. S. Krishnamoorthy, "A syntactic approach to document segmentation," inStructural Pattern Analysis(R. Mohr, T. Pavlidis, and A. Sanfeliu, Eds). Singapore: World Scientific, 1989, pp. 197-215.
[29] M. Viswanathan, "A syntactic approach to document segmentation and labeling," Ph.D. thesis, Dept. Elect., Comput. Syst. Eng., Rensselaer Polytechnic Inst., Dec. 1990.
[30] M. Viswanathan, "Analysis of scanned documents--A syntactic approach," inStructured Image Analysis(H. S. Baird, H. Bunke, and K. Yamamoto, Eds.). New York: Springer Verlag, 1992, pp. 115-136.
[31] M. Viswanathan and G. Nagy, "Characteristics of digitized images of technical articles," inProc. Int. Soc. Opt. Eng. (SPIE)--Machine Vision Applications Character Recogn. Ind. Applications, 1992, pp. 6-17, vol. 1661.
[32] J. Yu, "Document analysis using x-y tree and rule-based system," Master's thesis, Dept. Elect. Comput. Syst. Eng., Rensselaer Polytechnic Inst., Troy, NY, Dec. 1986.

Index Terms:
syntactic segmentation; image recognition; document image processing; horizontal projection profiles; digitized pages; vertical projection profiles; scanned page images; technical documents; thresholded profile strings; compiler utilities; Lex; Yacc; block grammars; error recovery; branch and bound; Unix Shell; labeling; labeled x-y tree; document image processing; feature extraction; grammars; image recognition; image segmentation
Citation:
M. Krishnamoorthy, G. Nagy, S. Seth, M. Viswanathan, "Syntactic Segmentation and Labeling of Digitized Pages from Technical Journals," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 15, no. 7, pp. 737-747, July 1993, doi:10.1109/34.221173
Usage of this product signifies your acceptance of the Terms of Use.