A Fast Algorithm for Bottom-Up Document Layout Analysis
March 1997 (vol. 19 no. 3)
pp. 273-277

Abstract: This paper describes a new bottom-up method for document layout analysis. The algorithm was implemented in the CLiDE (Chemical Literature Data Extraction) system (http://chem.leeds.ac.uk/ICAMS/CLiDE.html), but the method described here is suitable for a broader range of documents. It is based on Kruskal's algorithm and uses a special distance metric between the components to construct the physical page structure. The method has all the major advantages of bottom-up systems: independence from different text spacing and independence from different block alignments. The algorithm's computational complexity is reduced to linear by using heuristics and path compression.
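
To make the combination of Kruskal's algorithm and path compression concrete, the following Python fragment is a minimal sketch, not the authors' implementation. It assumes page components are axis-aligned bounding boxes `(x0, y0, x1, y1)`. The names `box_distance`, `cluster_boxes`, and `max_gap` are hypothetical, and the plain rectangle-gap distance is only a stand-in for the paper's special distance metric, which the abstract does not define.

```python
from itertools import combinations


def find(parent, i):
    """Return the root of i's set, compressing the path as we go."""
    while parent[i] != i:
        parent[i] = parent[parent[i]]  # path halving, a form of path compression
        i = parent[i]
    return i


def box_distance(a, b):
    """Euclidean gap between two boxes (x0, y0, x1, y1).

    Assumption: a stand-in for the paper's metric, which is not given here.
    """
    dx = max(a[0] - b[2], b[0] - a[2], 0)
    dy = max(a[1] - b[3], b[1] - a[3], 0)
    return (dx * dx + dy * dy) ** 0.5


def cluster_boxes(boxes, max_gap):
    """Kruskal-style merging: join the closest components first, stop at max_gap."""
    parent = list(range(len(boxes)))
    # All pairwise edges, shortest first. The paper uses heuristics to avoid
    # enumerating every pair; this sketch does not.
    edges = sorted(
        (box_distance(boxes[i], boxes[j]), i, j)
        for i, j in combinations(range(len(boxes)), 2)
    )
    for dist, i, j in edges:
        if dist > max_gap:
            break  # remaining edges are longer, so they separate blocks
        root_i, root_j = find(parent, i), find(parent, j)
        if root_i != root_j:
            parent[root_j] = root_i  # merge the two components
    groups = {}
    for i in range(len(boxes)):
        groups.setdefault(find(parent, i), []).append(i)
    return sorted(groups.values())


boxes = [(0, 0, 10, 10), (12, 0, 22, 10), (100, 0, 110, 10), (112, 0, 122, 10)]
print(cluster_boxes(boxes, max_gap=5))  # → [[0, 1], [2, 3]]
```

Because the sketch sorts every pair of boxes, it runs in quadratic time or worse. The linear bound claimed in the abstract depends on the paper's heuristics for limiting candidate edges, which are not reproduced here.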

Index Terms:
Document analysis, physical page layout, bottom-up layout analysis, Kruskal's algorithm, spanning tree, chemical documents.
Citation:
Anikó Simon, Jean-Christophe Pret, A. Peter Johnson, "A Fast Algorithm for Bottom-Up Document Layout Analysis," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 19, no. 3, pp. 273-277, March 1997, doi:10.1109/34.584106