This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Correlation Filter: An Accurate Approach to Detect and Locate Low Contrast Character Strings in Complex Table Environment
December 2004 (vol. 26 no. 12)
pp. 1639-1644
Correlation has been used extensively in object detection field. In this paper, two kinds of correlation filters, Minimum Average Correlation Energy (MACE) and Extended Maximum Average Correlation Height (EMACH), are applied as adaptive shift locators to detect and locate smudgy character strings in complex tabular color flight coupon images. These strings in irregular tabular coupon are computer-printed characters but of low contrast and could be shifted out of the table so that we cannot detect and locate them using traditional algorithms. In our experiment, strings are extracted in the preprocessing phase by removing background and then based on geometric information, two correlation filters are applied to locate expected fields. We compare results from two correlation filters and demonstrate that this algorithm is a high accurate approach.

[1] J. Mao, R. Lorie, and K. Mohiuddin, “A System for Automatically Reading IATA Flight Coupons,” Proc. Fourth Int'l Conf. Document Analysis and Recognition, pp. 153-157, 1997.
[2] Z. Li, “Detection of Text Regions from Digital Engineering Drawings,” IEEE Tran. Pattern Analysis and Machine Intelligence, vol. 20, no. 4, pp. 431-439, Apr. 1998.
[3] P. Tofani and R. Kasturi, “Segmentation of Text from Color Map Images,” Proc. 14th Int'l Conf. Pattern Recognition, pp. 945-947, 1998.
[4] C. Zhang, “A New Method of Color Image Segmentation Based on Intensity and Hue Clustering,” Proc. 15th Int'l Conf. Pattern Recognition, pp. 613-616, 1999.
[5] L.Y. Tseng and R.C. Chen, “Recognition and Data Extraction of Form Documents Based on 3 Types of Line Segments,” Pattern Recognition, vol. 31, no. 10, pp. 1525-1540, 1998.
[6] S.L. Taylor, R. Fritzson, and J.A. Pastor, “Extraction of Data from Preprinted Forms,” Machine Vision and Application, vol. 5, pp. 211-222, 1992.
[7] T. Watanabe, Q. Luo, and N. Sugie, “Layout Recognition of Multi-Kinds of Table-Form Documents,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 17, no. 4, pp. 432-445, Apr. 1995.
[8] K.C. Fan, J.M. Lu, and G.D. Chen, “A Feature Point Clustering Approach to the Recognition of Form Documents,” Pattern Recognition, vol. 31, no. 9, pp. 1205-1220, 1998.
[9] R. Casey, D. Ferguson, K. Mohiuddin, and E. Walach, “Intelligent Forms Processing System,” Machine Vision and Applications, vol. 5, no. 5, pp. 143-155, 1991.
[10] Y.Y. Tang, H. Ma, J. Liu, B.F. Li, and D. Xi, “Multiresolution Analysis in Extraction of Reference Lines from Documents with Gray-Level Background,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 19, no. 8, pp. 921-926, Aug. 1997.
[11] F. Cesarini, E. Francesconi, M. Gori, and G. Soda, “Analysis and Understanding of Multi-Class Invoices,” Int'l J. Document Analysis Recognition, vol. 6, no. 2, pp. 102-114, 2003.
[12] J.L. Chen and H.J. Lee, “Field Data Extraction for Form Document Processing Using a Gravitation-Based Algorithm,” Pattern Recognition, vol. 34, no. 9, pp. 1741-1750, 2001.
[13] B. Yu and A.K. Jain, “A Generic System for Form Dropout,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 18, no. 11, pp. 1127-1134, Nov. 1996.
[14] Y. Li et al., “String Extraction in Complex Coupon Environment Using Statistical Approach,” Proc. Seventh Int'l Conf. Document Analysis and Recognition, pp. 289-294, 2003.
[15] Y. Li et al., “High Accurate Character Strings Extraction from Coupon Images Using Hybrid Neural Network,” Computer Eng. and Application, vol. 40, no. 8, pp. 209-212, 2004. (in Chinese)
[16] S. Haykin, Neural Networks-A Comprehensive Foundation, pp. 392-440, second ed. Prentice-Hall Inc., 1998.
[17] T. Kohonen, “The Self-Organizing Map,” Proc. IEEE, vol. 78, no. 9, pp. 1464-1480, Sept. 1990.
[18] D. Xi and S.-W. Lee, “Reference Line Extraction from Form Documents with Complicated Backgrounds,” Proc. Seventh Int'l Conf. Document Analysis and Recognition, pp. 1080-1084, Aug. 2003.
[19] C.F. Hester and D. Casasent, “Multivariant Technique for Multiclass Pattern Recognition,” Applied Optics, vol. 19, pp. 1758-1761, 1980.
[20] M. Sonka et al., Image Processing, Analysis and Machine Vision, pp. 191-194, second ed. Thomson Learning, 2000.
[21] R.C. Gonzalez et al., Digital Image Processing, pp. 703-704, second ed. Prentice Hall, 2002.
[22] A. Mahalanobis, B.V.K. Vijaya Kumar, and D. Casasent, “Minimum Average Correlation Energy Filters,” Applied Optics, vol. 26, pp. 3633-3630, 1987.
[23] R. Singh and B.V.K. Vijaya Kumar, “Performance of the Extended Maximum Average Correlation Height (EMACH) Filter and the Polynomial Distance Classifier Correlation Filter (PDCCF) for Multiclass SAR Detection and Classification,” Proc. SPIE, vol. 4727, pp. 265-276, 2002.
[24] M. Savvides, B.V.K. Vijaya Kumar, and P. Khosla, “Face Verification Using Correlation Filters,” Proc. Third IEEE Automatic Identification Advanced Technologies, pp. 56-61, 2002.

Index Terms:
Document analysis, graphics recognition, pattern analysis, correlation theory.
Citation:
Yi Li, Zhiyan Wang, Haizan Zeng, "Correlation Filter: An Accurate Approach to Detect and Locate Low Contrast Character Strings in Complex Table Environment," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 26, no. 12, pp. 1639-1644, Dec. 2004, doi:10.1109/TPAMI.2004.117
Usage of this product signifies your acceptance of the Terms of Use.