Rotation Invariant Texture Features and Their Use in Automatic Script Identification
July 1998 (vol. 20 no. 7)
pp. 751-756
DOI Bookmark:
http://doi.ieeecomputersociety.org/10.1109/34.689305
Abstract—This paper concerns the extraction of rotation invariant texture features and the use of such features in script identification from document images. Rotation invariant texture features are computed based on an extension of the popular multi-channel Gabor filtering technique, and their effectiveness is tested with 300 randomly rotated samples of 15 Brodatz textures. These features are then used in an attempt to solve a practical but hitherto mostly overlooked problem in document image processing—the identification of the script of a machine printed document. Automatic script and language recognition is an essential front-end process for the efficient and correct use of OCR and language translation products in a multilingual environment. Six languages (Chinese, English, Greek, Russian, Persian, and Malayalam) are chosen to demonstrate the potential of such a texture-based approach in script identification. [1] R.M. Haralick, "Statistical and Structural Approaches to Texture," Proc. IEEE, vol. 67 no. 5, pp. 786-804, 1979.[2] T.R. Reed and J.M. Hans Du Buf, "A Review of Recent Texture Segmentation and Feature Extraction Techniques," CVGIP: Image Understanding, vol.57, 1993, pp. 359-372.[3] T.N. Tan, "Geometric Transform Invariant Texture Analysis," Proc. SPIE, vol. 2,488, 1995, pp. 475-485.[4] V.K. Govindan and A.P. Shivaprasad, "Character Recognition—A Review," Pattern Recognition, vol. 23, pp. 671-683, 1990.[5] J. Mantas, "An Overview of Character Recognition Methods," Pattern Recognition, vol. 19, pp. 425-430, 1986.[6] O.D. Trier, A.K. Jain, and T. Taxt, "Feature Extraction Methods for Character Recognition—A Survey," Pattern Recognition, vol. 29, pp. 641-662, 1996.[7] T.N. Tan, "Noise Robust and Rotation Invariant Texture Classification," Proc. EUSIPCO-94, pp. 1,377-1,380, 1994.[8] T.N. Tan, "Written Language Recognition Based on Texture Analysis," Proc. IEEE ICIP96, vol. 3, pp. 185-188, 1996.[9] D.A. Pollen and S.F. Ronner, "Visual Cortical Neurons as Localized Spatial Frequency Filters," IEEE Trans. Systems, Man, and Cybernetics, vol. 13, pp. 907-916, 1983.[10] J.G. Daugman, "Uncertainty Relation for Resolution in Space, Spatial Frequency, and Orientation Optimized by 2D Visual Cortical Filters," J. Optical Soc. Am. A, vol. 2, pp. 1,160-1,169, 1985.[11] T.N. Tan, "Multi-Channel Filtering Approaches to Texture Analysis," Proc. CIE and Proc. IEEE Int'l. Conf. Signal Processing,Beijing, China, Oct. 1993.[12] T.N. Tan, "Texture Feature Extraction Via Cortical Channel Modelling," Proc. 11th IAPR Int'l Conf. Pattern Recognition, IEEE CS Press, C607-C610, 1992.[13] M. Leung and A.M. Peterson, Scale and Rotation Invariant Texture Classification Proc. Int'l Conf. Acoustics, Speech, and Signal Processing, pp. 461-165, 1991.[14] G.M. Haley and B.S. Manjunath, "Rotation Invariant Texture Classification Using Modified Gabor Filters," Proc. IEEE ICIP95, pp. 262-265, 1995.[15] H. Greenspan, S. Belongie, R. Goodman, and P. Perona, Rotation Invariant Texture Recognition Using a Steerable Pyramid Proc. 12th IAPR Int'l Conf. Pattern Recognition, vol. 2, pp. 162-167, 1994.[16] P. Brodatz, Textures: A Photographic Album for Artists and Designer.New York, NY: Dover, 1966.[17] K. Fukunaga, Introduction to Statistical Pattern Recognition.New York, NY: Academic Press, 1972.[18] Y.K. Muthusamy, E. Barnard, and R.A. Cole, "Reviewing Automatic Language Identification," IEEE Signal Proc., Oct. 1994, pp. 33-41.[19] D.S. Lee, C.R. Nohl, and H.S. Baird, "Language Identification in Complex, Unoriented, and Degraded Document Images," Proc. IAPR Workshop on Document Analysis Syst., pp. 76-98, Oct. 1996.[20] A.L. Spitz, “Determination of the Script and Language Content of Document Images,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 19, no. 3, pp. 235-245, Mar. 1997.[21] J. Hochberg, P. Kelly, T. Thomas, and L. Kerns, “Automatic Script Identification from Document Images Using Cluster-Based Templates,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 19, no. 2, pp. 176-181, Feb. 1997.[22] S. Wood, X. Yao, K. Krishnamurthi, and L. Dang, "Language Identification for Printed Text Independent of Segmentation," Int'l Conf. Image Processing, pp. 428-431, Oct. 1995.[23] H.S. Baird, "The Skew Angle of Printed Documents," Document Image Analysis, L. O'Gorman and R. Kasturi, eds., IEEE CS Press, pp. 204-208, 1995.[24] A.K. Jain and Y. Zhong, "Page Segmentation Using Texture Analysis," Pattern Recognition, vol. 29, pp. 743-770, 1996.[25] A.K. Jain and B. Chandrasekaran, "Dimensionality and Sample Size Considerations in Pattern Recognition Practice," Handbook Statistics, P.R. Krishnaiah and L.N. Kanal, eds., vol. 2, pp. 835-855.Amsterdam: North-Holland, 1982.[26] S.R. Fountain and T.N. Tan, "Extraction of Noise Robust and Rotation Invariant Texture Features via Multichannel Filtering," Proc. IEEE Int'l Conf. Image Processing, vol. 3, pp. 197-200, Oct. 1997.[27] G.S. Peake and T.N. Tan, "Script and Language Identification from Document Images," Proc. Eighth British Mach. Vision Conf., vol. 2, pp. 230-233, Sept. 1997.[28] J. Ding, L. Lam, and C.Y. Suen, "Classification of Oriental and European Scripts by Using Characteristic Features," Proc. ICDAR'97, pp. 1,023-1,027.[29] L. O'Gorman and R. Kasturi, Document Image Analysis.Los Alamitos, Calif.: IEEE CS Press, 1995.[30] A.C. Bovik,M. Clark,, and W.S. Geisler,“Multichannel texture analysis using localized spatial filters,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 12, pp. 55-73, 1990.
Index Terms:
Texture, rotation invariance, Gabor filtering, script and language recognition, document image processing.
Citation:
T.n. Tan, "Rotation Invariant Texture Features and Their Use in Automatic Script Identification," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 20, no. 7, pp. 751-756, July 1998, doi:10.1109/34.689305
Usage of this product signifies your acceptance of the
Terms of Use.
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||