TextFinder: An Automatic System to Detect and Recognize Text In Images
November 1999 (vol. 21 no. 11)
pp. 1224-1229

Abstract—A robust system is proposed to automatically detect and extract text in images from different sources, including video, newspapers, advertisements, stock certificates, photographs, and checks. Text is first detected using multiscale texture segmentation and spatial cohesion constraints, then cleaned up and extracted using a histogram-based binarization algorithm. An automatic performance evaluation scheme is also proposed.
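
The abstract does not spell out the histogram-based binarization step, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It swaps in Otsu's method, a standard histogram-based threshold selector, and assumes an 8-bit grayscale image with dark text on a lighter background. The names `otsu_threshold` and `binarize` are hypothetical.

```python
def otsu_threshold(pixels):
    """Pick a gray level t (0..255) by Otsu's method.

    Pixels with value <= t form one class and the rest form the other;
    t is chosen to maximize the between-class variance of the histogram.
    This is a stand-in technique, not the paper's own algorithm.
    """
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1

    total = len(pixels)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    w0 = 0      # number of pixels at or below the candidate threshold
    sum0 = 0    # sum of gray levels of those pixels
    for t in range(256):
        w0 += hist[t]
        sum0 += t * hist[t]
        if w0 == 0:
            continue            # no pixels in the lower class yet
        w1 = total - w0
        if w1 == 0:
            break               # upper class is empty from here on
        mean0 = sum0 / w0
        mean1 = (sum_all - sum0) / w1
        between_var = w0 * w1 * (mean0 - mean1) ** 2
        if between_var > best_var:
            best_t, best_var = t, between_var
    return best_t


def binarize(image, t):
    """Map a list-of-rows grayscale image to 1 (text) / 0 (background).

    Assumption: text is darker than the background, so values <= t are text.
    """
    return [[1 if p <= t else 0 for p in row] for row in image]


# A tiny synthetic "image": one dark pixel on a light background.
image = [
    [200, 200, 200],
    [200, 10, 200],
    [200, 200, 200],
]
flat = [p for row in image for p in row]
threshold = otsu_threshold(flat)
binary = binarize(image, threshold)
```

For light text on a dark background, the comparison in `binarize` would be reversed. A practical system would also apply the threshold per detected text region rather than over the whole image, since the abstract places binarization after detection.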

Index Terms:
Text reading, character recognition, multimedia indexing, text detection, texture segmentation, filters, hierarchical processing, binarization, connected-components.
Victor Wu, Raghavan Manmatha, Edward M. Riseman, "TextFinder: An Automatic System to Detect and Recognize Text In Images," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 21, no. 11, pp. 1224-1229, Nov. 1999, doi:10.1109/34.809116