Issue No. 12 - December (2010 vol. 32)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TPAMI.2010.30
Debashis Ghosh , Indian Institute of Technology Roorkee, Roorkee
Tulika Dube , Indian Institute of Management Ahmedabad, Ahmedabad
Adamane P. Shivaprasad , Sambhram Institute of Technology, Bangalore
A variety of different scripts are used in writing languages throughout the world. In a multiscript, multilingual environment, it is essential to know the script used in writing a document before an appropriate character recognition and document analysis algorithm can be chosen. In view of this, several methods for automatic script identification have been developed so far. They mainly belong to two broad categories—structure-based and visual-appearance-based techniques. This survey report gives an overview of the different script identification methodologies under each of these categories. Methods for script identification in online data and video-texts are also presented. It is noted that the research in this field is relatively thin and still more research is to be done, particularly in the case of handwritten documents.
Document analysis, optical character recognition, script identification, multiscript document.
T. Dube, A. P. Shivaprasad and D. Ghosh, "Script Recognition—A Review," in IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 32, no. , pp. 2142-2161, 2010.