Issue No. 02 - February (1997 vol. 19)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/34.574802
<p><b>Abstract</b>: We describe an automated script identification system for typeset document images. Templates for each script are created by clustering textual symbols from a training set. Symbols from new images are then compared to these templates to find the best-matching script. Our current system processes thirteen scripts with minimal preprocessing and high accuracy.</p>
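The two stages named in the abstract, building per-script templates by clustering training symbols and then matching symbols from a new image against those templates, can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: symbols are represented as small fixed-length feature vectors, clustering is a simple greedy "leader" scheme with a hypothetical `radius` threshold, distance is Euclidean, and the winning script is the one with the lowest mean nearest-template distance. The function names and toy data are likewise hypothetical.

```python
from math import sqrt


def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def build_templates(symbols, radius):
    """Greedy leader clustering (an assumed stand-in for the paper's clustering).

    A symbol joins the nearest existing cluster if its centroid lies within
    `radius`; otherwise it starts a new cluster. Each final centroid is
    returned as one template.
    """
    clusters = []  # each entry: [coordinate sums, member count]
    for sym in symbols:
        best_i, best_d = None, radius
        for i, (total, n) in enumerate(clusters):
            centroid = [t / n for t in total]
            d = euclidean(sym, centroid)
            if d <= best_d:
                best_i, best_d = i, d
        if best_i is None:
            clusters.append([list(sym), 1])
        else:
            total, n = clusters[best_i]
            clusters[best_i] = [[t + s for t, s in zip(total, sym)], n + 1]
    return [tuple(t / n for t in total) for total, n in clusters]


def train(training_set, radius=0.5):
    """Build one template list per script from labeled training symbols."""
    return {script: build_templates(syms, radius)
            for script, syms in training_set.items()}


def identify_script(symbols, templates_by_script):
    """Return the script whose templates lie closest to the symbols on average."""
    def mean_nearest(templates):
        nearest = [min(euclidean(s, t) for t in templates) for s in symbols]
        return sum(nearest) / len(nearest)
    return min(templates_by_script,
               key=lambda script: mean_nearest(templates_by_script[script]))


# Toy 2-D "symbols"; real input would be features extracted from page images.
training = {
    "script_a": [(0.0, 0.0), (0.1, 0.0), (1.0, 1.0), (0.9, 1.0)],
    "script_b": [(5.0, 5.0), (5.1, 5.0), (6.0, 4.0)],
}
templates = train(training, radius=0.5)
guess_a = identify_script([(0.05, 0.1), (1.0, 0.9)], templates)
guess_b = identify_script([(5.0, 4.9)], templates)
print(guess_a, guess_b)
```

In this sketch the `radius` parameter controls how many templates each script receives: a smaller value yields more, finer-grained templates, at the cost of more comparisons per symbol.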
Index Terms: Script identification, document analysis, optical character recognition.
P. Kelly, J. Hochberg, T. Thomas and L. Kerns, "Automatic Script Identification From Document Images Using Cluster-Based Templates," in IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 19, no. 2, pp. 176-181, Feb. 1997.