The Community for Technology Leaders
Green Image
ABSTRACT
<p><b>Abstract</b>—We describe an automated script identification system for typeset document images. Templates for each script are created by clustering textual symbols from a training set. Symbols from new images are compared to the templates to find the best script. Our current system processes thirteen scripts with minimal preprocessing and high accuracy.</p>
INDEX TERMS
Script identification, document analysis, optical character recognition.
CITATION

P. Kelly, J. Hochberg, T. Thomas and L. Kerns, "Automatic Script Identification From Document Images Using Cluster-Based Templates," in IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 19, no. , pp. 176-181, 1997.
doi:10.1109/34.574802
92 ms
(Ver 3.3 (11022016))