Fourth International Conference Document Analysis and Recognition (ICDAR'97)
Classification of Oriental and European Scripts by Using Characteristic Features
Ulm, Germany
August 18-August 20
ISBN: 0-8186-7898-4
Two types of techniques are usually adopted in language differentiation: token matching and statistical analysis. In this paper we present a method that uses a combined analysis of several discriminating statistical features to differentiate between European and Oriental language scripts. When applied to more than 23 languages, it proved effective in classifying documents printed in these different scripts.
Index Terms:
language differentiation, script identification, oriental script features, Roman script features, multiple features combination.
Citation:
J. Ding, L. Lam, Ching Y. Suen, "Classification of Oriental and European Scripts by Using Characteristic Features," icdar, pp.1023, Fourth International Conference Document Analysis and Recognition (ICDAR'97), 1997