Eighth International Conference on Document Analysis and Recognition (ICDAR'05) Identifying Script onWord-Level with Informational Confidenc Seoul, Korea August 31-September 01 ISBN: 0-7695-2420-6
In this paper, we present a multiple classifier system for script identification. Applying a Gabor filter analysis of textures on word-level, our system identifies Latin and non-Latin words in bilingual printed documents. The classifier system comprises four different architectures based on nearest neighbors, weighted Euclidean distances, Gaussian mixture models, and support vector machines.We report results for Arabic, Chinese, Hindi, and Korean script. Moreover, we show that combining informational confidence values using sum-rule can consistently outperform the best single recognition rate.
Citation:
Stefan Jaeger, Huanfeng Ma, David Doermann, "Identifying Script onWord-Level with Informational Confidenc," icdar, pp.416-420, Eighth International Conference on Document Analysis and Recognition (ICDAR'05), 2005 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||