First International Conference on Innovative Computing, Information and Control - Volume III (ICICIC'06)
Script Identification of Document Image Analysis
Beijing, China
August 30-September 01
ISBN: 0-7695-2616-0
Juan Cheng, Zhengzhou Information Science and Technology Institute, China
Xijian Ping, Zhengzhou Information Science and Technology Institute, China
Guanwei Zhou, Zhengzhou Information Science and Technology Institute, China
Yang Yang, Zhengzhou Information Science and Technology Institute, China
Script identification prior to OCR is necessary in document image analysis. And each script has unique spatial distribution and visual attribute that make it possible to identify itself from other languages. The key technology of script identification algorithm is to abstract effective measure feature. By analyzing vision differences based on normalized histogram statistic, Chinese, Japanese, English and Russian are identified respectively from others. Therefore, automatic identification of four scripts is realized successfully.
Citation:
Juan Cheng, Xijian Ping, Guanwei Zhou, Yang Yang, "Script Identification of Document Image Analysis," icicic, vol. 3, pp.178-181, First International Conference on Innovative Computing, Information and Control - Volume III (ICICIC'06), 2006