|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
2012 10th IAPR International Workshop on Document Analysis Systems
Improving Handwritten Chinese Text Recognition by Unsupervised Language Model Adaptation
Gold Coast, Queensland Australia
March 27-March 29
ISBN: 978-0-7695-4661-2
| ASCII Text | x | ||
| Qiu-Feng Wang, Fei Yin, Cheng-Lin Liu, "Improving Handwritten Chinese Text Recognition by Unsupervised Language Model Adaptation," Document Analysis Systems, IAPR International Workshop on, pp. 110-114, 2012 10th IAPR International Workshop on Document Analysis Systems, 2012. | |||
| BibTex | x | ||
| @article{ 10.1109/DAS.2012.46, author = {Qiu-Feng Wang and Fei Yin and Cheng-Lin Liu}, title = {Improving Handwritten Chinese Text Recognition by Unsupervised Language Model Adaptation}, journal ={Document Analysis Systems, IAPR International Workshop on}, volume = {0}, year = {2012}, isbn = {978-0-7695-4661-2}, pages = {110-114}, doi = {http://doi.ieeecomputersociety.org/10.1109/DAS.2012.46}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - CONF JO - Document Analysis Systems, IAPR International Workshop on TI - Improving Handwritten Chinese Text Recognition by Unsupervised Language Model Adaptation SN - 978-0-7695-4661-2 SP110 EP114 A1 - Qiu-Feng Wang, A1 - Fei Yin, A1 - Cheng-Lin Liu, PY - 2012 KW - Handwritten Chinese text recognition KW - Two-pass recognition KW - Language model adaptation KW - Language model compression VL - 0 JA - Document Analysis Systems, IAPR International Workshop on ER - | |||
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/DAS.2012.46
This paper investigates the effects of unsupervised language model adaptation (LMA) in handwritten Chinese text recognition. For no prior information of recognition text is available, we use a two-pass recognition strategy. In the first pass, the generic language model (LM) is used to get a preliminary result, which is used to choose the best matched LMs from a set of pre-defined domains, then the matched LMs are used in the second pass recognition. Each LM is compressed to a moderate size via the entropy-based pruning, tree-structure formatting and fewer-byte quantization. We evaluated the LMA for five LM types, including both character-level and word-level ones. Experiments on the CASIA-HWDB database show that language model adaptation improves the performance for each LM type in all domains. The documents of ancient domain gained the biggest improvement of character-level correct rate of 5.87 percent up and accurate rate of 6.05 percent up.
Index Terms:
Handwritten Chinese text recognition, Two-pass recognition, Language model adaptation, Language model compression
Citation:
Qiu-Feng Wang, Fei Yin, Cheng-Lin Liu, "Improving Handwritten Chinese Text Recognition by Unsupervised Language Model Adaptation," das, pp.110-114, 2012 10th IAPR International Workshop on Document Analysis Systems, 2012
Usage of this product signifies your acceptance of the Terms of Use.
