Seventh ACIS International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing (SNPD'06) A Dynamic and Self-study Language Model Oriented to Chinese Characters Input Las Vegas, Nevada June 19-June 20 ISBN: 0-7695-2611-X
In this paper, a statistic language model is put forward to predict the next inputting word to improve the performance of the input method. So this paper constructs a general language model and a user language model, and then combines them into a new language model which was called as dynamic and self-study language model. Using the general language model in our experiment, the average length of input codes (ALIC) is reduced from 2.557 to 2.479 and the hit rate of first characters (HRFC) is also improved from 78.704% to 96.202%. Using the dynamic and self-study language model in our experiment, when the number of inputted Chinese characters is less then 20 thousand, the HRFC increases rapidly, while the ALIC reduces rapidly. And when the number is greater than 20 thousand, the HRFC and ALIC become steady. Thus it?s clear that dynamic and self-study language model performs well in input method. Otherwise, we provide a modified Church-Gale smoothing method to reduce the size of general language model. This method can reduce the size to 5 percent in order to fit the request of handheld device.
Citation:
Li Pei-feng, Gu Ping, Zhu Qiao-ming, "A Dynamic and Self-study Language Model Oriented to Chinese Characters Input," snpd-sawn, pp.311-318, Seventh ACIS International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing (SNPD'06), 2006 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||