|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
n-Gram Statistics for Natural Language Understanding and Text Processing
February 1979 (vol. 1 no. 2)
pp. 164-172
| ASCII Text | x | ||
| Ching Y. Suen, "n-Gram Statistics for Natural Language Understanding and Text Processing," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 1, no. 2, pp. 164-172, February, 1979. | |||
| BibTex | x | ||
| @article{ 10.1109/TPAMI.1979.4766902, author = {Ching Y. Suen}, title = {n-Gram Statistics for Natural Language Understanding and Text Processing}, journal ={IEEE Transactions on Pattern Analysis and Machine Intelligence}, volume = {1}, number = {2}, issn = {0162-8828}, year = {1979}, pages = {164-172}, doi = {http://doi.ieeecomputersociety.org/10.1109/TPAMI.1979.4766902}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - JOUR JO - IEEE Transactions on Pattern Analysis and Machine Intelligence TI - n-Gram Statistics for Natural Language Understanding and Text Processing IS - 2 SN - 0162-8828 SP164 EP172 EPD - 164-172 A1 - Ching Y. Suen, PY - 1979 VL - 1 JA - IEEE Transactions on Pattern Analysis and Machine Intelligence ER - | |||
n-gram (n = 1 to 5) statistics and other properties of the English language were derived for applications in natural language understanding and text processing. They were computed from a well-known corpus composed of 1 million word samples. Similar properties were also derived from the most frequent 1000 words of three other corpuses. The positional distributions of n-grams obtained in the present study are discussed. Statistical studies on word length and trends of n-gram frequencies versus vocabulary are presented. In addition to a survey of n-gram statistics found in the literature, a collection of n-gram statistics obtained by other researchers is reviewed and compared.
Citation:
Ching Y. Suen, "n-Gram Statistics for Natural Language Understanding and Text Processing," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 1, no. 2, pp. 164-172, Feb. 1979, doi:10.1109/TPAMI.1979.4766902
Usage of this product signifies your acceptance of the Terms of Use.

