|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
2009 International Conference on Asian Language Processing
Automatic Acquisition of Large-Scale Academic Bilingual Parallel Corpus from the Web
Singapore
December 07-December 09
ISBN: 978-0-7695-3904-1
| ASCII Text | x | ||
| Han Yong, Li Yu, He Xiaoning, Yang Muyun, Lei Guohua, "Automatic Acquisition of Large-Scale Academic Bilingual Parallel Corpus from the Web," Asian Language Processing, International Conference on, pp. 318-321, 2009 International Conference on Asian Language Processing, 2009. | |||
| BibTex | x | ||
| @article{ 10.1109/IALP.2009.75, author = {Han Yong and Li Yu and He Xiaoning and Yang Muyun and Lei Guohua}, title = {Automatic Acquisition of Large-Scale Academic Bilingual Parallel Corpus from the Web}, journal ={Asian Language Processing, International Conference on}, volume = {0}, year = {2009}, isbn = {978-0-7695-3904-1}, pages = {318-321}, doi = {http://doi.ieeecomputersociety.org/10.1109/IALP.2009.75}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - CONF JO - Asian Language Processing, International Conference on TI - Automatic Acquisition of Large-Scale Academic Bilingual Parallel Corpus from the Web SN - 978-0-7695-3904-1 SP318 EP321 A1 - Han Yong, A1 - Li Yu, A1 - He Xiaoning, A1 - Yang Muyun, A1 - Lei Guohua, PY - 2009 KW - data mining KW - bilingual parallel corpora acquision KW - bilingual term acquision VL - 0 JA - Asian Language Processing, International Conference on ER - | |||
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/IALP.2009.75
In this paper, we describe a system which automatically acquires large-scale Chinese-English bilingual parallel corpus from China Journals Full-text Database (CJFD), a component of China National Knowledge Infrastructure (CNKI). The system gets large amount of parallel texts with domain information from the existing structured bilingual texts in CJFD, such as Chinese and English abstracts and titles of academic articles. The acquired Chinese-English parallel corpus is by several orders of magnitudes larger than similar corpus we have known before. In addition, this system collects a large amount of bilingual terms which can directly apply to lexical acquisition.
Index Terms:
data mining, bilingual parallel corpora acquision, bilingual term acquision
Citation:
Han Yong, Li Yu, He Xiaoning, Yang Muyun, Lei Guohua, "Automatic Acquisition of Large-Scale Academic Bilingual Parallel Corpus from the Web," ialp, pp.318-321, 2009 International Conference on Asian Language Processing, 2009
Usage of this product signifies your acceptance of the Terms of Use.
