The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.05 - September/October (2008 vol.23)
pp: 34-41
Andras Csomai , Google
Rada Mihalcea , University of North Texas
ABSTRACT
Wikipedia can support the development of automatic methods for keyword extraction and word-sense disambiguation. The Wikify system combines these two methods to automatically enrich a text with links to Wikipedia content. The system identifies the important concepts in a given document and automatically links these concepts to the corresponding Wikipedia pages. An evaluation of the system using a Turing-like test shows that the automatic annotations are hardly distinguishable from manual annotations. A second evaluation in an educational environment shows that enriching educational materials with such annotations can improve the learning process by allowing faster access to background knowledge. This article is part of a special issue on Natural Language Processing and the Web.
INDEX TERMS
Word-sense disambiguation, keyword extraction, computers in education, text annotation
CITATION
Andras Csomai, Rada Mihalcea, "Linking Documents to Encyclopedic Knowledge", IEEE Intelligent Systems, vol.23, no. 5, pp. 34-41, September/October 2008, doi:10.1109/MIS.2008.86
REFERENCES
1. G. Salton and C. Buckley, "Term-Weighting Approaches in Automatic Text Retrieval," Information Processing &Management, vol. 24, no. 5, 1988, pp. 513–523.
2. P. Turney, "Learning Algorithms for Key-phrase Extraction," Information Retrieval, vol. 2, no. 4, 2000, pp. 303–336.
3. R. Mihalcea and P. Tarau, "Text Rank—Bringing Order into Texts," Proc. Conf. Empirical Methods in Natural Language Processing (EMNLP04), Assoc. for Computational Linguistics, 2004, pp. 404–411.
4. A. Hulth, "Improved Automatic Keyword Extraction Given More Linguistic Knowledge," Proc. 2003 Conf. Empirical Methods in Natural Language Processing, Assoc. for Computational Linguistics, 2003, pp. 216–233.
5. G. Miller, "WordNet: A Lexical Database," Comm. ACM, vol. 38, no. 11, 1995, pp. 39–41.
6. M.E. Lesk, "Automatic Sense Disambiguation Using Machine Readable Dictionaries: How to Tell a Pine Cone from an Ice Cream Cone," Proc. SIGDOCConf. 1986, ACM Press, 1986, pp. 24–26.
7. H.T. Ng and H.B. Lee, "Integrating Multiple Knowledge Sources to Disambiguate Word Sense: An Examplar-Based Approach," Proc. 34th Ann. Meeting Assoc. for Computational Linguistics (ACL 96), Assoc. for Computational Linguistics, 1996, pp. 40–47.
8. R. Mihalcea, "Using Wikipedia for Automatic Word Sense Disambiguation," Human Language Technologies 2007: Conf. North Am. Chapter of the Assoc. for Computational Linguistics, Assoc. for Computational Linguistics, 2007, pp. 196–203.
9. S. Pradhan et al., "Semeval-2007 Task 17: English Lexical Sample, SRL and All Words," Proc. 4th Int'l Workshop Semantic Evaluations (SemEval 07), Assoc. for Computational Linguistics, 2007.
10. R. Navigli and M. Lapata, "Graph Connectivity Measures for Unsupervised Word Sense Disambiguation," Proc. Int'l Joint Conf. Artificial Intelligence (IJCAI07), AAAI Press, 2007, pp. 1683–1688.
11. W. Kintsch, Comprehension: A Paradigm for Cognition, Cambridge Univ. Press, 1998.
12. T. Murray, "Metalinks: Authoring and Affordances for Conceptual and Narrative Flow in Adaptive Hyperbooks," Int'l J. Artificial Intelligence in Education, vol. 13, no. 1, 2002, pp. 199–233.
17 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool