The Community for Technology Leaders
Internet and Web Applications and Services, International Conference on (2010)
Barcelona, Spain
May 9, 2010 to May 15, 2010
ISBN: 978-0-7695-4022-1
pp: 506-512
ABSTRACT
In this article, we present a new application that evaluated the performance of a number of the Arabic root extraction methods. The implemented methods in this system are selected according to a previous classification, where these methods are classified into five categories. We have selected a method for each category. These methods are: Light Stemmer, Arabic Stemming without a root dictionary, MT-based Arabic Stemmer, N-gram based on similarity coefficient and N-gram based on dissimilarity coefficient. This evaluation was conducted on the same terms in a corpus of two thousand words and their roots. These words are taken from Arabic dictionary "Lesan Al-Arab". This application has allowed us to have a first original comparison between the evaluated methods. This system works in two ways: normal and automatic.
INDEX TERMS
Dictionary, Information extraction, Evaluation, Arabic language, N-gram, Stemmer
CITATION

A. E. Al Hajjar, M. Hajjar and K. Zreik, "A System for Evaluation of Arabic Root Extraction Methods," Internet and Web Applications and Services, International Conference on(ICIW), Barcelona, Spain, 2010, pp. 506-512.
doi:10.1109/ICIW.2010.98
85 ms
(Ver 3.3 (11022016))