The Community for Technology Leaders
2016 17th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD) (2016)
Shanghai, China
May 30, 2016 to June 1, 2016
ISBN: 978-1-5090-0804-9
pp: 451-456
Nabil Khoufi , ANLP Research Group, MIRACL Laboratory, FSEGS, University of Sfax, Tunisia
Chafik Aloulou , ANLP Research Group, MIRACL Laboratory, FSEGS, University of Sfax, Tunisia
Lamia Hadrich Belguith , ANLP Research Group, MIRACL Laboratory, FSEGS, University of Sfax, Tunisia
ABSTRACT
Parsing Arabic language is a difficult task given the specificities of the language and given the scarcity of linguistic resources. Linguistic resources such as grammars are very important to any natural language processing application. Unfortunately, the manual construction of these resources is laborious and time-consuming. The use of annotated corpora as a knowledge database might be a solution to a fast construction of a grammar for a given language. In this paper, we began by presenting an overview of our method to automatically induce a probabilistic context free grammar from an Arabic annotated corpus (The Penn Arabic TreeBank). Then we tested the obtained grammar in the parsing task and we expose the evaluation results. Finally we present our vision of a hybrid method for parsing Modern Standard Arabic (MSA) that we believe that it could enhance obtained results.
INDEX TERMS
Grammar, Pragmatics, Syntactics, Context, Standards, Natural language processing, Probabilistic logic
CITATION

N. Khoufi, C. Aloulou and L. H. Belguith, "Toward hybrid method for parsing Modern Standard Arabic," 2016 17th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD), Shanghai, China, 2016, pp. 451-456.
doi:10.1109/SNPD.2016.7515939
185 ms
(Ver 3.3 (11022016))