Web Intelligence and Intelligent Agent Technology, IEEE/WIC/ACM International Conference on (2011)
Aug. 22, 2011 to Aug. 27, 2011
A semantic linguistic processor which extracts the objects and their links from natural language texts is considered. It is intended for the areas where the automatic formalization of the flows of texts in natural language is required. Peculiarities of the texts are taken into account by linguistic knowledge of the processor: the system can be tuned to various subject areas. We describe the use of this processor for text formalization in different subject areas, such as criminology (summary of incidents, accusatory conclusions, etc.), mass media (documents about terrorist activities), personnel management (autobiographies, resume). Special features of each problem area are examined: the collections of extracted objects, the means for their identification, their connections, occurring contractions, punctuation and special signs, specific character of language constructions, etc. -- all these special features were taken into account in the linguistic knowledge development.
semantics, natural language, linguistic processor, knowledge engineering, data extraction
E. B. Kozerenko, A. G. Matskevich and I. P. Kuznetsov, "Intelligent Extraction of Knowledge Structures from Natural Language Texts," 2011 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies(WI-IAT), Lyon, 2011, pp. 269-272.