Aug. 30, 2004 to Sept. 3, 2004
Wilfried Njomgue Sado , Technology University of Compi?gne, France; Suez Environnement CIRSEE Information Technology Division
Dominique Fontaine , Technology University of Compi?gne, France
Philippe Fontaine , Suez Environnement CIRSEE Information Technology Division
This article presents and evaluates an innovating method of automatic indexing. It combines a linguistic analysis of the document to be indexed and a statistical analysis by the singular values decomposition of words in the document. The weighting of words combines advantages of their local and global context as well as their position compared to others terms _the co-occurrence. An application was developed in order to propose assignments topics of documents to a hierarchical referential. Finally, we will present experiments results and evaluation carried out on documents of Suez-Environment.
Wilfried Njomgue Sado, Dominique Fontaine, Philippe Fontaine, "A Linguistic and Statistical Approach for Extracting Knowledge from Documents", DEXA, 2004, 2012 23rd International Workshop on Database and Expert Systems Applications, 2012 23rd International Workshop on Database and Expert Systems Applications 2004, pp. 454-458, doi:10.1109/DEXA.2004.1333516