CSDL Home W WI-IAT 2011 Web Intelligence and Intelligent Agent Technology, IEEE/WIC/ACM International Conference on
Aug. 22, 2011 to Aug. 27, 2011
Knowledge-based data mining and classification algorithms require of systems that are able to extract textual attributes contained in raw text documents, and map them to structured knowledge sources (e.g. ontologies) so that they can be semantically analyzed. The system presented in this paper performs this tasks in an automatic way, relying on a predefined ontology which states the concepts in this the posterior data analysis will be focused. As features, our system focuses on extracting relevant Named Entities from textual resources describing a particular entity. Those are evaluated by means of linguistic and Web-based co-occurrence analyses to map them to ontological concepts, thereby discovering relevant features of the object. The system has been preliminary tested with tourist destinations and Wikipedia textual resources, showing promising results.
Ontologies, Information Extraction, Linguistic Patterns, Web-based statistics
Carlos Vicient, David S´nchez, Antonio Moreno, "Ontology-Based Feature Extraction", WI-IAT, 2011, Web Intelligence and Intelligent Agent Technology, IEEE/WIC/ACM International Conference on, Web Intelligence and Intelligent Agent Technology, IEEE/WIC/ACM International Conference on 2011, pp. 189-192, doi:10.1109/WI-IAT.2011.199