Web Intelligence and Intelligent Agent Technology, IEEE/WIC/ACM International Conference on (2010)
Toronto, Ontario Canada
Aug. 31, 2010 to Sept. 3, 2010
Biomedical entity extraction from unstructured web documents is an important task that needs to be performed in order to discover knowledge in the veterinary medicine domain. In general, this task can be approached by applying domain specific ontologies, but a review of the literature shows that there is no universal dictionary, or ontology for this domain. To address this issue, we manually construct an ontology for extracting entities such as: animal disease names, viruses and serotypes. We then use an automated ontology expansion approach to extract semantic relationships between concepts. Such relationships include asserted synonymy, hyponymy and causality. Specifically, these relationships are extracted by using a set of syntactic patterns and part-of-speech tagging. The resulting ontology contains richer semantics compared to the manually constructed ontology. We compare our approach for extracting synonyms, hyponyms and other disease related concepts, with an approach where the ontology is expanded using GoogleSets, on the veterinary medicine entity extraction task. Experimental results show that our semantic relationship extraction approach produces a significant increase in precision and recall as compared to the GoogleSets approach.
entity extraction, semantic relation discovery, text mining
John Drouhard, Doina Caragea, Landon Fowles, William H. Hsu, Svitlana Volkova, "Boosting Biomedical Entity Extraction by Using Syntactic Patterns for Semantic Relation Discovery", Web Intelligence and Intelligent Agent Technology, IEEE/WIC/ACM International Conference on, vol. 01, no. , pp. 272-278, 2010, doi:10.1109/WI-IAT.2010.152