2007 IEEE Congress on Services (Services 2007)
Infrastructure for Annotation-Driven Information Extraction from the Primary Scientific Literature: Principles and Practice
Salt Lake City, Utah, USA
July 09-July 13
ISBN: 0-7695-2926-7
We present an informatics infrastructure for biocuration, based on a combination of techniques from Information Extraction (IE) and Knowledge Engineering (KE). We describe the high-level design of this infrastructure which we base on the concept of ?experimental type?. Here, we treat each experiment as a specific type of knowledge statement determined by the experiment?s design. We provide a preliminary, detailed example of the use of the infrastructure to support the construction of a database pertaining to neuroanatomical tract-tracing experiments. This work generalizes to provide support for other experimental types and could be used to make biocuration efforts more efficient. We also discuss how the process of annotating text for IE directly supports designing schema for databases. We envisage how this architecture could support small-scale, laboratory-centric knowledge bases that each support service-oriented functionality.
Citation:
Gully Burns, Donghui Feng, Tommy Ingulfsen, Eduard Hovy, "Infrastructure for Annotation-Driven Information Extraction from the Primary Scientific Literature: Principles and Practice," services, pp.122-129, 2007 IEEE Congress on Services (Services 2007), 2007