loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
21st International Conference on Data Engineering Workshops (ICDEW'05)
A Sentence Classification System for Multi Biomedical Literature Summarization
Tokyo, Japan
April 05-April 08
ISBN: 0-7695-2657-8
Yasunori Yamamoto, University of Tokyo, Tokyo, Japan
Toshihisa Takagi, Computational Biology,University of Tokyo, Tokyo, Japan

A PubMed search often returns a long list of queryrelated papers that a researcher cannot cope with in a short time. As a first step to address this issue by summarizing retrieved papers, we developed a system to classify sentences of abstracts obtained from the MEDLINE database into five rhetorical statuses: background, purpose, method, result, or conclusion.

We used Support Vector Machine (SVM) classifiers and trained each of them for a different rhetorical status on structured abstracts. A structured abstract is one that has labels indicating rhetorical statuses of the sentences, while an unstructured abstract does not. The classifiers were tested on both structured and unstructured abstracts. The former were randomly obtained from the MEDLINE database and the latter were manually labeled by humans. We compared our method with a previously reported one. In addition, we combined them and evaluated the combined method.

Our method outperformed the previously reported one, and the combined method showed even better results. Classified abstracts can be used for multi-document summarization that provides researchers with a way of learning a research topic efficiently and effectively.

Citation:
Yasunori Yamamoto, Toshihisa Takagi, "A Sentence Classification System for Multi Biomedical Literature Summarization," icdew, pp.1163, 21st International Conference on Data Engineering Workshops (ICDEW'05), 2005
Usage of this product signifies your acceptance of the Terms of Use.