19th IEEE Symposium on Computer-Based Medical Systems (CBMS'06)
Automatic Extraction of Bibliographic Information from Biomedical Online Journal Articles Using a String Matching Algorithm
Salt Lake City, Utah
June 22-June 23
ISBN: 0-7695-2517-1
A system has been developed to extract bibliographic data (grant numbers and databank accession numbers) from online biomedical journal articles for the National Library of Medicine?s MEDLINE.. database. Rule-based algorithms and a string matching algorithm are proposed to extract the bibliographic data from HTML-formatted articles. Experiments conducted with 411 medical articles from 73 journal issues show an accuracy exceeding 96%.
Citation:
Jongwoo Kim, Daniel X. Le, George R. Thoma, "Automatic Extraction of Bibliographic Information from Biomedical Online Journal Articles Using a String Matching Algorithm," cbms, pp.905-912, 19th IEEE Symposium on Computer-Based Medical Systems (CBMS'06), 2006