2015 International Conference on Big Data and Smart Computing (BigComp) (2015)
Jeju, South Korea
Feb. 9, 2015 to Feb. 11, 2015
ISBN: 978-1-4799-7303-3
pp: 251-253
Karin M. Verspoor, Department of Computing and Information Systems, The University of Melbourne, Melbourne, VIC 3010, Australia
The biomedical literature captures the most current biomedical knowledge and is a tremendously rich resource for research. However, with over 24 million publications currently indexed in the US National Library of Medicine's PubMed index, it is becoming increasingly challenging for biomedical researchers to keep up with this literature, and automated strategies for extracting information from it are required. Large-scale processing of the literature enables direct biomedical knowledge discovery. This paper introduces the use of text mining techniques to support analysis of biological data sets, specifically discussing literature-supported applications in protein function prediction and in the analysis of genetic variants. A review of the work suggests that the most promising future directions are methods that integrate simple text analysis with more targeted relation extraction and methods that combine literature-derived information with complementary biological data.
Proteins, Protein engineering, Text mining, Diseases, Bioinformatics, Feature extraction

K. M. Verspoor, "Drawing on millions of biomedical journal publications to do predictive biology," 2015 International Conference on Big Data and Smart Computing (BigComp), Jeju, South Korea, 2015, pp. 251-253.