|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
Text Categorization of Biomedical Data Sets Using Graph Kernels and a Controlled Vocabulary
PrePrint
ISSN: 1545-5963
| ASCII Text | x | ||
| Said Bleik, Meenakshi Mishra, Jun Huan, Min Song, "Text Categorization of Biomedical Data Sets Using Graph Kernels and a Controlled Vocabulary," IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 99, no. 1, pp. 1, , 5555. | |||
| BibTex | x | ||
| @article{ 10.1109/TCBB.2013.16, author = {Said Bleik and Meenakshi Mishra and Jun Huan and Min Song}, title = {Text Categorization of Biomedical Data Sets Using Graph Kernels and a Controlled Vocabulary}, journal ={IEEE/ACM Transactions on Computational Biology and Bioinformatics}, volume = {99}, number = {1}, issn = {1545-5963}, year = {5555}, pages = {1}, doi = {http://doi.ieeecomputersociety.org/10.1109/TCBB.2013.16}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - JOUR JO - IEEE/ACM Transactions on Computational Biology and Bioinformatics TI - Text Categorization of Biomedical Data Sets Using Graph Kernels and a Controlled Vocabulary IS - 1 SN - 1545-5963 SP EP EPD - 1 A1 - Said Bleik, A1 - Meenakshi Mishra, A1 - Jun Huan, A1 - Min Song, PY - 5555 KW - Classifier design and evaluation KW - Information Technology and Systems KW - Database Management KW - Database Applications KW - Mining methods and algorithms KW - Modeling structured KW - textual and multimedia data KW - Information Technology and Systems KW - Text mining KW - Computing Methodologies KW - Pattern Recognition KW - Design Methodology VL - 99 JA - IEEE/ACM Transactions on Computational Biology and Bioinformatics ER - | |||
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TCBB.2013.16
Recently, graph representations of text have been showing improved performance over conventional bag-of-words representations in text categorization applications. In this paper we present a graph-based representation for biomedical articles and use graph kernels to classify those articles into high level categories. In our representation, common biomedical concepts and semantic relationships are identified with the help of an existing ontology and are used to build a rich graph structure that provides a consistent feature set and preserves additional semantic information that could improve a classifier's performance. We attempt to classify the graphs using both a set-based graph kernel that is capable of dealing with the disconnected nature of the graphs and a simple linear kernel. Finally, we report the results comparing the classification performance of the kernel classifiers to common text-based classifiers.
Index Terms:
Classifier design and evaluation,Information Technology and Systems,Database Management,Database Applications,Mining methods and algorithms,Modeling structured,textual and multimedia data,Information Technology and Systems,Text mining,Computing Methodologies,Pattern Recognition,Design Methodology
Citation:
Said Bleik, Meenakshi Mishra, Jun Huan, Min Song, "Text Categorization of Biomedical Data Sets Using Graph Kernels and a Controlled Vocabulary," IEEE/ACM Transactions on Computational Biology and Bioinformatics, 08 March 2013. IEEE computer Society Digital Library. IEEE Computer Society, <http://doi.ieeecomputersociety.org/10.1109/TCBB.2013.16>
Usage of this product signifies your acceptance of the Terms of Use.

