International Conference on Information Technology (ITNG'07) Overfitting in protein name recognition on biomedical literature and method of preventing it through use of transductive SVM Las Vegas, Nevada, USA April 02-April 04 ISBN: 0-7695-2776-0
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ITNG.2007.145
Machine learning methods have recently been used in research on protein name recognition. A classifier trained in a specific domain, however, could be overfit and so inflexible that it could be used only in that domain. We therefore developed a new corpus about breast cancer and investigated the flexibility of classifier trained on the GENIA [14] corpus or the breast cancer corpus. To avoid overfitting we used the transductive support vector machine (SVM), and we evaluated the effect of transductive learning. We confirmed experimentally that the tranductive SVM prevented overfitting and yielded higher accuracies than the ordinary SVM did.
Citation:
Masaki Murata, Tomohiro Mitsumori, Kouichi Doi, "Overfitting in protein name recognition on biomedical literature and method of preventing it through use of transductive SVM," itng, pp.583-588, International Conference on Information Technology (ITNG'07), 2007 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||