loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
International Conference on Information Technology (ITNG'07)
Overfitting in protein name recognition on biomedical literature and method of preventing it through use of transductive SVM
Las Vegas, Nevada, USA
April 02-April 04
ISBN: 0-7695-2776-0
Masaki Murata, National Institute of Information and Communications Technology
Tomohiro Mitsumori, Nara Institute of Science and Technology
Kouichi Doi, Nara Institute of Science and Technology
Machine learning methods have recently been used in research on protein name recognition. A classifier trained in a specific domain, however, could be overfit and so inflexible that it could be used only in that domain. We therefore developed a new corpus about breast cancer and investigated the flexibility of classifier trained on the GENIA [14] corpus or the breast cancer corpus. To avoid overfitting we used the transductive support vector machine (SVM), and we evaluated the effect of transductive learning. We confirmed experimentally that the tranductive SVM prevented overfitting and yielded higher accuracies than the ordinary SVM did.
Citation:
Masaki Murata, Tomohiro Mitsumori, Kouichi Doi, "Overfitting in protein name recognition on biomedical literature and method of preventing it through use of transductive SVM," itng, pp.583-588, International Conference on Information Technology (ITNG'07), 2007
Usage of this product signifies your acceptance of the Terms of Use.