2005 IEEE Computational Systems Bioinformatics Conference - Workshops (CSBW'05)
Rule Clustering and Super-rule Generation for Transmembrane Segments Prediction
Stanford, California
August 08-August 11
ISBN: 0-7695-2442-7
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/CSBW.2005.121
Explaining a decision is important for the acceptance of machine learning technology in bioinformatics applications such as protein structure prediction. In earlier work, we combined SVMs with decision trees to extract rules for understanding transmembrane segment prediction. However, that approach yields about 20,000 rules, far too many to interpret. In this paper, we present a novel rule-clustering approach (SVM_DT_C) for super-rule generation. We use K-means clustering to group this large set of rules and generate many new super-rules. The experimental results show that the super-rules produced by SVM_DT_C can be analyzed manually by a researcher, and that these super-rules are not only new but also achieve very high transmembrane prediction accuracy (exceeding 95%) most of the time.
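The clustering step can be illustrated with a minimal sketch. The abstract does not say how rules are encoded or how a super-rule is derived from a cluster, so everything here is an assumption for illustration: each rule is represented as a fixed-length numeric vector (the toy `rules` list is made up), the function `kmeans` is a plain Lloyd's-algorithm implementation rather than the authors' code, and the cluster centroid stands in for a "super-rule".

```python
import random

def kmeans(points, k, iters=50, seed=0):
    """Plain Lloyd's K-means on equal-length numeric vectors (illustrative only)."""
    rng = random.Random(seed)
    # Initialise centroids from k distinct input points.
    centroids = [list(p) for p in rng.sample(points, k)]
    assign = None
    for _ in range(iters):
        # Assignment step: nearest centroid by squared Euclidean distance.
        new_assign = [
            min(range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centroids[c])))
            for p in points
        ]
        if new_assign == assign:
            break  # assignments stable: converged
        assign = new_assign
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, a in zip(points, assign) if a == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return assign, centroids

# Hypothetical rule vectors (made-up values, NOT data from the paper).
rules = [
    [0.90, 0.10, 0.80], [0.80, 0.20, 0.90], [0.85, 0.15, 0.85],
    [0.10, 0.90, 0.20], [0.20, 0.80, 0.10], [0.15, 0.85, 0.15],
]

labels, centroids = kmeans(rules, k=2)

# Assumption: treat each centroid as a candidate "super-rule" summarising its cluster.
super_rules = {c: centroids[c] for c in set(labels)}
for c, centre in super_rules.items():
    print(f"cluster {c}: {labels.count(c)} rules, centroid {centre}")
```

In this toy setting, six rules collapse into two centroids, which conveys why clustering can make a rule set of about 20,000 entries small enough for manual analysis. The choice of the number of clusters and the rule encoding would both need to follow the paper itself.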
Citation:
Jieyue He, Yisheng Dong, Bernard Chen, Hae-Jin Hu, Robert Harrison, Phang C. Tai, Yi Pan, "Rule Clustering and Super-rule Generation for Transmembrane Segments Prediction," csbw, pp. 224-227, 2005 IEEE Computational Systems Bioinformatics Conference - Workshops (CSBW'05), 2005