This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
2005 IEEE Computational Systems Bioinformatics Conference (CSB'05)
Motif Extraction and Protein Classification
Stanford, California
August 08-August 11
ISBN: 0-7695-2344-7
Vered Kunik, Tel Aviv University
Zach Solan, Tel Aviv University
Shimon Edelman, Cornell University
Eytan Ruppin, Tel Aviv University
David Horn, Tel Aviv University
We present a novel unsupervised method for extracting meaningful motifs from biological sequence data. This de novo motif extraction (MEX) algorithm is data driven, finding motifs that are not necessarily over-represented in the data. Applying MEX to the oxidoreductases class of enzymes, containing approximately 7000 enzyme sequences, a relatively small set of motifs is obtained. This set spans a motif-space that is used for functional classification of the enzymes by an SVM classifier. The classification based on MEX motifs surpasses that of two other SVM based methods: SVMProt, a method based on the analysis of physical-chemical properties of a protein generated from its sequence of amino acids, and SVM applied to a Smith-Waterman distances matrix. Our findings demonstrate that the MEX algorithm extracts relevant motifs, supporting a successful sequence-to-function classification.
Index Terms:
motif extraction, enzyme classification
Citation:
Vered Kunik, Zach Solan, Shimon Edelman, Eytan Ruppin, David Horn, "Motif Extraction and Protein Classification," csb, pp.80-85, 2005 IEEE Computational Systems Bioinformatics Conference (CSB'05), 2005
Usage of this product signifies your acceptance of the Terms of Use.