Proceedings of the 38th Annual Hawaii International Conference on System Sciences (HICSS'05) - Track 9
Big Island, Hawaii
January 03-January 06
ISBN: 0-7695-2268-8
Jingyi Yang, University of Nebraska - Lincoln
Jitender S. Deogun, University of Nebraska - Lincoln
Zhaohui Sun, University of Nebraska - Lincoln
Protein sequence motifs are short conserved subsequences common to related protein sequences. Extracting sequence motifs from proteins can help classify protein families and predict protein functions, and can also provide valuable information about the evolution of species. However, automatic extraction of protein sequence motifs is not straightforward, because sequence motifs are often inexact and contain gaps. In this paper, we review currently available algorithms for protein sequence motif extraction and propose a novel scheme that extracts motifs allowing mismatches and gaps from unaligned protein sequences. The scheme is based on a probabilistic model, the Mismatch-allowed Probabilistic Suffix Tree (M-PST). An M-PST is first constructed from the unaligned protein sequences. The subsequences with the highest likelihood scores, which are over-represented patterns, are then discovered using the M-PST. These subsequences are probable sequence motifs and are output along with their position probability matrices.
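To make the idea of mismatch-tolerant motifs and position probability matrices concrete, here is a minimal Python sketch. It is not the M-PST described in the abstract: it assumes a much simpler stand-in that counts fixed-length windows within a Hamming-distance threshold by brute force, and it ignores gaps entirely. All function names (`occurrences`, `rank_patterns`, `position_probability_matrix`) and the toy sequences are hypothetical, chosen only for illustration.

```python
from collections import Counter


def hamming(a, b):
    """Number of positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))


def occurrences(pattern, sequences, max_mismatches=1):
    """Collect every window that matches `pattern` with at most
    `max_mismatches` substitutions (assumption: no gaps allowed)."""
    k = len(pattern)
    hits = []
    for seq in sequences:
        for i in range(len(seq) - k + 1):
            window = seq[i:i + k]
            if hamming(pattern, window) <= max_mismatches:
                hits.append(window)
    return hits


def rank_patterns(sequences, k, max_mismatches=1):
    """Rank all length-k windows by their mismatch-tolerant occurrence
    count, a crude proxy for the likelihood score in the abstract."""
    candidates = {seq[i:i + k] for seq in sequences
                  for i in range(len(seq) - k + 1)}
    scored = [(p, len(occurrences(p, sequences, max_mismatches)))
              for p in candidates]
    # Highest count first; ties broken alphabetically for determinism.
    return sorted(scored, key=lambda item: (-item[1], item[0]))


def position_probability_matrix(hits):
    """For each motif position, the relative frequency of each residue
    among the matched windows."""
    k = len(hits[0])
    matrix = []
    for pos in range(k):
        counts = Counter(h[pos] for h in hits)
        total = sum(counts.values())
        matrix.append({aa: c / total for aa, c in counts.items()})
    return matrix


if __name__ == "__main__":
    # Toy unaligned sequences (made up for this example).
    seqs = ["MKACDEFL", "GGACDQFW", "ACDEFPPP"]
    pattern, count = rank_patterns(seqs, k=5)[0]
    hits = occurrences(pattern, seqs)
    print(pattern, count)
    for column in position_probability_matrix(hits):
        print(column)
```

The brute-force scan is quadratic in the number of windows, so it only suits small inputs; a suffix-tree-based model such as the one the paper proposes would presumably avoid re-scanning the sequences for every candidate.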
Citation:
Jingyi Yang, Jitender S. Deogun, Zhaohui Sun, "A New Scheme for Protein Sequence Motif Extraction," Proceedings of the 38th Annual Hawaii International Conference on System Sciences (HICSS'05) - Track 9, vol. 9, p. 280a, 2005