loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
IEEE Computer Society Bioinformatics Conference (CSB'03)
Towards Index-based Similarity Search for Protein Structure Databases
Stanford, California
August 11-August 14
ISBN: 0-7695-2000-6
Orhan ?amoglu, University of California, Santa Barbara
Tamer Kahveci, University of California, Santa Barbara
Ambuj K. Singh, University of California, Santa Barbara
We propose two methods for finding similarities in protein structure databases. Our techniques extract feature vectors on triplets of SSEs (Secondary Structure Elements) of proteins. These feature vectors are then indexed using a multidimensional index structure. Our first technique considers the problem of finding proteins similar to a given query protein in a protein dataset. This technique quickly finds promising proteins using the index structure. These proteins are then aligned to the query protein using a popular pairwise alignment tool such as VAST. We also develop a novel statistical model to estimate the goodness of a match using the SSEs. Our second technique considers the problem of joining two protein datasets to find an all-to-all similarity. Experimental results show that our techniques improve the pruning time of VAST 3 to 3.5 times while keeping the sensitivity similar.
Index Terms:
Protein structures, feature vectors, indexing, dataset join
Citation:
Orhan ?amoglu, Tamer Kahveci, Ambuj K. Singh, "Towards Index-based Similarity Search for Protein Structure Databases," csb, pp.148, IEEE Computer Society Bioinformatics Conference (CSB'03), 2003
Usage of this product signifies your acceptance of the Terms of Use.