A method for finding protein folds consistent with secondary structure assignments and imposed experimental restraints is described. All possible matches between the query pattern and every member of a database of protein structural domains are generated by a comparison of secondary structure assignments. The comparison allows for errors in predicted secondary structure elements and possible variations between query and database structure. Several filters remove matches that are un-compact, that have poor /spl beta/ sheet bonding, that do not allow loop/turn lengths to bridge the distance between connected secondary structures, or that fail to satisfy imposed experimental restraints (e.g. disulphide bonds). The remaining matches provide a set of plausible topologies for a protein of unknown structure, which can be inspected visually or tested by experiment. A search using the src homology 2 domain prediction finds 13 possible topologies, one being a domain from the E. coli bio protein known to adopt an SH2 fold. The use and development of the method are discussed.
Index Terms:
proteins; database management systems; query processing; biology computing; chemistry computing; protein fold recognition; secondary structure assignments; experimental restraints; query pattern; database; protein structural domains; errors; predicted secondary structure elements; query; database structure; connected secondary structures; plausible topologies; E. coli bio protein; SH2 fold
Citation:
R.B. Russell, R.R. Copley, G.J. Barton, "Protein fold recognition from secondary structure assignments," hicss, pp.302, 28th Hawaii International Conference on System Sciences (HICSS'95), 1995