Predicting Ligand Binding Residues and Functional Sites Using Multipositional Correlations with Graph Theoretic Clustering and Kernel CCA
Issue No. 04 - July-Aug. (2012 vol. 9)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TCBB.2011.136
Li Liao, Comput. & Inf. Sci. Dept., Univ. of Delaware, Newark, DE, USA
A. J. Gonzalez, Comput. & Inf. Sci. Dept., Univ. of Delaware, Newark, DE, USA
C. H. Wu, Center for Bioinf. & Comput. Biol., Univ. of Delaware, Newark, DE, USA
We present a new computational method for predicting ligand binding residues and functional sites in protein sequences. These residues and sites tend not only to be conserved but also to exhibit strong correlation, because selection pressure during evolution acts to maintain the required structure and/or function. To explore the effect of correlations among multiple positions in the sequences, the method combines graph theoretic clustering with kernel-based canonical correlation analysis (kCCA). Binding and functional sites are identified as the residues that exhibit strong correlation between their evolutionary characterization at those sites and the structure-based functional classification of the proteins within a functional family. Tests on two well-curated data sets show that prediction accuracy, as measured by Receiver Operating Characteristic (ROC) scores, improves significantly when multipositional correlations are accounted for.
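The idea of grouping mutually correlated alignment positions with graph theoretic clustering can be illustrated with a minimal sketch. It is not the authors' implementation, and everything in it is an assumption made for illustration: mutual information stands in for the correlation measure, the threshold is arbitrary, maximal cliques are enumerated with the basic Bron-Kerbosch recursion, and the toy alignment and all function names are hypothetical. The kCCA step is not shown.

```python
from collections import Counter
from itertools import combinations
from math import log2


def mutual_information(col_a, col_b):
    """Mutual information (in bits) between two alignment columns."""
    n = len(col_a)
    pa, pb = Counter(col_a), Counter(col_b)
    pab = Counter(zip(col_a, col_b))
    return sum(
        (c / n) * log2((c / n) / ((pa[a] / n) * (pb[b] / n)))
        for (a, b), c in pab.items()
    )


def correlation_graph(alignment, threshold):
    """Link every pair of columns whose mutual information reaches the threshold."""
    columns = list(zip(*alignment))
    graph = {i: set() for i in range(len(columns))}
    for i, j in combinations(range(len(columns)), 2):
        if mutual_information(columns[i], columns[j]) >= threshold:
            graph[i].add(j)
            graph[j].add(i)
    return graph


def maximal_cliques(graph):
    """Enumerate maximal cliques with the basic Bron-Kerbosch recursion."""
    cliques = []

    def expand(r, p, x):
        if not p and not x:
            cliques.append(sorted(r))
            return
        for v in list(p):
            expand(r | {v}, p & graph[v], x & graph[v])
            p = p - {v}
            x = x | {v}

    expand(set(), set(graph), set())
    return cliques


# Hypothetical toy alignment: columns 0-2 co-vary, column 3 is fully conserved.
alignment = ["ACDA", "ACDA", "GTEA", "GTEA"]
graph = correlation_graph(alignment, threshold=0.5)  # assumed cutoff
groups = [c for c in maximal_cliques(graph) if len(c) >= 2]
print(groups)  # [[0, 1, 2]]
```

In this toy case the fully conserved column carries no mutual information with any other column, so it stays isolated, while the three co-varying columns form a single clique. A pairwise measure alone would report three separate pairs; the clique groups them into one multipositional unit.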
proteins, biology computing, evolutionary computation, graph theory, molecular biophysics, molecular configurations, receiver operating characteristic score, ligand binding residues, multipositional correlations, graph theoretic clustering, kernel-based canonical correlation analysis, computational method, protein sequences, evolution, structure-based functional classification, Correlation, Proteins, Kernel, Amino acids, Bioinformatics, Eigenvalues and eigenfunctions, Computational biology, cliques, Functional residues, specificity determining positions, multiple sequence alignments, kernel canonical correlation analysis
Li Liao, A. J. Gonzalez and C. H. Wu, "Predicting Ligand Binding Residues and Functional Sites Using Multipositional Correlations with Graph Theoretic Clustering and Kernel CCA," in IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 9, no. 4, pp. 992-1001, 2012.