Issue No. 03 - May/June (2011 vol. 8)
ISSN: 1545-5963
pp: 607-620
Yi Pan, Georgia State University, Atlanta
Min Li, Central South University, Changsha
Jianer Chen, Texas A&M University, College Station
Jianxin Wang, Central South University, Changsha
ABSTRACT
With advances in technologies for predicting protein interactions, huge data sets portrayed as networks have become available. Identifying functional modules in such networks is crucial for understanding the principles of cellular organization and function. However, protein interaction data produced by high-throughput experiments generally contain a high rate of false positives, which makes it difficult to identify functional modules accurately. In this paper, we propose a fast hierarchical clustering algorithm, HC-PIN, based on a local metric, the edge clustering value, which can be used in both unweighted and weighted networks. HC-PIN is applied to the yeast protein interaction network, and the identified modules are validated against all three types of Gene Ontology (GO) terms: Biological Process, Molecular Function, and Cellular Component. The experimental results show that HC-PIN is not only robust to false positives but can also discover functional modules with low density. The identified modules are statistically significant with respect to all three types of GO annotations. Moreover, by varying its parameter, HC-PIN can uncover the hierarchical organization of functional modules, which corresponds approximately to the hierarchical structure of GO annotations. Compared with previous competing algorithms, HC-PIN is faster and more accurate.
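The abstract names a local edge metric but does not give its formula. As a rough illustration only, the sketch below scores each edge of an unweighted network by how much the closed neighborhoods of its two endpoints overlap, then ranks edges by that score, which is the kind of ordering a hierarchical merge procedure could consume. The formula, the function names (build_adjacency, edge_clustering_value), and the toy network are assumptions made for this example and are not taken from the abstract.

```python
def build_adjacency(edges):
    """Build an undirected adjacency map {node: set(neighbors)} from an edge list."""
    adj = {}
    for a, b in edges:
        adj.setdefault(a, set()).add(b)
        adj.setdefault(b, set()).add(a)
    return adj


def edge_clustering_value(adj, u, v):
    """Assumed overlap score for edge (u, v) in an unweighted network.

    Each neighborhood is taken to include the node itself, so two
    adjacent nodes always share at least {u, v}.
    """
    n_u = adj[u] | {u}
    n_v = adj[v] | {v}
    shared = n_u & n_v
    return len(shared) ** 2 / (len(n_u) * len(n_v))


# Hypothetical toy network: a triangle A-B-C with a pendant node D on C.
edges = [("A", "B"), ("A", "C"), ("B", "C"), ("C", "D")]
adj = build_adjacency(edges)

# Score every edge, then rank from most to least tightly embedded.
scores = {(a, b): edge_clustering_value(adj, a, b) for a, b in edges}
ranked = sorted(scores, key=scores.get, reverse=True)

for edge in ranked:
    print(edge, round(scores[edge], 3))
```

In this toy network the edge inside the triangle that does not touch the pendant node scores highest and the pendant edge scores lowest, so a procedure that processes edges in descending order would group the triangle before attaching D.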
INDEX TERMS
Protein interaction network, functional module, hierarchical clustering algorithm, Gene Ontology.
CITATION
Yi Pan, Min Li, Jianer Chen, Jianxin Wang, "A Fast Hierarchical Clustering Algorithm for Functional Modules Discovery in Protein Interaction Networks", IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 8, no. 3, pp. 607-620, May/June 2011, doi:10.1109/TCBB.2010.75