The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.04 - July-Aug. (2013 vol.10)
pp: 970-983
Andrei Todor , Dept. of Comput. & Inf. Sci. & Eng., Univ. of Florida, Gainesville, FL, USA
Alin Dobra , Dept. of Comput. & Inf. Sci. & Eng., Univ. of Florida, Gainesville, FL, USA
Tamer Kahveci , Dept. of Comput. & Inf. Sci. & Eng., Univ. of Florida, Gainesville, FL, USA
ABSTRACT
Biological interactions are often uncertain events, that may or may not take place with some probability. This uncertainty leads to a massive number of alternative interaction topologies for each such network. The existing studies analyze the degree distribution of biological networks by assuming that all the given interactions take place under all circumstances. This strong and often incorrect assumption can lead to misleading results. In this paper, we address this problem and develop a sound mathematical basis to characterize networks in the presence of uncertain interactions. Using our mathematical representation, we develop a method that can accurately describe the degree distribution of such networks. We also take one more step and extend our method to accurately compute the joint-degree distributions of node pairs connected by edges. The number of possible network topologies grows exponentially with the number of uncertain interactions. However, the mathematical model we develop allows us to compute these degree distributions in polynomial time in the number of interactions. Our method works quickly even for entire protein-protein interaction (PPI) networks. It also helps us find an adequate mathematical model using MLE. We perform a comparative study of node-degree and joint-degree distributions in two types of biological networks: the classical deterministic networks and the more flexible probabilistic networks. Our results confirm that power-law and log-normal models best describe degree distributions for both probabilistic and deterministic networks. Moreover, the inverse correlation of degrees of neighboring nodes shows that, in probabilistic networks, nodes with large number of interactions prefer to interact with those with small number of interactions more frequently than expected. We also show that probabilistic networks are more robust for node-degree distribution computation than the deterministic ones. Availability: all the data sets used, the software implemented and the alignments found in this paper are available at >http://bioinformatics.cise.ufl.edu/projects/probNet/.
INDEX TERMS
topology, biochemistry, molecular biophysics, polynomials, probability, proteins, node-degree distribution computation, probabilistic biological network topology, biological interactions, alternative interaction topologies, joint-degree distributions, node pairs, mathematical model, polynomial time, protein-protein interaction networks, PPI networks, biological networks, classical deterministic networks, flexible probabilistic networks, power-law models, log-normal models, deterministic networks, Probabilistic logic, Random variables, Maximum likelihood estimation, Joints, Network topology, Mathematical model, random graphs, Probabilistic biological networks, network topology, degree distribution
CITATION
Andrei Todor, Alin Dobra, Tamer Kahveci, "Characterizing the Topology of Probabilistic Biological Networks", IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol.10, no. 4, pp. 970-983, July-Aug. 2013, doi:10.1109/TCBB.2013.108
REFERENCES
[1] M. Arita, "Scale-Freeness and Biological Networks," J. Biochemistry, vol. 138, pp. 1-4, 2005.
[2] J.S. Bader, A. Chaudhuri, J. Rothberg, and J. Chant, "Gaining Confidence in High-Throughput Protein Interaction Networks," Nature Biotechnology, vol. 22, pp. 78-85, 2003.
[3] A.-L. Barabasi and R. Albert, "Emergence of Scaling in Random Networks," Science, vol. 286, pp. 509-512, 1999.
[4] A.L. Barabasi and Z.N. Oltvai, "Network Biology: Understanding the Cell's Functional Organization," Nature Rev., vol. 5, pp. 101-113, 2004.
[5] M. Barthelemy, A. Barrat, R. Pastor-Satorras, and A. Vespignani, "Velocity and Hierarchical Spread of Epidemic Outbreaks in Scale-Free Networks," Physical Rev. Letters, vol. 92, no. 17,article 178701, 2005.
[6] C.M. Bishop, Pattern Recognition and Machine Learning. Springer, 2006.
[7] D.S. Callaway, J.E. Hopcroft, J.M. Kleinberg, M.E.J. Newman, and S.H. Strogatz, "Are Randomly Grown Graphs Really Random?" Physical Rev. E, vol. 64, no. 4,article 041902, 2001.
[8] A. Chatraryamontri et al., "MINT: The Molecular Interaction Database," Nucleic Acids Research, vol. 513, pp. D572-D574, 2007.
[9] A. Clauset, R.S. Cosma, and M.E.J. Newman, "Power Law Distributions in Empirical Data," SIAM Rev., vol. 51, pp. 661-703, 2009.
[10] H. Cramer, Mathematical Methods of Statistics. Princeton Univ. Press, 1946.
[11] J.C. Doyle, D.L. Alderson, L. Li, S. Low, M. Roughan, S. Shalunov, R. Tanaka, and W. Willinger, "The 'Robust Yet Fragile' Nature of the Internet," Proc. Nat'l Academy of Science USA, vol. 102, pp. 14497-14502, 2005.
[12] P. Erdős and A. Rényi, "On Random Graphs, I," Publicationes Math., vol. 6, pp. 290-297, 1959.
[13] M. Girvan and M.E.J. Newman, "Community Structure in Social and Biological Networks," Proc. Nat'l Academy of Science USA, vol. 99, pp. 7821-7826, 2002.
[14] A. Gitter et al., "Discovering Pathways by Orienting Edges in Protein Interaction Networks," Nucleic Acids Research, vol. 39, article e22, 2010.
[15] M.A. Goldberg, An Introduction to Probability Theory with Statistical Applications. Plenum Press, 1984.
[16] M. Green and P. Karp, "A Bayesian Method for Identifying Missing Enzymes in Predicted Metabolic Pathway Databases," BMC Bioinformatics, vol. 5, article 76, 2004.
[17] J.-D.J. Han, D. Dupuy, N. Bertin, M.E. Cusick, and M. Vidal, "Effect of Sampling on Topology Predictions of Protein-Protein Interaction Networks," Nature Biotechnology, vol. 23, pp. 839-844, 2005.
[18] Y. Hu, I. Flockhart, A. Vinayagam, C. Bergwitz, B. Berger, N. Perrimon, and S.E. Mohr, "An Integrative Approach to Ortholog Prediction for Disease-Focused and Other Functional Studies," BMC Bioinformatics, vol. 12, article 357, 2011.
[19] H. Jeong, S. Mason, A-L Barabasi, and Z.N. Oltvai, "Lethality and Centrality in Protein Networks," Nature, vol. 411, pp. 41-42, 2001.
[20] H. Jeong, S. Tombor, A-L Barabasi, and Z.N. Oltvai, "The Large-Scale Organization of Metabolic Networks," Nature, vol. 407, pp. 651-654, 2000.
[21] T. Kailath, "The Divergence and Bhattacharyya Distance Measures in Signal Selection," IEEE Trans. Comm. Technology, vol. CT-15, no. 1, pp. 52-60, Feb. 1967.
[22] R. Khanin and E. Wit, "How Scale-Free Are Biological Networks," J. Computational Biology, vol. 13, pp. 810-818, 2006.
[23] P.L. Krapivsky and S. Redner, "Organization of Growing Random Networks," Physical Rev., Letters, vol. 63, no. 6,article 066123, 2001.
[24] D.-S. Lee, J. Park, K.A. Kay, N.A. Christakis, Z.N. Oltvai, and A.-L. Barabasi, "The Implications of Human Metabolic Network Topology for Disease Comorbidity," Proc. Nat'l Academy of Sciences USA, vol. 105, pp. 9880-9885, 2008.
[25] L.S. Li, D. Alderson, R. Tanaka, J.C. Doyle, and W. Willinger, "Toward a Theory of Scale-Free Graphs: Definition Properties and Implications," Internet Math., vol. 2, no. 4, pp. 431-523, 2005.
[26] M.E.J. Newman, "Assortative Mixing in Networks," Physical Rev. Letters, vol. 89, no. 20,article, 208701, 2002.
[27] M.E.J. Newman, S.H. Strogatz, and D.J. Watts, "Random Graphs with Arbitrary Degree Distributions and Their Applications," Physical Rev. E., vol. 64, no. 2,article 026118, 2001.
[28] M.E.J. Newman and D.J. Watts, "Renormalization Group Analysis of the Small-World Network Model," Physics Letters A, vol. 263, nos. 4-6, pp. 341-346, 1999.
[29] H. Ogata, W. Fujibuchi, S. Goto, and K. Minoru, "A Heuristic Graph Comparison Algorithm and Its Application to Detect Functionally Related Enzyme Clusters," Nucleic Acids Research, vol. 28, pp. 4021-4028, 2000.
[30] O. Ourfali et al., "SPINE: A Framework for Signaling-Regulatory Pathway Inference from Cause-Effect Experiments," Bioinformatics, vol. 23, pp. i359-i366, 2007.
[31] N. Przulj, "Biological Network Comparison Using Graphlet Degree Distribution," Bioinformatics, vol. 23, pp. e177-e183, 2007.
[32] N. Przulj, D.G. Corneil, and I. Jurisica, "Modeling Interactome: Scale-Free or Geometric?" Bioinformatics, vol. 20, pp. 3508-3515, 2004.
[33] E. Ravasz, A.L. Somera, D.A. Mongru, A-L Barabasi, and Z.N. Oltvai, "Hierarchical Organization of Modularity in Metabolic Networks," Science, vol. 297, pp. 1551-1555, 2002.
[34] S. Ross, A First Course in Probability. Prentice Hall, 2009.
[35] P. Sridhar, T. Kahveci, and S. Ranka, "An Iterative Algorithm for Metabolic Network-Based Drug Target Identification," Proc. Pacific Symp. Biocomputing, 2007.
[36] M.P.H. Stumpf and P.J. Ingram, "Probability Models for Degree Distributions of Protein Interactions Networks," Europhysics Letters, 2005.
[37] M.P.H. Stumpf, C. Wiuf, and R.M. May, "Subnets of Scale-Free Networks Are Not Scale-Free: Sampling Properties of Networks," Proc. Nat'l Academy of Sciences USA, vol. 102, pp. 4221-4224, 2005.
[38] D. Szklarczyk, A. Franceschini, M. Kuhn, M. Simonovic, A. Roth, P. Minguez, T. Doerks, M. Stark, J. Muller, P. Bork, L.J. Jensen, and C.V. Mering, "The STRING Database in 2011: Functional Interaction Networks of Proteins, Globally Integrated and Scored," Nucleic Acids Research, vol. 39, pp. D561-D568, 2011.
[39] R. Tanaka, "Scale-Rich Metabolic Networks," Physical Rev. Letters, vol. 94, no. 16,article 168101, 2005.
[40] E.O. Voit, Computational Analysis of Biochemical Systems: A Practical Guide for Biochemists and Molecular Biologists. Cambridge Univ. Press, 2000.
[41] N. Watanabe, M.M. Cherney, M.J. van Belkum, S.L. Markus, M.D. Flegel, M.D. Clay, M.K. Deyholos, J.C. Vederas, and M.N. James, "Crystal Structure of LL-Diaminopimelate Aminotransferase from Arabidopsis thaliana: A Recently Discovered Enzyme in the Biosynthesis of L-Lysine by Plants and Chlamydia," J. Molecular Biology, vol. 371, pp. 685-702, 2007.
[42] D.J. Watts and S.H. Strogatz, "Collective Dynamics of Small-World Networks," Letters to Nature, vol. 393, pp. 409-410, 1998.
[43] Y.I. Wolf, G. Karev, and E. Koonin, "Scale-Free Networks in Biology: New Insights into the Fundamentals of Evolution?" Bioessays, vol. 24, pp. 105-109, 2002.
[44] S. Wuchty, "Scale-Free Behavior in Protein Domain Networks," Molecular Biology and Evolution, vol. 18, pp. 1694-1702, 2001.
[45] S. Yerel and A. Konuk, "Bivariate Lognormal Distribution Model of Cutoff Grade Impurities: A Case Study of Magnesite Ore Deposits," Scientific Research and Essay, 2009.
[46] H. Yu, X. Zhu, D. Greenbaum, J. Karro, and M. Gerstein, "TopNet: A Tool for Comparing Biological Sub-Networks Correlating Protein Properties with Topological Statistics," Nucleic Acids Research, vol. 32, pp. 328-337, 2004.
[47] K. Zhu et al., "BMC: An Efficient Method to Evaluate the Probabilistic Reachability Queries," Proc. 16th Int'l Conf. Database Systems for Advanced Applications, 2011.
70 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool