Subscribe
Issue No.04 - July/August (2011 vol.8)
pp: 890-901
John Wiedenhoeft , Max Planck Institute for Molecular Genetics and Free University of Berlin, Berlin
Roland Krause , Max Planck Institute for Molecular Genetics and Free University of Berlin, Berlin
Oliver Eulenstein , Iowa State University, Ames
ABSTRACT
Interactions of protein domains control essential cellular processes. Thus, inferring the evolutionary histories of multidomain proteins in the context of their families can provide rewarding insights into protein function. However, methods to infer these histories are challenged by the complexity of macroevolutionary events. Here, we address this challenge by describing an algorithm that computes a novel network-like structure, called plexus, which represents the evolution of domains and their combinations. Finally, we demonstrate the performance of this algorithm with empirical data sets.
INDEX TERMS
Proteins, domains, plexus, graphs, phylogeny.
CITATION
John Wiedenhoeft, Roland Krause, Oliver Eulenstein, "The Plexus Model for the Inference of Ancestral Multidomain Proteins", IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol.8, no. 4, pp. 890-901, July/August 2011, doi:10.1109/TCBB.2011.22
REFERENCES
[1] N. Song, J.M. Joseph, G.B. Davis, and D. Durand, “Sequence Similarity Network Reveals Common Ancestry of Multidomain Proteins,” PLoS Computational Biology, vol. 4, no. 5, 2008.
[2] R.F. Doolittle, “The Multiplicity of Domains in Proteins,” Ann. Rev. Biochemistry, vol. 54, pp. 287-314, 1995.
[3] E.V. Koonin, S.F. Altschul, and P. Bork, “BRCA1 Protein Products... Functional Motifs...,” Nature Genetics, vol. 13, no. 3, pp. 266-288, 1996.
[4] D. Ekman, Å.K. Björklund, J. Frey-Skött, and A. Elofsson, “Multi-Domain Proteins in the Three Kingdoms of Life: Orphan Domains and Other Unassigned Regions,” J. Molecular Biology, vol. 348, no. 1, pp. 231-243, Apr. 2005.
[5] M.K. Basu, L. Carmel, I.B. Rogozin, and E.V. Koonin, “Evolution of Protein Domain Promiscuity in Eukaryotes,” Genome Research, vol. 18, pp. 449-461, 2008.
[6] G. Apic, J. Gough, and S.A. Teichmann, “An Insight into Domain Combinations,” Bioinformatics, vol. 17, Suppl 1, pp. S83-S89, 2001.
[7] S.K. Kummerfeld and S.A. Teichmann, “Relative Rates of Gene Fusion and Fission in Multi-Domain Proteins,” Trends in Genetics, vol. 21, pp. 25-30, 2005.
[8] C.W. Duin, “Two Fast Algorithms for All-Pairs Shortest Paths,” Computers and Operations Research, vol. 34, no. 9, pp. 2824-2839, 2007.
[9] S. Yang, R.F. Doolittle, and P.E. Bourne, “Phylogeny Determined by Protein Domain Content,” Proc. Nat'l Academy of Sciences USA, vol. 102, no. 2, pp. 373-378, 2005.
[10] Å.K. Björklund, D. Ekman, S. Light, J. Frey-Skött, and A. Elofsson, “Domain Rearrangements in Protein Evolution,” J. Molecular Biology, vol. 353, no. 4, pp. 911-923, Nov. 2005.
[11] T. Przytycka, G. Davis, N. Song, and D. Durand, “Graph Theoretical Insights into Evolution of Multidomain Proteins,” J. Computational Biology, vol. 13, no. 2, pp. 351-363, 2006.
[12] J.H. Fong, L.Y. Geer, A.R. Panchenko, and S.H. Bryant, “Modeling the Evolution of Protein Domain Architectures Using Maximum Parsimony,” J. Molecular Biology, vol. 366, no. 1, pp. 307-315, Feb. 2007.
[13] F.D. Ciccarelli, C. von Mering, M. Suyama, E.D. Harrington, E. Izaurralde, and P. Bork, “Complex Genomic Rearrangements Lead to Novel Primate Gene Function,” Genome Research, vol. 15, no. 3, pp. 343-351, 2005.
[14] J.I. Lucas, V. Arnau, and I. Marín, “Comparative Genomics and Protein Domain Graph Analyses Link Ubiquitination and RNA Metabolism,” J. Molecular Biology, vol. 357, no. 1, pp. 9-17, 2006.
[15] B. Behzadi and M. Vingron, “Reconstructing Domain Compositions of Ancestral Multi-Domain Proteins,” Comparative Genomics, pp. 1-10, Springer, 2006.
[16] D.H. Huson and D. Bryant, “Application of Phylogenetic Networks in Evolutionary Studies,” Molecular Biology and Evolution, vol. 23, no. 2, pp. 254-267, 2006.
[17] B.M.E. Moret, L. Nakhleh, T. Warnow, C.R. Linder, A. Tholse, A. Padolina, J. Sun, and R. Timme, “Phylogenetic Networks: Modeling, Reconstructibility, and Accuracy,” IEEE/ACM Trans. Computational Biology and Bioinformatics, vol. 1, no. 1, pp. 1-12, Jan.-Mar. 2004.
[18] A.D. Moore, Å.K. Björklund, D. Ekman, E. Bornberg-Bauer, and A. Elofsson, “Arrangements in the Modular Evolution of Proteins,” Trends in Biochemical Sciences, vol. 33, no. 9, pp. 444-451, 2008.
[19] B. Snel, P. Bork, and M. Huynen, “Genome Evolution: Gene Fusion versus Gene Fission,” Trends in Genetics, vol. 16, no. 1, pp. 9-11, 2006.
[20] W. Wang, H. Yu, and M. Long, “Duplication-Degeneration as a Mechanism of Gene Fission and the Origin of New Genes in Drosophila Species,” Nature Genetics, vol. 36, no. 5, pp. 523-527, May 2004.
[21] J. Wiedenhoeft, “Phylogenetic Reconstruction of Ancestral Multidomain Proteins,” BSc thesis, Freie Universität Berlin, http://genome.cs.iastate.edu/CBL/ISBRA10 thesis.zip, 2009.
[22] UniProt Consortium “The Universal Protein Resource (UniProt) in 2010,” Nucleic Acids Research, vol. 38, (Database issue), pp. D142-D148, 2010.
[23] K. Forslund, A. Henricson, V. Hollich, and E.L. Sonnhammer, “Domain Tree-Based Analysis of Protein Architecture Evolution,” Molecular Biology and Evolution, vol. 25, no. 2, pp. 254-264, 2008.
[24] R.D. Finn, J. Tate, J. Mistry, P.C. Coggill, S.J. Sammut, H.R. Hotz, G. Ceric, K. Forslund, S.R. Eddy, E.L.L. Sonnhammer, and A. Bateman, “The Pfam Protein Families Database,” Nucleic Acids Research, vol. 36, Database issue, pp. D281-D288, Jan. 2008.
[25] M. Marija Buljan, A. Frankish, and A. Bateman, “Quantifying the Mechanisms of Domain Gain in Animal Proteins,” Genome Biology, vol. 11, p. R74, 2010.
[26] S. Guindon and O. Gascuel, “A Simple, Fast, and Accurate Algorithm to Estimate Large Phylogenies by Maximum Likelihood,” Systematic Biology, vol. 52, no. 5, pp. 696-704, Oct. 2003.
[27] D. Durand, B.V. Halldorsson, and B. Vernot, “A Hybrid Micro-Macroevolutionary Approach to Gene Tree Reconstruction,” J. Computational Biology, vol. 13, no. 2, pp. 320-335, 2006.
[28] M. Boxem et al., “A Protein Domain-Based Interactome Network for C. elegans Early Embryogenesis,” Cell, vol. 134, no. 3, pp. 534-545, 2008.
[29] T. Takeuchi, Y. Watanabe, T. Takano-Shimizu, and S. Kondo, “Roles of Jumonji and Jumonji Family Genes in Chromatin Regulation and Development,” Developmental Dynamics, vol. 235, no. 9, pp. 2449-2459, 2006.
[30] J. Jin et al., “Eukaryotic Protein Domains as Functional Units of Cellular Evolution,” Science Signaling, vol. 98, no. 2, p. ra76, 2009.