Issue No.05 - Sept.-Oct. (2013 vol.10)
pp: 1253-1262
Noah A. Rosenberg, Stanford University, Stanford
ABSTRACT
A coalescent history is an assignment of branches of a gene tree to branches of a species tree on which coalescences in the gene tree occur. The number of coalescent histories for a pair consisting of a labeled gene tree topology and a labeled species tree topology is important in gene tree probability computations, and more generally, in studying evolutionary possibilities for gene trees on species trees. Defining the $T_r$-caterpillar-like family as a sequence of $n$-taxon trees constructed by replacing the $r$-taxon subtree of $n$-taxon caterpillars by a specific $r$-taxon labeled topology $T_r$, we examine the number of coalescent histories for caterpillar-like families with matching gene tree and species tree labeled topologies. For each $T_r$ with size $r \le 8$, we compute the number of coalescent histories for $n$-taxon trees in the $T_r$-caterpillar-like family. Next, as $n \rightarrow \infty$, we find that the limiting ratio of the numbers of coalescent histories for the $T_r$ family and caterpillars themselves is correlated with the number of labeled histories for $T_r$. The results support a view that large numbers of coalescent histories occur when a tree has both a relatively balanced subtree and a high tree depth, contributing to deeper understanding of the combinatorics of gene trees and species trees.
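The counting problem described in the abstract can be illustrated with a small sketch. It assumes the gene tree and species tree have the same labeled topology, and it uses a recursion over the two root subtrees of the kind used in the coalescent-histories literature; the abstract itself does not state which method the paper uses. The nested-tuple tree encoding and the names `histories` and `caterpillar_like` are hypothetical, chosen only for this illustration.

```python
from functools import lru_cache
from math import comb


@lru_cache(maxsize=None)
def histories(tree, k=1):
    """Count coalescent histories for matching gene and species trees.

    `tree` is a leaf label (str) or a pair (left, right) of subtrees.
    `k` is the number of species-tree branches available to the root
    coalescence (k = 1 gives the count for the tree itself).
    Assumed recursion: A(T, k) = sum_{m=1..k} A(T_L, m+1) * A(T_R, m+1),
    with A(leaf, k) = 1.
    """
    if not isinstance(tree, tuple):
        return 1
    left, right = tree
    return sum(
        histories(left, m + 1) * histories(right, m + 1)
        for m in range(1, k + 1)
    )


def caterpillar_like(seed, extra_leaves):
    """Attach `extra_leaves` single leaves, one at a time, above `seed`.

    With seed = T_r this builds the (r + extra_leaves)-taxon member of the
    T_r-caterpillar-like family; with a 2-taxon seed it builds a caterpillar.
    """
    tree = seed
    for i in range(extra_leaves):
        tree = (tree, f"x{i}")
    return tree


def catalan(j):
    """The j-th Catalan number."""
    return comb(2 * j, j) // (j + 1)


if __name__ == "__main__":
    cherry = ("a", "b")
    balanced4 = (("a", "b"), ("c", "d"))
    for n in range(4, 11):
        cat = histories(caterpillar_like(cherry, n - 2))
        fam = histories(caterpillar_like(balanced4, n - 4))
        print(n, cat, fam, fam / cat)
```

Under this recursion, the $n$-taxon caterpillar count agrees with the Catalan number $C_{n-1}$ for the small cases checked, which gives a quick sanity test. The printed ratio is the finite-$n$ analogue of the limiting ratio discussed in the abstract; the sketch makes no claim about its limit.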
INDEX TERMS
Genetics, Network topology, Shape analysis, Polynomials, Bioinformatics, Computational biology, phylogenetics, Combinatorial identities, labeled histories, labeled topologies, lineage sorting
CITATION
Noah A. Rosenberg, "Coalescent Histories for Caterpillar-Like Families", IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol.10, no. 5, pp. 1253-1262, Sept.-Oct. 2013, doi:10.1109/TCBB.2013.123