Publication 2007 Issue No. 1 - January-March Abstract - Distorted Metrics on Trees and Phylogenetic Forests
We study distorted metrics on binary trees in the context of phylogenetic reconstruction. Given a binary tree T on n leaves with a path metric d, consider the pairwise distances {d(u,v)} between leaves. It is well known that these determine the tree and the d length of all edges. Here, we consider distortions $\hat{d}$ of d such that, for all leaves u and v, it holds that $|d(u,v) - \hat{d}(u,v)|< f/2$ if either d(u,v)< M + f/2 or $\hat{d}(u,v)< M + f/2$, where d satisfies f ≤ d(e) ≤ g for all edges e. Given such distortions, we show how to reconstruct in polynomial time a forest T1, ... ,Tα such that the true tree T may be obtained from that forest by adding α-1 edges and α-1 ≤ 2-Ω(M/g) n. Our distorted metric result implies a reconstruction algorithm of phylogenetic forests with a small number of trees from sequences of length logarithmic in the number of species. The reconstruction algorithm is applicable for the general Markov model. Both the distorted metric result and its applications to phylogeny are almost tight.