loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
The Impact of Multiple Protein Sequence Alignment on Phylogenetic Estimation
PrePrint
ISSN: 1545-5963
Li-San Wang, University of Pennsylvania, Philadelphia
Jim Leebens-Mack, University of Georgia, Athens
P. Kerr Wall, The Huck Institutes of Life Sciences and Pennsylvania State University
Kevin Beckmann, The Huck Institutes of Life Sciences and Pennsylvania State University
Claude W. dePamphilis, The Huck Institutes of Life Sciences and Pennsylvania State University
Tandy Warnow, University of Texas at Austin, Austin
Multiple sequence alignment is typically the first step in estimating phylogenetic trees, with the assumption being that as alignments improve, so will phylogenetic reconstructions. Over the last decade or so, new multiple sequence alignment methods have been developed to improve comparative analyses of protein structure, but these new methods have not been typically used in phylogenetic analyses. In this paper, we report on a simulation study that we performed to evaluate the consequences of using these new multiple sequence alignment methods in terms of the resultant phylogenetic reconstruction. We find that while alignment accuracy is positively correlated with phylogenetic accuracy, the amount of improvement in phylogenetic estimation that results from an improved alignment can range from quite small to substantial. We observe that phylogenetic accuracy is most highly correlated with alignment accuracy when sequences are most difficult to align, and that variation in alignment accuracy can have little impact on phylogenetic accuracy when alignment error rates are generally low. We discuss these observations and implications for future work.
Index Terms:
Simulation, biology and genetics, multiple protein sequence alignment, phylogeny reconstruction
Citation:
Li-San Wang, Jim Leebens-Mack, P. Kerr Wall, Kevin Beckmann, Claude W. dePamphilis, Tandy Warnow, "The Impact of Multiple Protein Sequence Alignment on Phylogenetic Estimation," IEEE/ACM Transactions on Computational Biology and Bioinformatics, 01 Sept. 2009. IEEE computer Society Digital Library. IEEE Computer Society, <http://doi.ieeecomputersociety.org/10.1109/TCBB.2009.68>
Usage of this product signifies your acceptance of the Terms of Use.