The Community for Technology Leaders
RSS Icon
Issue No.04 - October-December (2010 vol.7)
pp: 579-587
Adriana Muñoz , University of Ottawa, Ottawa
There has been a trend in increasing the phylogenetic scope of genome sequencing while decreasing the quality of the published sequence for each genome. With reduced finishing effort, there is an increasing number of genomes being published in contig form. Rearrangement algorithms, including gene order-based phylogenetic tools, require whole genome data on gene order, segment order, or some other marker order. Items whose chromosomal location is unknown cannot be part of the input. The question we address here is, for gene order-based phylogenetic analysis, how can we use rearrangement algorithms to handle genomes available in contig form only? Our suggestion is to use the contigs directly in the rearrangement algorithms as if they were chromosomes, while making a number of corrections, e.g., we correct for the number of extra fusion/fission operations required to make contigs comparable to full assemblies. We model the relationship between contig number and genomic distance, and estimate the parameters of this model using insect genome data. With this model, we use distance matrix methods to reconstruct the phylogeny based on genomic distance and numbers of contigs. We compare this with methods to reconstruct ancestral gene orders using uncorrected contig data.
Contigs, assembly, genome rearrangements, genomic distance, Drosophila, phylogeny.
Adriana Muñoz, "Rearrangement Phylogeny of Genomes in Contig Form", IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol.7, no. 4, pp. 579-587, October-December 2010, doi:10.1109/TCBB.2010.66
[1] "Assembly/Alignment/Annotation of 12 Related Drosophila Species," http://rana.lbl.govdrosophila/, 2010.
[2] A. Bergeron, J. Mixtacki, and J. Stoye, "A Unifying View of Genome Rearrangements," Algorithms in Bioinformatics, P. BüCher and B.M.E. Moret, eds., pp. 163-173, Springer, 2006.
[3] A. Bhutkar, W.M. Gelbart, and T.F. Smith, "Inferring Genome-Scale Rearrangement Phylogeny and Ancestral Gene Order: A Drosophila Case Study," Genome Biology, vol. 8, p. R236.1-15, 2007.
[4] A. Bhutkar, S. Russo, T.F. Smith, and W.M. Gelbart, "Techniques for Multi-Genome Synteny Analysis to Overcome Assembly Limitations," Genome Informatics, vol. 17, pp. 152-161, 2006.
[5] A. Bhutkar, S.W. Schaeffer, S.M. Russo, M. Xu, T.F. Smith, and W.M. Gelbart, "Chromosomal Rearrangement Inferred from Comparisons of 12 Drosophila Genomes," Genetics, vol. 179, pp. 1657-1680, 2008.
[6] D. Dalevi and N. Eriksen, "Expected Gene-Order Distances and Model Selection in Bacteria," Bioinformatics, vol. 24, pp. 1332-1338, 2008.
[7] Drosophila 12 Genomes Consortium, A.G. Clark et al., "Evolution of Genes and Genomes on the Drosophila Phylogeny," Nature, vol. 450, pp. 203-218, 2007.
[8] N. Eriksen and A. Hultman, "Estimating the Expected Reversal Distance after a Fixed Number of Reversals," Advances in Applied Math., vol. 32, pp. 439-453, 2004.
[9] E. Gaul and M. Blanchette, "Ordering Partially Assembled Genomes Using Gene Arrangements," Proc. RECOMB Comparative Genomics Satellite 2006, Lecture Notes in Computer Science 4205, G. Bourque and N. El-Mabrouk, eds., pp. 113-128, Springer, 2006.
[10] J. Krzywinski, O.G. Grushko, and N.J. Besansky, "Analysis of the Complete Mitochondrial DNA from Anopheles funestus: An Improved Dipteran Mitochondrial Genome Annotation and a Temporal Dimension of Mosquito Evolution," Molecular Phylogenetics and Evolution, vol. 39, no. 2, pp. 417-423, 2006.
[11] D. Sankoff, C. Zheng, P.K. Wall, C.W. dePamphilis, J. Leebens-Mack, and V.A. Albert, "Internal Validation of Ancestral Gene Order Reconstruction in Angiosperm Phylogeny," Proc. RECOMB Comparative Genomics Satellite 2008, Lecture Notes in Computer Science 5267, S. Vialette and C. Nelson, eds., pp. 252-264, Springer, 2008.
[12] J. Savard, D. Tautz, S. Richards, G.M. Weinstock, R.A. Gibbs, J.H. Werren, H. Tettelin, and M.J. Lercher, "Phylogenomic Analysis Reveals Bees and Wasps (Hymenoptera) at the Base of the Radiation of Holometabolous Insects," Genome Research, vol. 16, pp. 1334-1338, 2006.
[13] D.W. Severson, B. DeBruyn, D.D. Lovin, S.E. Brown, D.L. Knudson, and I. Morlais, "Comparative Genome Analysis of the Yellow Fever Mosquito Aedes Aegypti with Drosophila melanogaster and the Malaria Vector Mosquito Anopheles gambiae," J. Heredity, vol. 95, pp. 103-113, 2004.
[14] G. Tesler, "Efficient Algorithms for Multichromosomal Genome Rearrangements," J. Computer and System Sciences, vol. 65, pp. 587-609, 2002.
[15] L.-S. Wang and T. Warnow, "Distance-Based Genome Rearrangement Phylogeny," Mathematics of Evolution and Phylogen, O. Gascuel, ed., pp. 353-383, Oxford Univ. Press, 2005.
[16] S. Yancopoulos, O. Attie, and R. Friedberg, "Efficient Sorting of Genomic Permutations by Translocation, Inversion, and Block Interchange," Bioinformatics, vol. 21, pp. 3340-3346, 2005.
[17] C. Zheng, "Path Groups: A Common Data Structure for Rapid Heuristic Solutions to Ancestral Gene Order Reconstruction Problems," Bioinformatics, vol. 26, pp. 1587-1594, 2010.
26 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool