Issue No.02 - March/April (2011 vol.8)
pp: 381-394
Yonghui Wu , Google Inc., Mountain View
Stefano Lonardi , University of California, Riverside, Riverside
ABSTRACT
We study the problem of merging genetic maps when the individual genetic maps are given as directed acyclic graphs. The computational problem is to build a consensus map, which is a directed graph that includes and is consistent with all (or the vast majority of) the markers in the input maps. However, when markers in the individual maps have ordering conflicts, the resulting consensus map will contain cycles. Here, we formulate the problem of resolving cycles in the context of a parsimonious paradigm that takes into account two types of errors that may be present in the input maps, namely, local reshuffles and global displacements. The resulting combinatorial optimization problem is, in turn, expressed as an integer linear program. A fast approximation algorithm is proposed, and an additional speedup heuristic is developed. Our algorithms were implemented in a software tool named MergeMap, which is freely available for academic use. An extensive set of experiments shows that MergeMap consistently outperforms JoinMap, which is the most popular tool currently available for this task, in terms of both accuracy and running time. MergeMap is available for download at http://www.cs.ucr.edu/~yonghui/mgmap.html.
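To illustrate why ordering conflicts produce cycles, the following is a minimal sketch in Python and not MergeMap's algorithm. It assumes each input map has been simplified to a linear marker order (the paper works with general directed acyclic graphs), merges the maps by taking the union of their adjacency edges, and then runs a Kahn-style topological sort to find markers that cannot be ordered. The function names `merge_maps` and `unorderable_markers` and the marker labels are hypothetical. Resolving the cycles, which the paper formulates as an integer linear program, is not shown.

```python
from collections import defaultdict, deque


def merge_maps(maps):
    """Union the adjacency edges of several linear marker orders.

    Assumption: each map is a list of marker names in map order.
    Returns a dict mapping each marker to the set of its successors.
    """
    graph = defaultdict(set)
    for order in maps:
        for marker in order:
            graph[marker]  # touch the key so every marker is a node
        for a, b in zip(order, order[1:]):
            graph[a].add(b)
    return dict(graph)


def unorderable_markers(graph):
    """Kahn-style topological sort.

    Returns (ordered, stuck): `ordered` is a valid partial ordering of
    the markers that could be placed; `stuck` lists the markers that
    lie on a cycle or downstream of one.
    """
    indegree = {m: 0 for m in graph}
    for successors in graph.values():
        for s in successors:
            indegree[s] += 1

    queue = deque(m for m, d in indegree.items() if d == 0)
    ordered = []
    while queue:
        m = queue.popleft()
        ordered.append(m)
        for s in graph[m]:
            indegree[s] -= 1
            if indegree[s] == 0:
                queue.append(s)

    stuck = sorted(set(graph) - set(ordered))
    return ordered, stuck


# Two maps that agree: the merged graph is acyclic.
consistent = merge_maps([["m1", "m2", "m3"], ["m1", "m3", "m4"]])
print(unorderable_markers(consistent))

# Two maps that disagree on the order of m2 and m3: a cycle appears.
conflicting = merge_maps([["m1", "m2", "m3"], ["m1", "m3", "m2"]])
print(unorderable_markers(conflicting))
```

In the second case the edges m2 → m3 and m3 → m2 form a cycle, so neither marker can be placed. A cycle of this kind is what the consensus-map formulation has to break by deciding which input ordering to discard.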
INDEX TERMS
Linear programming, constrained optimization, algorithms, biology and genetics.
CITATION
Yonghui Wu, Stefano Lonardi, "Accurate Construction of Consensus Genetic Maps via Integer Linear Programming", IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol.8, no. 2, pp. 381-394, March/April 2011, doi:10.1109/TCBB.2010.35