Issue No.04 - July-Aug. (2013 vol.10)
pp: 1009-1016
Sergio Torres-Sanchez , Dept. de Lenguajes y Sist. Informaticos, Univ. de Granada, Granada, Spain
Nuria Medina-Medina , Dept. de Lenguajes y Sist. Informaticos, Univ. de Granada, Granada, Spain
Chris Gignoux , Dept. of Med., Univ. of California San Francisco, San Francisco, CA, USA
Maria M. Abad-Grau , Dept. de Lenguajes y Sist. Informaticos, Univ. de Granada, Granada, Spain
Esteban Gonzalez-Burchard , Dept. of Med., Univ. of California San Francisco, San Francisco, CA, USA
ABSTRACT
Principal component (PC) plots have become widely used to summarize the genetic variation of individuals in a sample. The similarity between genetic distance in PC plots and geographical distance has been shown to be quite impressive. However, in most situations, individual ancestral origins are not precisely known or are heterogeneously distributed; hence, they can hardly be linked to a single geographical area. We have developed GeneOnEarth, a user-friendly web-based tool that helps geneticists understand whether a linear isolation-by-distance model may apply to a genetic data set, that is, whether genetic distances among a set of individuals resemble geographical distances among their origins. Its main goal is to allow users to first apply a by-view Procrustes method to learn visually whether this model holds. To do that, the user can choose the exact geographical area from an online 2D or 3D world map by using, respectively, Google Maps or Google Earth, and rotate, flip, and resize the images. GeneOnEarth can also compute the optimal rotation angle using Procrustes analysis and assess statistical evidence of similarity when a different rotation angle has been chosen by the user. An online version of GeneOnEarth is available for testing and use at http://bios.ugr.es/GeneOnEarth.
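As an illustration of the kind of computation the abstract refers to, the following is a minimal sketch of a two-dimensional Procrustes fit in Python with NumPy. It is not GeneOnEarth's code: the function names `procrustes_fit` and `permutation_p_value`, the use of the sum of singular values as a similarity score, and the permutation scheme are all assumptions made for this example. The sketch centers and scales both point sets, finds the orthogonal transformation that best maps PC coordinates onto geographical coordinates, reports the rotation angle and whether a flip was needed, and estimates a p-value by shuffling the geographical labels.

```python
import numpy as np


def _standardize(points):
    """Center a set of 2D points and scale it to unit Frobenius norm."""
    pts = np.asarray(points, dtype=float)
    pts = pts - pts.mean(axis=0)
    return pts / np.linalg.norm(pts)


def procrustes_fit(pcs, geo):
    """Align PC coordinates to geographical coordinates (hypothetical helper).

    Returns (angle_degrees, flipped, similarity), where similarity lies in
    [0, 1] and equals 1 when the two configurations match exactly after
    translation, scaling, rotation, and an optional flip.
    """
    X = _standardize(pcs)
    Y = _standardize(geo)
    # Orthogonal Procrustes: the R minimizing ||X @ R - Y|| is U @ Vt,
    # where U, s, Vt is the SVD of the cross-product matrix X^T Y.
    U, s, Vt = np.linalg.svd(X.T @ Y)
    R = U @ Vt
    # A negative determinant means the best fit includes a reflection.
    flipped = bool(np.linalg.det(R) < 0)
    # Factor the flip out (mirror the second PC axis) to get a pure rotation.
    R_rot = np.diag([1.0, -1.0]) @ R if flipped else R
    # Row-vector convention: (1, 0) @ R_rot = (cos a, sin a).
    angle = float(np.degrees(np.arctan2(R_rot[0, 1], R_rot[0, 0])))
    # With unit-norm inputs, the sum of singular values is the similarity.
    similarity = float(s.sum())
    return angle, flipped, similarity


def permutation_p_value(pcs, geo, n_perm=1000, seed=0):
    """Estimate how often shuffled geography fits at least as well (assumed test)."""
    rng = np.random.default_rng(seed)
    geo = np.asarray(geo, dtype=float)
    observed = procrustes_fit(pcs, geo)[2]
    hits = 0
    for _ in range(n_perm):
        shuffled = geo[rng.permutation(len(geo))]
        if procrustes_fit(pcs, shuffled)[2] >= observed:
            hits += 1
    # Add-one correction so the estimate is never exactly zero.
    return (1 + hits) / (1 + n_perm)
```

Under these assumptions, `procrustes_fit` takes two arrays of shape (n, 2), with one row per individual in the same order, and the returned angle is the counterclockwise rotation to apply to the PC plot after any flip. A small p-value from `permutation_p_value` would suggest that the match between genes and geography is unlikely to arise by chance alone.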
INDEX TERMS
Sociology, Statistics, Google, Earth, Genetics, Europe, Data models, admixture, Population genetics, SNPs, web-based application, PCA, Procrustes analysis, population stratification
CITATION
Sergio Torres-Sanchez, Nuria Medina-Medina, Chris Gignoux, Maria M. Abad-Grau, Esteban Gonzalez-Burchard, "GeneOnEarth: Fitting Genetic PC Plots on the Globe", IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 10, no. 4, pp. 1009-1016, July-Aug. 2013, doi:10.1109/TCBB.2013.81