
This Article  
 
Share  
Bibliographic References  
Add to:  
Digg Furl Spurl Blink Simpy Del.icio.us Y!MyWeb  
Search  
 
ASCII Text  x  
C. C. M. Chen, H. Schwender, J. Keith, R. Nunkesser, K. Mengersen, P. Macrossan, "Methods for Identifying SNP Interactions: A Review on Variations of Logic Regression, Random Forest and Bayesian Logistic Regression," IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 8, no. 6, pp. 15801591, November/December, 2011.  
BibTex  x  
@article{ 10.1109/TCBB.2011.46, author = {C. C. M. Chen and H. Schwender and J. Keith and R. Nunkesser and K. Mengersen and P. Macrossan}, title = {Methods for Identifying SNP Interactions: A Review on Variations of Logic Regression, Random Forest and Bayesian Logistic Regression}, journal ={IEEE/ACM Transactions on Computational Biology and Bioinformatics}, volume = {8}, number = {6}, issn = {15455963}, year = {2011}, pages = {15801591}, doi = {http://doi.ieeecomputersociety.org/10.1109/TCBB.2011.46}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, }  
RefWorks Procite/RefMan/Endnote  x  
TY  JOUR JO  IEEE/ACM Transactions on Computational Biology and Bioinformatics TI  Methods for Identifying SNP Interactions: A Review on Variations of Logic Regression, Random Forest and Bayesian Logistic Regression IS  6 SN  15455963 SP1580 EP1591 EPD  15801591 A1  C. C. M. Chen, A1  H. Schwender, A1  J. Keith, A1  R. Nunkesser, A1  K. Mengersen, A1  P. Macrossan, PY  2011 KW  Monte Carlo methods KW  belief networks KW  genetics KW  genomics KW  medical computing KW  molecular biophysics KW  molecular configurations KW  single nucleotide polymorphism KW  SNP interactions KW  logic regression KW  random forest KW  Bayesian logistic regression KW  treelike structures KW  logic feature selection KW  Monte Carlo logic regression KW  Genetic Programming for Association Studies KW  modified logic regressiongene expression programming KW  real genotype data KW  random forests KW  stochastic search variable selection KW  Regression analysis KW  Mathematical model KW  Bayesian methods KW  Genetic programming KW  Monte Carlo methods KW  candidate gene search. KW  Logic regressions KW  Genetic Programming for Association Studies KW  Modified Logic RegressionGene Expression Programming KW  Random Forest KW  Bayesian logistic regression with stochastic search algorithm VL  8 JA  IEEE/ACM Transactions on Computational Biology and Bioinformatics ER   
[1] A.S. Andrew, M.R. Karagas, H.H. Nelson, S. Guarrera, S. Polidoro, S. Gamberini, C. Sacerdote, J.H. Moore, K.T. Kelsey, and E. Demidenko, “DNA Repair Polymorphisms Modify Bladder Cancer Risk: A MultiFactor Analytic Strategy,” Human Heredity, vol. 65, no. 2, pp. 105118, 2008.
[2] B. Atik, T.A. Skwor, R.P. Kandel, B. Sharma, H.K. Adhikari, L. Steiner, H. Erlich, and D. Dean, “Identification of Novel Single Nucleotide Polymorphisms in Inflammatory Genes as Risk Factors Associated with Trachomatous Trichiasis,” PLoS ONE, vol. 3, no. 10, pp. e3600, 2008.
[3] L. Breiman, “Bagging Predictors,” J. Machine Learning, vol. 24, no. 2, pp. 123140, 1996.
[4] L. Breiman, “Random Forests,” Machine Learning, vol. 45, no. 1, pp. 532, 2001.
[5] L. Breiman, J.H. Friedman, R.A. Olshen, and C.J. Stone, Classification and Regression Trees. Chapman and Hall CRC, 1984.
[6] C.C.M. Chen, K. Mengersen, and J.M. Keith (In prep), “, Bayesian Method for GenomeWide Association Studies: Review and Illustration,”
[7] D.V. Conti and W.J. Gauderman, “Snps, Haplotypes, and Model Selection in a Candidate Gene Region: The Simple Analysis for Multilocus Data,” Genetic Epidemiology, vol. 27, no. 4, pp. 429441, 2004.
[8] H.J. Cordell, “Epistasis: What it Means, What it Doesn't Mean, and Statistical Methods to Detect it in Humans,” Human Molecular Genetics, vol. 11, no. 20, pp. 24632468, 2002.
[9] H.J. Cordell, “Detecting GeneGene Interactions that Underlie Human Diseases,” Nature Rev. Genetics, vol. 10, no. 6, pp. 392404, 2009.
[10] C. Ferreira, “Gene Expression Programming: A New Adaptive Algorithm for Solving Problems,” Arxiv Preprint cs.AI/0102027, 2001.
[11] B.L. Fridley, “Bayesian Variable and Model Selection Methods for Genetic Association Studies,” Genetic Epidemiology, vol. 33, no. 1, pp. 2737, 2009.
[12] A. Fritsch and K. Ickstadt, “Comparing Logic Regression Based Methods for Identifying SNP Interactions,” Lecture Notes in Computer Science, vol. 4414, pp. 90103, 2007.
[13] E.I. George and R.E. McCulloch, “Variable Selection via Gibbs Sampling,” J. Am. Statistical Assoc., vol. 88, no. 423, pp. 881889, 1993.
[14] E.I. George and R.E. McCulloch, “Approaches for Bayesian Variable Selection,” Statistica Sinica, vol. 7, pp. 339374, 1997.
[15] P.J. Green, “Reversible Jump Markov Chain Monte Carlo Computation and Bayesian Model Determination,” Biometrika, vol. 82, no. 4, pp. 711732, 1995.
[16] A.G. Heidema, J.M.A. Boer, N. Nagelkerke, and E.C.M. Mariman, “The Challenge for Genetic Epidemiologists: How to Analyze Large Numbers of SNPS in Relation to Complex Diseases,” BMC Genetics, vol. 7, no. 23, 2006, doi: 10.1186/147/2156723.
[17] J. Hoh, A. Wille, and J. Ott, “Trimming, Weighting, and Grouping SNPS in Human CaseControl Association Studies,” Genome Research, vol. 11, no. 12, pp. 21152119, 2001.
[18] J.H. Holland, Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control and Artificial Intelligence. Univ. of Michigan Press, 1975.
[19] R. Jiang, W. Tang, X. Wu, and W. Fu, “A Random Forest Approach to the Detection of Epistatic Interactions in CaseControl Studies,” BMC Bioinformatics, vol. 10, Suppl 1, pp. S65, 2009.
[20] C. Justenhoven, U. Hamann, B. Pesch, V. Harth, S. Rabstein, C. Baisch, C. Vollmert, T. Illig, Y.D. Ko, T. Bruning, and H. Brauch, “For the Interdisciplinary Study Group on Gene Environment Interactions, and N. Breast Cancer in Germany,” Cancer Epidemiol Biomarkers Prev, vol. 13, no. 12, pp. 20592064, 2004.
[21] S. Kirkpatrick, C.D. Gelatt, and M.P. Vecchi, “Optimization by Simulated Annealing,” Science, vol. 220, no. 4598, pp. 671680, 1983.
[22] C. Kooperberg, J.C. Bis, K.D. Marciante, S.R. Heckbert, T. Lumley, and B.M. Psaty, “Logic Regression for Analysis of the Association between Genetic Variation in the ReninAngiotensin System and Myocardial Infarction or Stroke,” Am. J. Epidemiology, vol. 165, no. 3, pp. 334343, 2007.
[23] C. Kooperberg and I. Ruczinski, “Identifying Interacting SNPS Using Monte Carlo Logic Regression,” Genetic Epidemiology, vol. 28, no. 2, pp. 157170, 2005.
[24] J.R. Koza and J.P. Rice, Genetic Programming. Springer, 1992.
[25] E.S. Lander, L.M. Linton, B. Birren, C. Nusbaum, M.C. Zody, J. Baldwin, K. Devon, K. Dewar, M. Doyle, and W. FitzHugh, “Initial Sequencing and Analysis of the Human Genome,” Nature, vol. 409, no. 6822, pp. 860921, 2001.
[26] K. Lunetta, L.B. Hayward, J. Segal, and P. Van Eerdewegh, “Screening LargeScale Association Study Data: Exploiting Interactions Using Random Forests,” BMC Genetics, vol. 5, no. 1, p. 32, 2004.
[27] P. Macrossan, C.C.M. Chen, and K.L. Mengersen (In prep)., “Using Gene Expression Programming with Modified Logic Regression for the Investigation of SNP Interactions in Large Dimensional Data,” In Prep.,
[28] P. McCullagh, and J.A. Nelder, Generalized Linear Models. Chapman and Hall, 1983.
[29] L.E. Mechanic, B.T. Luke, J.E. Goodman, S.J. Chanock, and C.C. Harris, “Polymorphism Interaction Analysis (PIA): A Method for Investigating Complex GeneGene Interactions,” BMC Bioinformatics, vol. 9, no. 1, p. 146, 2008.
[30] Y. Meng, Q. Yang, K.T. Cuenco, L.A. Cupples, A.L. DeStefano, and K.L. Lunette, “TwoStage Approach for Identifying SingleNucleotide Polymorphisms Associated with Rheumatoid Arthritis Using Random Forests and Bayesian Networks,” BMC, vol. 1, Suppl 1, pp. S56, 2007.
[31] R.M. Neal, “Markov Chain Monte Carlo Methods Based on “Slicing” the Density Function,” Technical Report No. 9722, Dept. of Statistics, Univ. of Toronto, 1997.
[32] R. Nunkesser, T. Bernholt, H. Schwender, K. Ickstadt, and I. Wegener, “Detecting HighOrder Interactions of Single Nucleotide Polymorphisms Using Genetic Programming,” Bioinformatics, vol. 23, no. 24, pp. 32803288, 2007.
[33] P.C. Phillips, “Epistasisthe Essential Role of Gene Interactions in the Structure and Evolution of Genetic Systems,” Nature Rev. Genetics, vol. 9, no. 11, pp. 855867, 2008.
[34] “R Development Core Team 2008,” R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing, Vienna, Austria, 2008.
[35] T.K. Rice, N.J. Schork, and D.C. Rao, “Methods for Handling Multiple Testing,” Genetic Dissection of Complex Traits, D.C. Rao and C.C. Gu, eds., Academic Press, 2008.
[36] M.D. Ritchie, L.W. Hahn, N. Roodi, L.R. Bailey, W.D. Dupont, F.F. Parl, and J.H. Moore, “MultifactorDimensionality Reduction Reveals HighOrder Interactions among EstrogenMetabolism Genes in Sporadic Breast Cancer,” Am. J. Human Genetics, vol. 69, no. 1, pp. 138147, 2001.
[37] I. Ruczinski, C. Kooperberg, and M. Leblanc, “Logic Regression,” J. Computational and Graphical Statistics, vol. 12, no. 3, pp. 475511, 2003.
[38] H. Schwender, “Statistical Analysis of Genotype and Gene Expression Data,” PhD thesis, Dept. of Statistics, TU Dortmund Univ., 2007.
[39] H. Schwender and K. Ickstadt, “Identification of SNP Interactions Using Logic Regression,” Biostatistics, vol. 8, no. 1, pp. 187198, 2008.
[40] H. Schwender and K. Ickstadt, “Imputing Missing Genotypes with K Nearest Neighbors,” technical report, Collaborative Research Center 475, Dept. of Statistics, Univ. of Dortmund, 2008.
[41] “The International HapMap Consortium the International Hapmap Project,” Nature, vol. 426, pp. 789796, 2003.
[42] N. Yi, V. George, and D.B. Allison, “Stochastic Search Variable Selection for Identifying Multiple Quantitative Trait Loci,” Genetics, vol. 164, pp. 11291138, 2003.
[43] L.J. Zhao, X.G. Liu, Y.Z. Liu, Y.J. Liu, C.J. Papasian, B.Y. Sha, F. Pan, Y.F. Guo, L. Wang, and H. Yan, “GenomeWide Association Study for Femoral Neck Bone Geometry,” J. Bone and Mineral Research, vol. 0, pp. 134, 2009.
[44] Y. Zhang and J.S. Liu, “Bayesian Inference of Epistatic Interactions in CaseControl Studies,” Nature Genetics, vol. 39, no. 9, pp. 11671173, 2007.