CSDL Home IEEE/ACM Transactions on Computational Biology and Bioinformatics 2012 vol.9 Issue No.05 - Sept.-Oct.
Issue No.05 - Sept.-Oct. (2012 vol.9)
Bo Liao , Coll. of Inf. Sci. & Eng., Hunan Univ., Changsha, China
Xiong Li , Coll. of Inf. Sci. & Eng., Hunan Univ., Changsha, China
Wen Zhu , Coll. of Inf. Sci. & Eng., Hunan Univ., Changsha, China
Zhi Cao , Coll. of Inf. Sci. & Eng., Hunan Univ., Changsha, China
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TCBB.2012.70
The association studies between complex diseases and single nucleotide polymorphisms (SNPs) or haplotypes have recently received great attention. However, these studies are limited by the cost of genotyping all SNPs. Therefore, it is essential to find a small subset of tag SNPs representing the rest of the SNPs. The presence of linkage disequilibrium between tag SNPs and the disease variant (genotyped or not), may allow fine mapping study. In this paper, we combine a nearest-means classifier (NMC) and ant colony algorithm to select tags. Results show that our method (ACO/NMC) can get a similar prediction accuracy with method BPSO/SVM and is better than BPSO/STAMPA for small data sets. For large data sets, although the prediction accuracy of our method is lower than BPSO/SVM, ACO/ NMC can reach a high accuracy (>;99 percent) in a relatively short time. when the number of tags increases, the time complexity of NMC is nearly linear growth. To find out that the ability of tags to locate disease locus, we simulate a case-control study and use two-locus haplotype analysis to quantitatively assess the power. The result showed that 20 percent of all SNPs selected by NMC have about 10 percent higher power than random tags, on average.
support vector machines, biology computing, diseases, genetics, molecular biophysics, pattern classification, polymorphism, two-locus haplotype analysis, genetic association, complex diseases, single nucleotide polymorphisms, linkage disequilibrium, genotyped disease variant, nearest-means classifier, ant colony algorithm, BPSO-SVM, BPSO-STAMPA, case-control study, Diseases, Accuracy, Support vector machines, Bioinformatics, Genomics, Prediction algorithms, tag selection., Haplotypes, single nucleotide polymorphism, informative SNP
Bo Liao, Xiong Li, Wen Zhu, Zhi Cao, "A Novel Method to Select Informative SNPs and Their Application in Genetic Association Studies", IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol.9, no. 5, pp. 1529-1534, Sept.-Oct. 2012, doi:10.1109/TCBB.2012.70