Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06)
Clustering of SNP Data with Application to Genomics
Hong Kong, China
December 18-December 22
ISBN: 0-7695-2702-7
Single nucleotide polymorphisms (SNPs) are very common throughout the genome and hence are potentially valuable for mapping disease susceptibility loci by detecting association between SNP markers and disease. Many methods may only be applicable when marker haplotypes, rather than genotypes (categorical data), are available for analysis. In this paper, we explore the properties of k-modes (categorical data) clustering algorithms to SNP data for detecting association between SNP markers and disease. Subspace k-modes clustering properties are also considered and tested.
Citation:
Michael K. Ng, Mark J. Li, Sio I. Ao, Pak C. Sham, Yiu-ming Cheung, Joshua Z. Huang, "Clustering of SNP Data with Application to Genomics," icdmw, pp.158-162, Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06), 2006