loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
18th International Conference on Pattern Recognition (ICPR'06) Volume 3
Semi-Parametric Model-Based Clustering for DNA Microarray Data
Hong Kong
August 20-August 24
ISBN: 0-7695-2521-0
Bohyung Han, Samsung Information Systems America Irvine, CA
Larry S. Davis, University of Maryland College Park, MD
Various clustering methods have been proposed for the analysis of gene expression data, but conventional clustering algorithms have several critical limitations; how to set parameters such as number of clusters, initial cluster centers, and so on. In this paper, we propose a semi-parametric model-based clustering algorithm in which the underlying model is a mixture of Gaussian. Each gene expression data builds a Gaussian kernel, and the uncertainty of microarray data is naturally integrated in the data representation. Our algorithm provides a principled method to automatically determine parameters - number of components in the mixture, mean, covariance, and weight of each Gaussian - by mean-shift procedure [2] and curvature fitting. After the initialization, Expectation Maximization (EM) algorithm is employed for clustering to achieve Maximum Likelihood (ML). The performance of our algorithm is compared with standard EM algorithm using real data as well as synthetic data.
Citation:
Bohyung Han, Larry S. Davis, "Semi-Parametric Model-Based Clustering for DNA Microarray Data," icpr, vol. 3, pp.324-327, 18th International Conference on Pattern Recognition (ICPR'06) Volume 3, 2006
Usage of this product signifies your acceptance of the Terms of Use.