2006 First International Multi-Symposiums on Computer and Computational Sciences Clustering of Gene Expression Data: Performance and Similarity Analysis Hangzhou, Zhejiang, China June 20-June 24 ISBN: 0-7695-2581-4
Recent advances of the DNA Microarray technology allow monitoring gene expression profiles of thousands of genes simultaneously. However, the analysis and handling of such fast growing data is becoming the major bottleneck in the utilization of the technology. Clustering analysis is one of the most effective methods for analyzing such gene expression data. In this paper we first experimentally study three major clustering algorithms: Hierarchical Clustering, Self-Organizing Map (SOM), and Self Organizing Tree Algorithm (SOTA), using Yeast Saccharomyces cerevisiae gene expression data, and compare their performance. Then, we present a data mining tool, Cluster Diff, which allows the similarity analysis of clusters generated by different algorithms. A case study is conducted based on clusters generated by SOTA and SOM.
Index Terms:
Clustering algorithms, Gene expression, Microarray, Cluster Similarity Analysis, Performance study
Citation:
Longde Yin, Chun-Hsi Huang, "Clustering of Gene Expression Data: Performance and Similarity Analysis," imsccs, vol. 1, pp.142-149, 2006 First International Multi-Symposiums on Computer and Computational Sciences, 2006 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||