loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
21st International Conference on Advanced Information Networking and Applications Workshops (AINAW'07)
Application of Double Clustering to Gene Expression Data for Class Prediction
Niagara Falls, Ontario, Canada
May 21-May 23
ISBN: 0-7695-2847-3
Mohammed Al-Shalalfa, University of Calgary, Canada
Reda Alhajj, University of Calgary, Canada; Global University, Lebanon
Extracting significant features from gene expression data is a hot subject that continues to receive great attention. Many methods have been proposed in the literature to deal with this issue, but all of these methods deal with features obtained directly from the data. Since microarray data exhibit a high degree of noise, in this paper we try to reduce the noise by using double clustering approach to identify reduced set of features capable of distinguishing between two classes. Also, we showed that the transformation of the data plays a significant role in classification. We have used two forms of data, and we have used k-means and Self organizing map for clustering. Support vector machine and binary decision trees are used for classification. As a result of the conducted experiments on AML/ALL data, we have observed that CSVM is able to correctly classify the whole training and testing data when the data is log2 transformed using only few features.
Index Terms:
clustering, classification, feature reduction, microarray, support vector machine.
Citation:
Mohammed Al-Shalalfa, Reda Alhajj, "Application of Double Clustering to Gene Expression Data for Class Prediction," ainaw, vol. 1, pp.733-738, 21st International Conference on Advanced Information Networking and Applications Workshops (AINAW'07), 2007
Usage of this product signifies your acceptance of the Terms of Use.