21st International Conference on Advanced Information Networking and Applications Workshops (AINAW'07)
Attractive Feature Reduction Approach for Colon Data Classification
Niagara Falls, Ontario, Canada
May 21-May 23
ISBN: 0-7695-2847-3
In this paper, we try to identify a set of reduced features capable of distinguishing between two classes by performing double clustering using fuzzy c-means. We decided on using fuzzy c-means because a fuzzy model fits better the gene expression data analysis. Fuzziness parameter m is a major problem in applying fuzzy c-means method for clustering. In this approach, we applied fuzzy c-means clustering using different fuzziness parameters for two forms of microarray data. Support vector machine with different kernel functions are used for classification. As a result of the experiments conducted on the colon dataset, we have observed that CSVM is able to correctly classify the whole training and test sets when the data is log2 transformed and when m is close to 1.5.
Index Terms:
Clustering, classification, microarray, validity analysis, support vector machine, fuzziness parameter, Fuzzy C-means(FCM).
Citation:
Mohammed Al-Shalalfa, Reda Alhajj, "Attractive Feature Reduction Approach for Colon Data Classification," ainaw, vol. 1, pp.678-683, 21st International Conference on Advanced Information Networking and Applications Workshops (AINAW'07), 2007