Third International Conference on Natural Computation (ICNC 2007)
Haikou, Hainan, China
Aug. 24, 2007 to Aug. 27, 2007
ISBN: 0-7695-2875-9
pp: 451-455
Yu Chen, Sichuan University, China
Changjie Tang, Sichuan University, China
Jun Zhu, National Center for Birth Defects Monitoring, China
Chuan Li, Sichuan University, China
Shaojie Qiao, Sichuan University, China
Rui Li, University of California Riverside, USA
Jiang Wu, Sichuan University, China
Most existing clustering methods require prior knowledge, such as the number of clusters and threshold values, which is difficult to determine accurately in practice. To address this problem, this study proposes GEP-Cluster, a novel clustering algorithm based on Gene Expression Programming (GEP) that requires no prior knowledge. The main contributions are: (1) a new concept, Clustering Algebra, that treats clustering as an algebraic operation; (2) the GEP-Cluster algorithm, which uses GEP to automatically find the best clustering information and discover the best clustering solution without any prior knowledge; and (3) AMCA (Automatic Merging Cluster Algorithm), which merges clusters automatically. Extensive experiments demonstrate that GEP-Cluster clusters effectively on various data sets without any prior knowledge.

Y. Chen et al., "Clustering Without Prior Knowledge Based on Gene Expression Programming," Third International Conference on Natural Computation (ICNC 2007), Haikou, Hainan, China, 2007, pp. 451-455.
