Issue No. 01 - January-March (2008 vol. 5)
When analyzing the results of microarray experiments, biologists generally use unsupervised categorization tools. However, such tools regard each time point as an independent dimension and utilize the Euclidean distance to compute the similarities between expressions. Furthermore, some of these methods require the number of clusters to be determined in advance, which is clearly impossible in the case of a new dataset. Therefore, this study proposes a novel scheme, designated as the Variation-based Co-expression Detection (VCD) algorithm, to analyze the trends of expressions based on their variation over time. The proposed algorithm has two advantages. First, it is unnecessary to determine the number of clusters in advance since the algorithm automatically detects those genes whose profiles are grouped together and creates patterns for these groups. Second, the algorithm features a new measurement criterion for calculating the degree of change of the expressions between adjacent time points and evaluating their trend similarities. Three real-world microarray datasets are employed to evaluate the performance of the proposed algorithm.
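The contrast the abstract draws between Euclidean distance and variation-based comparison can be illustrated with a small sketch. This is not the VCD algorithm, whose measurement criterion is not specified in the abstract. It assumes, purely for illustration, that trend similarity is the cosine similarity of the changes between adjacent time points, and that groups are formed greedily against a hypothetical threshold. The function names, the threshold value, and the toy profiles are all invented for this example.

```python
import math


def adjacent_changes(profile):
    """Return the change in expression between each pair of adjacent time points."""
    return [b - a for a, b in zip(profile, profile[1:])]


def euclidean(p, q):
    """Plain Euclidean distance, treating each time point as an independent dimension."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


def trend_similarity(p, q):
    """Cosine similarity of the adjacent-change vectors.

    Illustrative stand-in only; the paper's actual criterion is not given in the abstract.
    """
    dp, dq = adjacent_changes(p), adjacent_changes(q)
    norm = math.sqrt(sum(x * x for x in dp)) * math.sqrt(sum(x * x for x in dq))
    if norm == 0:
        return 0.0  # a flat profile has no trend to compare
    return sum(x * y for x, y in zip(dp, dq)) / norm


def group_by_trend(profiles, threshold=0.9):
    """Greedy grouping with no preset number of clusters (hypothetical threshold).

    Each profile joins the first group whose first member has a similar trend;
    otherwise it starts a new group.
    """
    groups = []
    for name, profile in profiles.items():
        for group in groups:
            if trend_similarity(profiles[group[0]], profile) >= threshold:
                group.append(name)
                break
        else:
            groups.append([name])
    return groups


# Toy profiles (invented data): geneB has the same shape as geneA but is shifted up by 5,
# while geneC moves in the opposite direction at every step.
profiles = {
    "geneA": [1.0, 2.0, 4.0, 3.0],
    "geneB": [6.0, 7.0, 9.0, 8.0],
    "geneC": [4.0, 3.0, 1.0, 2.0],
}

if __name__ == "__main__":
    print(euclidean(profiles["geneA"], profiles["geneB"]))
    print(euclidean(profiles["geneA"], profiles["geneC"]))
    print(group_by_trend(profiles))
```

In this toy data, Euclidean distance places geneA closer to geneC than to geneB, even though geneA and geneB rise and fall together. Comparing the adjacent changes instead groups geneA with geneB and leaves geneC on its own, and the number of groups falls out of the threshold rather than being fixed in advance.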
Pattern analysis, Time series analysis, Bioinformatics, Data mining, Clustering, Gene expression
Z. Yin and J. Chiang, "Novel Algorithm for Coexpression Detection in Time-Varying Microarray Data Sets," in IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 5, no. 1, pp. 120-135, 2008.