The Community for Technology Leaders
2008 Eighth IEEE International Conference on Data Mining (2008)
Dec. 15, 2008 to Dec. 19, 2008
ISSN: 1550-4786
ISBN: 978-0-7695-3502-9
pp: 133-142
ABSTRACT
Order-preserving submatrices (OPSM's) have been shown useful in capturing concurrent patterns in data when the relative magnitudes of data items are more important than their absolute values. To cope with data noise, repeated experiments are often conducted to collect multiple measurements. We propose and study a more robust version of OPSM, where each data item is represented by a set of values obtained from replicated experiments. We call the new problem OPSM-RM (OPSM with repeated measurements). We define OPSM-RM based on a number of practical requirements. We discuss the computational challenges of OPSM-RM and propose a generic mining algorithm. We further propose a series of techniques to speed up two time-dominating components of the algorithm. We clearly show the effectiveness of our methods through a series of experiments conducted on real microarray data.
INDEX TERMS
OPSM, sequence mining, gene expression
CITATION

C. K. Chui, K. Y. Yip, B. Kao and S. D. Lee, "Mining Order-Preserving Submatrices from Data with Repeated Measurements," 2008 Eighth IEEE International Conference on Data Mining(ICDM), vol. 00, no. , pp. 133-142, 2008.
doi:10.1109/ICDM.2008.12
96 ms
(Ver 3.3 (11022016))