2007 Seventh IEEE International Conference on Data Mining
Prism: A Primal-Encoding Approach for Frequent Sequence Mining
Omaha, Nebraska, USA
October 28-October 31
ISBN: 0-7695-3018-4
Sequence mining is one of the fundamental data mining tasks. In this paper we present a novel approach called PRISM, for mining frequent sequences. PRISM utilizes a vertical approach for enumeration and support counting, based on the novel notion of prime block encoding, which in turn is based on prime factorization theory. Via an extensive evaluation on both synthetic and real datasets, we show that PRISM outperforms popular sequence mining methods like SPADE [10], PrefixSpan [6] and SPAM [2], by an order of magnitude or more.
Citation:
Karam Gouda, Mosab Hassaan, Mohammed J. Zaki, "Prism: A Primal-Encoding Approach for Frequent Sequence Mining," icdm, pp.487-492, 2007 Seventh IEEE International Conference on Data Mining, 2007