2007 Seventh IEEE International Conference on Data Mining Parallel Mining of Frequent Closed Patterns: Harnessing Modern Computer Architectures Omaha, Nebraska, USA October 28-October 31 ISBN: 0-7695-3018-4
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICDM.2007.13
Inspired by emerging multi-core computer architectures, in this paper we present MT CLOSED, a multi-threaded algorithm for frequent closed itemset mining (FCIM). To the best of our knowledge, this is the first FCIM parallel algorithm proposed so far. We studied how different duplicate checking techniques, typical of FCIM algorithms, may affect this parallelization. We showed that only one of them allows to decompose the global FCIM problem into independent tasks that can be executed in any order, and thus in parallel. Finally we show how MT CLOSED efficiently harness modern CPUs. We designed and tested several parallelization paradigms by investigating static/dynamic decomposition and scheduling of tasks, thus showing its scalability w.r.t. to the number of CPUs. We analyzed the cache friendliness of the algorithm. Finally, we provided additional speed-up by introducing SIMD extensions.
Citation:
Claudio Lucchese, Salvatore Orlando, Raffaele Perego, "Parallel Mining of Frequent Closed Patterns: Harnessing Modern Computer Architectures," icdm, pp.242-251, 2007 Seventh IEEE International Conference on Data Mining, 2007 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||