10th International Conference on Parallel and Distributed Systems (ICPADS'04) Parallel Implementation of WAP-Tree Mining Algorithm Newport Beach, California July 07-July 09 ISBN: 0-7695-2152-5
In this paper, we present parallel algorithms for web log mining and the performance prediction model. The algorithm, based on WAP-tree, scans dataset only twice and avoids candidate generation process. We parallelized mining part of WAP tree. To balance the workload among processors, we developed a task scheduling strategy. A performance model of parallel web mining algorithm is also developed to predict the performance of parallel implementation. This model shows that we can get linear speedup for a small number of processors, and a slow down of speedup as the number of processors increases. Using the performance model, we can also estimate the maximum speed up. We implemented the algorithm on a Pittsburg Super Computer Center Lemieux using up to 48 processors. Our benchmark results showed that the performance model correctly predicts the performance of the parallel implementation. We have achieved a good speedup as the size of the dataset is increased.
Citation:
Ming Wu, Moon Jung Chung, H. D. K. Moonesinghe, "Parallel Implementation of WAP-Tree Mining Algorithm," icpads, pp.135, 10th International Conference on Parallel and Distributed Systems (ICPADS'04), 2004 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||