loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
10th International Conference on Parallel and Distributed Systems (ICPADS'04)
Parallel Implementation of WAP-Tree Mining Algorithm
Newport Beach, California
July 07-July 09
ISBN: 0-7695-2152-5
Ming Wu, MIchigan State University, MI
Moon Jung Chung, MIchigan State University, MI
H. D. K. Moonesinghe, MIchigan State University, MI
In this paper, we present parallel algorithms for web log mining and the performance prediction model. The algorithm, based on WAP-tree, scans dataset only twice and avoids candidate generation process. We parallelized mining part of WAP tree. To balance the workload among processors, we developed a task scheduling strategy. A performance model of parallel web mining algorithm is also developed to predict the performance of parallel implementation. This model shows that we can get linear speedup for a small number of processors, and a slow down of speedup as the number of processors increases. Using the performance model, we can also estimate the maximum speed up. We implemented the algorithm on a Pittsburg Super Computer Center Lemieux using up to 48 processors. Our benchmark results showed that the performance model correctly predicts the performance of the parallel implementation. We have achieved a good speedup as the size of the dataset is increased.
Citation:
Ming Wu, Moon Jung Chung, H. D. K. Moonesinghe, "Parallel Implementation of WAP-Tree Mining Algorithm," icpads, pp.135, 10th International Conference on Parallel and Distributed Systems (ICPADS'04), 2004
Usage of this product signifies your acceptance of the Terms of Use.