loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Sixth IEEE International Conference on Complex Computer Systems (ICECCS'00)
Scalable Data Mining with Log Based Consistency DSM for High Performance Distributed Computing
Tokyo, Japan
September 11-September 15
ISBN: 0-7695-0583-X
H. Hirayama, Graduate Sch. of Inf. Syst., Univ. of Electro-Commun., Tokyo, Japan
H. Honda, Graduate Sch. of Inf. Syst., Univ. of Electro-Commun., Tokyo, Japan
T. Yuba, Graduate Sch. of Inf. Syst., Univ. of Electro-Commun., Tokyo, Japan
Abstract: Mining the large Web based online distributed databases to discover new knowledge and financial gain is an important research problem. These computations require high performance distributed and parallel computing environments. Traditional data mining techniques such as classification, association, clustering can be extended to find new efficient solutions. The paper presents the scalable data mining problem, proposes the use of software DSM (distributed shared memory) with a new mechanism as an effective solution and discusses both the implementation and performance evaluation results. It is observed that the overhead of a software DSM is very large for scalable data mining programs. A new Log Based Consistency (LBC) mechanism, especially designed for scalable data mining on the software DSM is proposed to overcome this overhead. Traditional association rule based data mining programs frequently modify the same fields by count-up operations. In contrast, the LBC mechanism keeps up the consistency by broadcasting the count-up operation logs among the multiple nodes.
Index Terms:
data mining; scalable data mining; log based consistency DSM; high performance distributed computing; Web based online distributed database mining; research problem; parallel computing environments; data mining techniques; scalable data mining problem; distributed shared memory; performance evaluation results; software DSM; scalable data mining programs; association rule based data mining programs; count-up operations; LBC mechanism; count-up operation logs; multiple nodes
Citation:
H. Hirayama, H. Honda, T. Yuba, "Scalable Data Mining with Log Based Consistency DSM for High Performance Distributed Computing," iceccs, pp.0143, Sixth IEEE International Conference on Complex Computer Systems (ICECCS'00), 2000
Usage of this product signifies your acceptance of the Terms of Use.