loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
20th International Symposium on High-Performance Computing in an Advanced Collaborative Environment (HPCS'06)
Distributed Data Mining on Virtual Clusters
St. John's, Newfoundland
May 14-May 17
ISBN: 0-7695-2582-2
Gabriel Mateescu, Research Computing Support Group, Canada
Julio Valdes, Institute for Information Technology, Canada
For complex processes investigated in scientific fields such as medicine and earth sciences, knowledge discovery that exposes the underlying structure of the processes is crucial for detecting changes of state and constructing forecasting procedures. A model discovery approach has been recently developed, which uses computational intelligence techniques to deal with the heterogeneity, incompleteness and imprecision of the data describing complex proceeses. While the approach offers a tractable and effective means for model discovery, it is still computationally expensive, routinely requiring tens of thousands of hours of CPU time.

To satisfy the needs of such applications, it is more costeffective to employ shared resources located in different departments of an organization, than to purchase large and expensive compute clusters. We present a method for aggregating resources under multiple administrative domains into a virtual resource that can satisfy efficiently the needs of data-mining based model discovery. The proposed resource aggregation and job management approach provides an end-to-end solution to distributed data mining across organization-wide resources.

Citation:
Gabriel Mateescu, Julio Valdes, "Distributed Data Mining on Virtual Clusters," hpcs, pp.6, 20th International Symposium on High-Performance Computing in an Advanced Collaborative Environment (HPCS'06), 2006
Usage of this product signifies your acceptance of the Terms of Use.