Issue No. 01 - January/February (2002 vol. 14)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/69.979975
<p><b>Abstract</b>—We present a new distributed algorithm for computing the minimum description length (MDL) in learning Bayesian inference networks from data. Our learning algorithm exploits both properties of the MDL-based score metric and a distributed, asynchronous, adaptive search technique called nagging. Nagging is intrinsically fault-tolerant, has dynamic load balancing features, and scales well. We demonstrate the viability, effectiveness, and scalability of our approach empirically with several experiments using networked machines. More specifically, we show that our distributed algorithm can provide optimal solutions for larger problems as well as good solutions for Bayesian networks of up to 150 variables.</p>
data mining, Bayesian networks, minimum description length principle, distributed systems
A. Segre and W. Lam, "A Distributed Learning Algorithm for Bayesian Inference Networks," in IEEE Transactions on Knowledge & Data Engineering, vol. 14, no. , pp. 93-105, 2002.