Issue No. 08 - August (2005 vol. 27)
Many vision tasks can be formulated as graph partition problems that minimize energy functions. For such problems, the Gibbs sampler  provides a general solution but is very slow, while other methods, such as Ncut  and graph cuts , , are computationally effective but only work for specific energy forms  and are not generally applicable. In this paper, we present a new inference algorithm that generalizes the Swendsen-Wang method  to arbitrary probabilities defined on graph partitions. We begin by computing graph edge weights, based on local image features. Then, the algorithm iterates two steps. 1) Graph clustering: It forms connected components by cutting the edges probabilistically based on their weights. 2) Graph relabeling: It selects one connected component and flips probabilistically, the coloring of all vertices in the component simultaneously. Thus, it realizes the split, merge, and regrouping of a "chunk” of the graph, in contrast to Gibbs sampler that flips a single vertex. We prove that this algorithm simulates ergodic and reversible Markov chain jumps in the space of graph partitions and is applicable to arbitrary posterior probabilities or energy functions defined on graphs. We demonstrate the algorithm on two typical problems in computer vision—image segmentation and stereo vision. Experimentally, we show that it is 100-400 times faster in CPU time than the classical Gibbs sampler and 20-40 times faster then the DDMCMC segmentation algorithm . For stereo, we compare performance with graph cuts and belief propagation. We also show that our algorithm can automatically infer generative models and obtain satisfactory results (better than the graphic cuts or belief propagation) in the same amount of time.
Index Terms- Swendsen-Wang, cluster sampling, Markov chain Monte Carlo, Bayesian inference, image segmentation, stereo matching.
Song-Chun Zhu, Adrian Barbu, "Generalizing Swendsen-Wang to Sampling Arbitrary Posterior Probabilities", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 27, no. , pp. 1239-1253, August 2005, doi:10.1109/TPAMI.2005.161