This Article 
 Bibliographic References 
 Add to: 
An Integrated Bayesian Approach to Layer Extraction from Image Sequences
March 2001 (vol. 23 no. 3)
pp. 297-303

Abstract—This paper describes a Bayesian approach for modeling 3D scenes as a collection of approximately planar layers that are arbitrarily positioned and oriented in the scene. In contrast to much of the previous work on layer-based motion modeling, which computes layered descriptions of 2D image motion, our work leads to a 3D description of the scene. There are two contributions within the paper. The first is to formulate the prior assumptions about the layers and scene within a Bayesian decision making framework which is used to automatically determine the number of layers and the assignment of individual pixels to layers. The second is algorithmic. In order to achieve the optimization, a Bayesian version of RANSAC is developed with which to initialize the segmentation. Then, a generalized expectation maximization method is used to find the MAP solution.

[1] S. Ayer and H. Sawhney, "Layered Representation of Motion Video Using Robust Maximum-Likelihood Estimation of Mixture Models and mdl Encoding," Int'l Conf. Computer Vision, pp. 777-784,Cambridge, Mass., June 1995.
[2] S. Baker, R. Szeliski, and P. Anadan, A Layered Approach to Stereo Reconstruction Proc. IEEE Computer Vision and Pattern Recognition 1998 (CVPR '98) Conf., 1998.
[3] P.N. Belhumeur, “A Bayesian Approach to Binocular Stereopsis,” Int'l J. Computer Vision, vol. 19, no. 3, pp. 237-260, 1996.
[4] J.R. Bergen, P.J. Burt, R. Hingorani, and S. Peleg, "A Three-Frame Algorithm for Estimating Two-Component Image Motion," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 14, pp. 886-895, Sept. 1992.
[5] J. Besag, “Spatial Interaction and Statistical Analysis of Lattice Systems,” J. Royal Statistical Soc. London B, vol. 36, pp. 192-225, 1974.
[6] A. Blake and A. Zisserman, Visual Reconstruction. MIT Press, 1987.
[7] A.P. Dempster, N.M. Laird, and D.B. Rubin, “Maximum Likelihood from Incomplete Data via the EM Algorithm,” J. Royal Statistical Soc. B, (with discussion), vol. 39, pp. 1-38, 1977.
[8] O.D. Faugeras, Three-Dimensional Computer Vision: A Geometric Viewpoint.Cambridge, Mass.: MIT Press, 1993.
[9] A. Gelman, J.B. Carlin, H.S. Stern, and D.B. Rubin, Bayesian Data Analysis. Chapman&Hall, 1995.
[10] S. Geman and D. Geman, “Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 6, no. 6, pp. 721-741, June 1984.
[11] U. Grenander, Y. Chow, and D.M. Keenan, HANDS. Springer-Verlag, 1991.
[12] E.T. Jaynes, “Probability Theory as Extended Logic,” available at Jaynes/. need year.
[13] J.L. Mundy and A. Zisserman, Geometric Invariance in Computer Vision.Cambridge, Mass.: MIT Press, 1992.
[14] D.S. Sivia, Data Analysis, A Bayesian Tutorial. Clarendon, Oxford: Clarendon, 1996.
[15] R. Szeliski,“Bayesian modeling of uncertainty in low-level vision,” Int’l J. Computer Vision, vol. 5, no. 3, pp. 271-301, Dec. 1990.
[16] R. Szeliski, “A Multi-View Approach to Motion and Stereo,” Proc. Conf. Computer Vision and Pattern Recognition, pp. 157-163, 1999.
[17] J.Y.A. Wang and E.H. Adelson, Representing Moving Images with Layers IEEE Trans. Image Processing, vol. 3, no. 5, pp. 625-638, Sept. 1994.
[18] Y. Weiss and E. Adelson, “A Unified Mixture Framework for Motion Segmentation: Incorporating Spatial Coherence and Estimating the Number of Models,” Proc. IEEE Computer Soc. Conf. Computer Vision and Pattern Recognition, pp. 321-326, 1996.

Index Terms:
Layer extraction, segmentation, stereo matching, motion estimation.
Philip H.S. Torr, Richard Szeliski, P. Anandan, "An Integrated Bayesian Approach to Layer Extraction from Image Sequences," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 23, no. 3, pp. 297-303, March 2001, doi:10.1109/34.910882
Usage of this product signifies your acceptance of the Terms of Use.