Issue No. 09 - Sept. (2013 vol. 35)
ISSN: 0162-8828
pp: 2206-2222
M. Ranzato, Dept. of Comput. Sci., Univ. of Toronto, Toronto, ON, Canada
V. Mnih, Dept. of Comput. Sci., Univ. of Toronto, Toronto, ON, Canada
J. M. Susskind, Machine Perception Lab., Univ. of California San Diego, La Jolla, CA, USA
G. E. Hinton, Dept. of Comput. Sci., Univ. of Toronto, Toronto, ON, Canada
ABSTRACT
This paper describes a Markov Random Field for real-valued image modeling that has two sets of latent variables. One set is used to gate the interactions between all pairs of pixels, while the second set determines the mean intensity of each pixel. This is a powerful model with a conditional distribution over the input that is Gaussian, with both mean and covariance determined by the configuration of latent variables. Previous models, by contrast, were restricted to using Gaussians with either a fixed mean or a diagonal covariance matrix. Thanks to the increased flexibility, this gated MRF can generate more realistic samples after training on an unconstrained distribution of high-resolution natural images. Furthermore, the latent variables of the model can be inferred efficiently and can be used as very effective descriptors in recognition tasks. Both generation and discrimination drastically improve as layers of binary latent variables are added to the model, yielding a hierarchical model called a Deep Belief Network.
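To illustrate what it means for both the mean and the covariance of the conditional Gaussian to depend on latent variables, here is a minimal sketch. It is not taken from this page: the parametrization (a precision matrix built as a gated sum of filter outer products plus the identity, and a mean obtained from a second weight matrix) and every name in it (`C`, `W`, `gates`, `h_mean`, `conditional_gaussian`) are assumptions chosen for illustration only.

```python
import numpy as np

def conditional_gaussian(C, W, gates, h_mean):
    """Return (mean, cov) of a Gaussian over pixels given two latent sets.

    Hypothetical parametrization (an assumption, not the paper's stated form):
      precision = C diag(gates) C^T + I   (gates >= 0 scale pairwise terms)
      mean      = cov @ (W @ h_mean)      (second latent set shifts the mean)
    """
    n_pixels = C.shape[0]
    # Each non-negative gate scales one rank-one pairwise interaction term.
    precision = C @ np.diag(gates) @ C.T + np.eye(n_pixels)
    cov = np.linalg.inv(precision)
    cov = 0.5 * (cov + cov.T)  # remove tiny numerical asymmetry
    mean = cov @ (W @ h_mean)
    return mean, cov

rng = np.random.default_rng(0)
n_pixels, n_filters, n_mean = 16, 8, 4
C = rng.normal(size=(n_pixels, n_filters))   # hypothetical covariance filters
W = rng.normal(size=(n_pixels, n_mean))      # hypothetical mean weights
gates = rng.uniform(0.0, 1.0, size=n_filters)
h_mean = rng.integers(0, 2, size=n_mean).astype(float)

mean, cov = conditional_gaussian(C, W, gates, h_mean)
sample = rng.multivariate_normal(mean, cov)  # one draw of a 16-pixel patch
```

Changing `gates` changes the full (non-diagonal) covariance, while changing `h_mean` moves the mean, which is the flexibility the abstract contrasts with fixed-mean or diagonal-covariance models.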
INDEX TERMS
Image reconstruction, Probabilistic logic, Logic gates, Computational modeling, Covariance matrix, Vectors, Adaptation models, facial expression recognition, Gated MRF, natural images, deep learning, unsupervised learning, density estimation, energy-based model, Boltzmann machine, factored 3-way model, generative model, object recognition, denoising
CITATION
M. Ranzato, V. Mnih, J. M. Susskind, G. E. Hinton, "Modeling Natural Images Using Gated MRFs", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 35, no. 9, pp. 2206-2222, Sept. 2013, doi:10.1109/TPAMI.2013.29