This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Learning to Relate Images
PrePrint
ISSN: 0162-8828
Roland Memisevic, University of Montreal, Montreal
A fundamental operation in many vision tasks, including motion understanding, stereopsis, visual odometry, or invariant recognition, is establishing correspondences between images or between images and data from other modalities. Recently, there has been an increasing interest in learning to infer correspondences from data using relational, spatio-temporal, and bi-linear variants of deep learning methods. These methods use multiplicative interactions between pixels or between features to represent correlation patterns across multiple images. In this paper we review the recent work on relational feature learning, and we provide an analysis of the role that multiplicative interactions play in learning to encode relations. We also discuss how square-pooling and complex cell models can be viewed as a way to represent multiplicative interactions and thereby as a way to encode relations.
Index Terms:
Machine learning,Computing Methodologies,Artificial Intelligence,Vision and Scene Understanding,Learning
Citation:
Roland Memisevic, "Learning to Relate Images," IEEE Transactions on Pattern Analysis and Machine Intelligence, 08 March 2013. IEEE computer Society Digital Library. IEEE Computer Society, <http://doi.ieeecomputersociety.org/10.1109/TPAMI.2013.53>
Usage of this product signifies your acceptance of the Terms of Use.