This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Correspondence with Cumulative Similiarity Transforms
February 2001 (vol. 23 no. 2)
pp. 222-227

Abstract—A local image transform based on cumulative similarity measures is defined and is shown to enable efficient correspondence and tracking near occluding boundaries. Unlike traditional methods, this transform allows correspondences to be found when the only contrast present is the occluding boundary itself and when the sign of contrast along the boundary is possibly reversed. The transform is based on the idea of a cumulative similarity measure which characterizes the shape of local image homogeneity; both the value of an image at a particular point and the shape of the region with locally similar and connected values is captured. This representation is insensitive to structure beyond an occluding boundary but is sensitive to the shape of the boundary itself, which is often an important cue. We show results comparing this method to traditional least-squares and robust correspondence matching.

[1] A.A. Amini,T.E. Weymouth,, and R.C. Jain,“Using dynamic programming for solving variational problems in vision,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 12, no. 9, pp. 855-867, 1990.
[2] S. Ayer and H. Sawhney, "Layered Representation of Motion Video Using Robust Maximum-Likelihood Estimation of Mixture Models and mdl Encoding," Int'l Conf. Computer Vision, pp. 777-784,Cambridge, Mass., June 1995.
[3] J.R. Bergen, P.J. Burt, K. Hanna, R. Hingorani, P. Jeanne, and S. Peleg, “Dynamic Multiple-Motion Computation,” Artificial Intelligence and Computer Vision, Y.A. Feldman and A. Bruckstein, eds., Elsevier Science Publishers B.V., 1991.
[4] D. Bhar and S. Nayar, “Ordinal Measures for Visual Correspondence,” Proc. Conf. Computer Vision and Pattern Recognition, 1994.
[5] M.J. Black and P. Anandan,“A framework for the robust estimation of optical flow,” Proc. Int’l Conf. on Computer Vision, ICCV-93, Berlin, pp. 231-236, May 1993.
[6] M.J. Black and Y. Yacoob, "Tracking and Recognizing Rigid and Non-Rigid Facial Motions Using Local Parametric Model of Image Motion," Proc. Int'l Conf. Computer Vision, pp. 374-381,Cambridge, Mass., 1995.
[7] M. Black, Y. Yacoob, A. Jepson, and D. Fleet, “Learning Parameterized Models of Image Motion,” Proc. Conf. Computer Vision and Pattern Recognition, 1997.
[8] M. Black, Y. Yacoob, and D. Fleet, “Modeling Appearance Change in Image Sequences,” Proc. Third Int'l Workshop Visual Form, 1997.
[9] A. Blake and M. Isard, ”Three-Dimensional Position and Shape Input Using Video Tracking of Hands and Lips,” Proc. SIGGRAPH '94, pp. 185-192, 1994.
[10] C. Bregler and M. Slaney, ”Snakes,” IRC-TR 1995-017, Interval Research Corp., Palo Alto, Calif., 1995. seehttp://web.interval.com/papers1995-017/.
[11] Y. Boykov, O. Veksler, and R. Zabih, “A Variable Window Approach to Early Vision,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 20, no. 12, pp. 1283-1294, Dec. 1998.
[12] C. Bregler, M. Covell, and M. Slaney, “Video Rewrite,” Proc. SIGGRAPH, 1997.
[13] T.F. Cootes, C.J. Taylor, A. Lanitis, D.H. Cooper, and J. Graham, "Building and Using Flexible Models Incorporating Grey-Level Information," Proc. Fourth Int'l Conf. Computer Vision, pp. 242-246.Los Alamitos, Calif.: IEEE CS Press, 1993.
[14] M. Covell and T. Darrell, “Dynamic Occluding Contours: A New External Energy Term for Snakes,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, 1999.
[15] M. Covell and M. Withgott, “Automatic Morphing: Spanning the Gap between Motion Estimation and Morphing,” Proc. Int'l Conf. Acoustics, Speech, and Signal Processing, 1994.
[16] M. Covell and C. Bregler, “Eigenpoints,” Proc. Int'l Conf. Image Processing, 1996.
[17] T. Darrell and A.P. Pentland,“Robust estimation of a multilayer motion representation,” Proc. IEEE Workshop on Visual Motion, pp. 173-177, 1991.
[18] T. Darrell and E. Simoncelli, “Nulling Filters and the Separation of Transparent Motions,” Proc. Computer Vision and Pattern Recognition, 1993.
[19] B. Galvin, B. McCane, and K. Novins, “Virtual Snakes for Occlusion Analysis,” Proc Computer Vision and Pattern Recognition, pp. 294-299, 1999.
[20] G. Hager and P. Belhumeur, “Real-Time Tracking of Image Regions with Changes in Geometry and Illumination,” Proc. Computer Vision and Pattern Recognition, 1996.
[21] D.P. Huttenlocher, J.J. Noh, and W.J. Rucklidge, "Tracking Non-Rigid Objects in Complex Scenes," Proc. IEEE Int'l Conf. Computer Vision, 1993.
[22] M. Irani, B. Rousso, and S. Peleg, “Computing Occluding and Transparent Motions,” Int'l J. Computer Vision, vol. 12, no. 1, pp. 5-16, Jan. 1994.
[23] M. Isard and A. Blake, “Contour Tracking by Stochastic Propagation of Conditional Density,” Proc. European Conf. Computer Vision, pp. 343-356, 1996.
[24] D. Jones and J. Malik, “A Computational Framework for Determining Stereo Correspondences from a Set of Linear Spatial Filters,” Proc. Second European Conf. Computer Vision, pp. 395-410, 1992.
[25] T. Kanade, A. Yoshida, K. Oda, H. Kano, and M. Tanaka, "A Video-Rate Stereo Machine and Its New Applications," Proc. IEEE Conf. Computer Vision and Pattern Recognition 96, pp.196-202, 1996.
[26] T. Kanade and M. Okutomi, “A Stereo Matching Algorithm with an Adaptive Window: Theory and Experiment,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 16, pp. 920-932, 1994.
[27] M. Kass, A. Witkin, and D. Terzopolous, ”Snakes: Active Contour Models,” Proc. Int'l Conf. Computer Vision, pp. 259-268, 1987.
[28] P. Litwinowicz and L. Williams, "Animating Images with Drawings," Computer Graphics (Proc. SIGGRAPH '94), pp. 409-412, 1994.
[29] N. Peterfreund, ”The Velocity Snake,” Proc. IEEE Non-Rigid and Articulated Motion Workshop, pp. 70-79, 1997.
[30] M. Shizawa and K. Mase, “Simultaneous Multiple Optical Flow Estimation,” Proc. Computer Vision and Pattern Recognition, 1990.
[31] E.P. Simoncelli, E.H. Adelson, and D.J. Heeger, "Probability Distributions of Optic Flow," IEEE CS Conf. Computer Vision and Pattern Recognition (CVPR'91), pp. 310-315,Maui, Hawaii, June 1991, IEEE CS Press.
[32] D. Terzopoulos and R. Szeliski, “Tracking with Kalman Snakes,” Active Vision, A. Blake and A. Yuille, eds., pp. 3-20, MIT Press, 1992.
[33] J. Wang and E. H. Adelson, “Layered Representations for Image Sequence Coding,” Proc. Conf. Computer Vision and Pattern Recognition, 1993.
[34] L. Wiskott, J. Fellous, N Kruger, C. von der Mals-burg, ”Face Recognition and Gender Determination,” Proc. Int'l Workshop Automatic Face and Gesture Recognition, pp. 92-97, 1995.
[35] W. Yu, K. Daniilidis, S. Beauchemin, and G. Sommer, “Detection and Characterization of Multiple Motion Points,“ IEEE Conf. Computer Vision and Pattern Recognition, vol. I, pp. 171-177, June 1999.
[36] A.L. Yuille, D.S. Cohen, and P.W. Hallinan, "Feature Extraction from Faces Using Deformable Templates," Proc. Computer Vision and Pattern Recognition,San Diego, June 1989.
[37] R. Zabih and J. Woodfill, “Non-Parametric Local Transforms for Computing Visual Correspondence,” Proc. European Conf. Computer Vision, May 1994.

Index Terms:
Image correspondence, stereo, motion, contour tracking.
Citation:
Trevor Darrell, Michele Covell, "Correspondence with Cumulative Similiarity Transforms," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 23, no. 2, pp. 222-227, Feb. 2001, doi:10.1109/34.908973
Usage of this product signifies your acceptance of the Terms of Use.