This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Recovery of Ego-Motion Using Region Alignment
March 1997 (vol. 19 no. 3)
pp. 268-272

Abstract—A method for computing the 3D camera motion (the ego-motion) in a static scene is described, where initially a detected 2D motion between two frames is used to align corresponding image regions. We prove that such a 2D registration removes all effects of camera rotation, even for those image regions that remain misaligned. The resulting residual parallax displacement field between the two region-aligned images is an epipolar field centered at the FOE (Focus-of-Expansion). The 3D camera translation is recovered from the epipolar field. The 3D camera rotation is recovered from the computed 3D translation and the detected 2D motion. The decomposition of image motion into a 2D parametric motion and residual epipolar parallax displacements avoids many of the inherent ambiguities and instabilities associated with decomposing the image motion into its rotational and translational components, and hence makes the computation of ego-motion or 3D structure estimation more robust.

[1] G. Adiv, "Determining Three-Dimensional Motion and Structure from Optical Flow Generated by Several Moving Objects," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 7, no. 4, pp. 384-401, July 1985.
[2] G. Adiv, “Inherent Ambiguities in Recovering 3-D Motion and Structure from a Noisy Flow Field,” Trans. Pattern Analysis and Machine Intelligence, vol. 11, pp. 477–489, 1989.
[3] S. Ayer and H. Sawhney, "Layered Representation of Motion Video Using Robust Maximum-Likelihood Estimation of Mixture Models and mdl Encoding," Int'l Conf. Computer Vision, pp. 777-784,Cambridge, Mass., June 1995.
[4] M. Ben-Ezra, S. Peleg, and B. Rousso, "Motion Segmentation Using Convergence Properties," Proc. ARPA IU Workshop, Nov. 1994.
[5] J.R. Bergen, P. Anandan, K.J. Hanna, and R. Hingorani, "Hierarchical Model-Based Motion Estimation," European Conf. Computer Vision, pp. 237-252,Santa Margarita Ligure, Italy, May 1992.
[6] J.R. Bergen, P.J. Burt, R. Hingorani, and S. Peleg, "A Three-Frame Algorithm for Estimating Two-Component Image Motion," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 14, pp. 886-895, Sept. 1992.
[7] M.J. Black and P. Anandan, "The Robust Estimation of Multiple Motions: Parametric and Piecewise-Smooth Flow Fields," CVGIP: Image Understanding, vol. 63, no. 1, pp. 75-104, Jan. 1996.
[8] R. Chipolla, Y. Okamoto, and Y. Kuno, "Robust Structure from Motion Using Motion Parallax," Int'l Conf. Computer Vision, pp. 374-382,Berlin, May 1993.
[9] K. Daniilidis and H.-H. Nagel, "The Coupling of Rotation and Translation in Motion Estimation of Planar Surfaces," IEEE Conf. Computer Vision and Pattern Recognition, pp. 188-193, June 1993.
[10] K. Hanna, "Direct Multi-Resolution Estimation of Ego-Motion and Structure from Motion," IEEE Workshop Visual Motion, pp. 156-162,Princeton, N.J., Oct. 1991.
[11] D.J. Heeger and A. Jepson, "Simple Method for Computing 3d Motion and Depth," Int'l Conf. Computer Vision, pp. 96-100, 1990.
[12] B.K.P. Horn, "Relative Orientation," Int'l J. Computer Vision, vol. 4, no. 1, pp. 58-78, June 1990.
[13] B.K.P. Horn and E.J. Weldon, "Direct Methods for Recovering Motion," Int'l J. Computer Vision, vol. 2, no. 1, pp. 51-76, June 1988.
[14] M. Irani and P. Anandan, "Parallax Geometry of Pairs of Points for 3D Scene Analysis," European Conf. Computer Vision, pp. I:17-30,Cambridge, England, Apr. 1996.
[15] M. Irani, B. Rousso, and S. Peleg, "Detecting and Tracking Multiple Moving Objects Using Temporal Integration," European Conf. Computer Vision, pp. 282-287,Santa Margarita Ligure, Italy, May 1992.
[16] M. Irani, B. Rousso, and S. Peleg, "Computing Occluding and Transparent Motions," Int'l J. Computer Vision, vol. 12, no. 1, pp. 5-16, Jan. 1994.
[17] M. Irani, B. Rousso, and S. Peleg, "Recovery of Ego-Motion Using Image Stabilization," IEEE Conf. Computer Vision and Pattern Recognition, pp. 454-460,Seattle, June 1994.
[18] R. Kumar, P. Anandan, and K. Hanna, "Direct Recovery of Shape from Multiple Views: A Parallax Based Approach," Proc 12th ICPR, 1994.
[19] J.M. Lawn and R. Cipolla, "Epipolar Estimation Using Affine Motion-Parallax," BMVC93, 1993.
[20] D.T. Lawton and J.H. Rieger, "The Use of Difference Fields in Processing Sensor Motion," ARPA IU Workshop, pp. 78-83, June 1983.
[21] C.H. Lee, "Structure and Motion from Two Perspective Views via Planar Patch," Int'l Conf. Computer Vision, pp. 158-164, 1988.
[22] H.C. Longuet-Higgins, "Visual Ambiguity of a Moving Plane," Proc. Royal Society of London B, vol. 223, pp. 165-175, 1984.
[23] S. Negahdaripour and S. Lee, "Motion Recovery from Image Sequences Using First-Order Optical Flow Information," IEEE Workshop Visual Motion, pp. 132-139,Princeton, N.J., Oct. 1991.
[24] F. Lustman, O.D. Faugeras, and G. Toscani, "Motion and Structure from Motion from Point and Line Matching," Proc. First Int'l Conf. Computer Vision, pp. 25-34,London, 1987.
[25] H.S. Sawhney, "3D Geometry From Planar Parallax," Proc. CVPR '94, pp. 929-934, 1994.
[26] A. Shashua and N. Navab, "Relative Affine Structure: Theory and Application to 3D Reconstruction From Perspective Views," Proc. CVPR '94, pp. 483-489, 1994.

Index Terms:
Motion analysis, ego motion, video stabilization, plane-plus-parallax.
Citation:
Michal Irani, Benny Rousso, Shmuel Peleg, "Recovery of Ego-Motion Using Region Alignment," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 19, no. 3, pp. 268-272, March 1997, doi:10.1109/34.584105
Usage of this product signifies your acceptance of the Terms of Use.