This Article 
 Bibliographic References 
 Add to: 
Fast and Globally Convergent Pose Estimation from Video Images
June 2000 (vol. 22 no. 6)
pp. 610-622

Abstract—Determining the rigid transformation relating 2D images to known 3D geometry is a classical problem in photogrammetry and computer vision. Heretofore, the best methods for solving the problem have relied on iterative optimization methods which cannot be proven to converge and/or which do not effectively account for the orthonormal structure of rotation matrices. We show that the pose estimation problem can be formulated as that of minimizing an error metric based on collinearity in object (as opposed to image) space. Using object space collinearity error, we derive an iterative algorithm which directly computes orthogonal rotation matrices and which is globally convergent. Experimentally, we show that the method is computationally efficient, that it is no less accurate than the best currently employed optimization methods, and that it outperforms all tested methods in robustness to outliers.

[1] W. Wilson, “Visual Servo Control of Robots Using Kalman Filter Estimates of Robot Pose Relative to Work-Pieces,” Visual Servoing, K. Hashimoto, ed., pp. 71-104, World Scientific, 1994.
[2] W.E.L. Grimson, T. Lozano-Perez, S.J. White, W.M. Wells III, R. Kikinis, and G.J. Ettinger, “An Automatic Registration Method for Frameless Stereotaxy, Image Guided Surgery, and Enhanced Reality Visualization,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 430-436, June 1994.
[3] A. State, G. Hirota, D.T. Chen, W.F. Garrett, and M.A. Livingston, “Superior Augmented Reality Registration by Integrating Landmark Tracking and Magnetic Tracking,” Proc. 23rd Int'l Conf. Computer Graphics and Interactive Techniques (SIGGRAPH '96), pp. 429-438, Aug. 1996.
[4] R. Azuma and G. Bishop, “Improving Static and Dynamic Registration in an Optical See-Through HMD,” Computer Graphics (Proc. Siggraph), ACM Press, 1994, pp. 197-204.
[5] M. Bajura, H. Fuchs, and R. Ohbuchi, "Merging Virtual Objects with the Real World: Seeing Ultrasound Imagery within the Patient," Proc. Siggraph 92, ACM Press, New York, 1992, pp. 203-210.
[6] W.E.L. Grimson, Object Recognition by Computer. MIT Press, 1990.
[7] S. Ganapathy, “Decomposition of Transformation Matrices for Robot Vision,” Pattern Recognition Letters, pp. 401-412, 1989.
[8] M.A. Fischler and R.C. Bolles, “Random Sample Consensus: A Paradigm for Model Fitting with Applications to Image Analysis and Automated Cartography,” Graphics and Image Processing, vol. 24, no. 6, pp. 381–395, June 1981.
[9] R. Horaud, B. Conio, O. Leboulleux, and B. Lacolle, “An Analytic Solution for the Perspective 4-Point Problem,” Computer Vision, Graphics, and Image Processing, vol. 47, pp. 33–44, 1989.
[10] J. Huang, J.A. Stankovic, K. Ramamritham, D. Towsley, and B. Purimetla, “On Using Priority Inheritance in Real-Time Databases,” Special Issue of Real-Time Systems J., vol. 4. no. 3, Sept. 1992.
[11] D.F. DeMenthon and L.S. Davis, "Exact and Approximate Solutions of the Perspective Three-Point Problem," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 14, no, 11, pp. 1,100-1,105, 1992.
[12] M. Dhome, M. Richetin, J.T. Lapreste, and G. Rives, “Determination of the Attitude of 3D Objects from a Single Perspective View,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 11, no. 12, pp. 1,265-1,278, Dec. 1989.
[13] G.H. Rosenfield, “The Problem of Exterior Orientation in Photogrammetry,” Photogrammetric Eng., pp. 536-553, 1959.
[14] E.H. Tompson, “The Projective Theory of Relative Orientation,” Photogrammetria, pp. 67-75, 1968.
[15] R.M. Haralick and L.G. Shapiro, Computer and Robot Vision. New York: Addison-Wesley, 1993.
[16] D.G. Lowe, “Three-Dimensional Object Recognition from Single Two-Dimensional Images,” Artificial Intelligence, vol. 31, pp. 355-395, 1987.
[17] H. Araujo, R. Carceroni, and C. Brown, “A Fully Projective Formulation for Lowe's Tracking Algorithm,” Technical Report 641, Univ. of Rochester, 1996
[18] Y.I. Abdel-Aziz and H.M. Karara, “Direct Linear Transformation into Object Space Coordinates in Close-Range Photogrammetry,” Proc. Symp. Close-Range Photogrammetry, pp. 1-18, Jan. 1971.
[19] Y. Yakimovsky and R. Cunningham, “A System for Extracting Three-Dimensional Measurements from a Stereo Pair of TV Cameras,” Computer Graphics and Image Processing, vol. 7, pp. 195-210, 1978.
[20] O.D. Faugeras and G. Toscani, “Calibration Problem for Stereo,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 15-20, June 1986.
[21] R.Y. Tsai, “An Effecient and Accurate Camera Calibration Technique for 3D Machine Vision,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 364-374, 1986.
[22] R.K. Lenz and R.Y. Tsai, “Techniques for Calibration of the Scale Factor and Image Center for High Accuracy 3D Machine Vision Metrology,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 10, no. 5, pp. 713-720, Sept. 1988.
[23] D. Dementhon and L. Davis, "Model-Based Object Pose in 25 Lines of Code," Int'l J. Comp. Vision, vol. 15, pp. 123-141, 1995.
[24] T.D. Alter, “3D Pose from Corresponding Points under Weak-Perspective Projection,” Techical Report A.I. Memo No. 1,378, MIT Artificial Intelligence Lab., 1992.
[25] D.P. Huttenlocher and S. Ullman, “Recognizing Solid Objects by Alignment with an Image,” Int'l J. Computer Vision, vol. 5, no. 2, pp. 195-212, 1990.
[26] R. Horaud, S. Christy, and F. Dornaika, “Object Pose: The Link between Weak Perspective, Para Perspective and Full Perspective,” Techical Report RR-2356, INRIA, Sept. 1994.
[27] B.K.P. Horn, H.M. Hilden, and S. Negahdaripour, “Closed-Form Solution of Absolute Orientation Using Orthonomal Matrices,” J. Optical Soc. Am., vol. 5, pp. 1,127-1,135, 1988.
[28] R.M. Haralick, H. Joo, C.-N. Lee, X. Zhuang, and M.B. Kim, “Pose Estimation from Corresponding Point Data,” IEEE Trans. Systems, Man, and Cybernetics, vol. 19, no. 6, p. 1426, 1989.
[29] R.M. Haralick and L.G. Shapiro, Computer and Robot Vision. New York: Addison-Wesley, 1993.
[30] D.G. Lowe, "Fitting Parameterized Three-Dimensional Models to Images," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 13, no. 5, pp. 441-450, May 1991.
[31] J. Weng, N. Ahuja, and T. Huang, "Optimal Motion and Structure Estimation," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 15, no. 9, pp. 864-884, 1993.
[32] G.D. Hager, “Real-Time Feature Tracking and Projective Invariance as a Basis for Hand-Eye Coordination,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 533-539, IEEE CS Press, 1994.
[33] S. Wijesoma, D. Wolfe, and R. Richards, “Eye-to-Hand Coordination for Vision-Guided Robot Control Applications,” Int'l J. Robotics Research, vol. 12, no. 1, pp. 65-78, 1993.
[34] Robust Vision for Vision-Based Control of Motion. M. Vincze and G. Hager eds., 1999.
[35] O.D. Faugeras, Three-Dimensional Computer Vision: A Geometric Viewpoint.Cambridge, Mass.: MIT Press, 1993.
[36] B.K.P. Horn, “Closed-Form Solution of Absolute Orientation Uusing Unit Quaternion,” J. Optical Soc. Am., vol. 4, pp. 629-642, 1987.
[37] M.W. Walker, L. Shao, and R.A. Volz, “Estimating 3D Location Parameters Using Dual Number Quaternions,” CVGIP: Image Understanding, vol. 54, no. 3, pp. 358-367, 1991.
[38] K.S. Arun, T.S. Huang, and S.D. Blostein, "Least Squares Fitting of Two 3-(D) Point Sets," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 9, pp. 698-700, 1987.
[39] D.G. Luenberger, Linear and Nonlinear Programming. second ed. Reading, Mass.: Addison Wesley, 1984.
[40] H.P. Moravec, “Obstacle Avoidance and Navigation in the Real World by a Seeing Robot Rover,” PhD thesis, Stanford Univ., Sept. 1980. (published as Robot Rover Visual Navigation. Ann Arbor, MI: UMI Research Press, 1981.)
[41] S.M. Kiang, R.J. Chou, and J.K. Aggarwal, “Triangulation Errors in Stereo Algorithms,” Proc. IEEE Workshop Computer Vision, pp. 72-78, 1987.
[42] D. Kirk, Graphics Gems III, pp. 124-132. Academic Press, 1992.
[43] T. Huang and A. Netravali,“Motion and structure from feature correspondences: A review,” Proc. IEEE, vol. 82, pp. 252-268, Feb. 1994.
[44] R.H. Byrd and D.A. Pyne, “Convergence of the Iteratively Reweighted Least-Squares Algorithm for Robust Regression,” Technical Report No. 313, Dept. of Math. Science, The Johns Hopkins Univ., 1992.
[45] G.D. Hager and K. Toyama, “XVision: A Portable Substrate for Real-Time Vision Applications,” Computer Vision and Image Understanding, vol. 69, no. 1, Jan. 1998.
[46] C.-P. Lu, “Online Pose Estimation and Model Matching,” PhD thesis, Yale Univ., 1995.

Index Terms:
Pose estimation, absolute orientation, optimization,weak-perspective camera models, numerical optimization.
Chien-Ping Lu, Gregory D. Hager, Eric Mjolsness, "Fast and Globally Convergent Pose Estimation from Video Images," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 22, no. 6, pp. 610-622, June 2000, doi:10.1109/34.862199
Usage of this product signifies your acceptance of the Terms of Use.