This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Efficient Region Tracking With Parametric Models of Geometry and Illumination
October 1998 (vol. 20 no. 10)
pp. 1025-1039

Abstract—As an object moves through the field of view of a camera, the images of the object may change dramatically. This is not simply due to the translation of the object across the image plane. Rather, complications arise due to the fact that the object undergoes changes in pose relative to the viewing camera, changes in illumination relative to light sources, and may even become partially or fully occluded. In this paper, we develop an efficient, general framework for object tracking—one which addresses each of these complications. We first develop a computationally efficient method for handling the geometric distortions produced by changes in pose. We then combine geometry and illumination into an algorithm that tracks large image regions using no more computation than would be required to track with no accommodation for illumination changes. Finally, we augment these methods with techniques from robust statistics and treat occluded regions on the object as statistical outliers. Throughout, we present experimental results performed on live video sequences demonstrating the effectiveness and efficiency of our methods.

[1] P. Allen, B. Yoshimi, and A. Timcenko, "Hand-Eye Coordination for Robotics Tracking and Grasping," K. Hashimoto, ed., Visual Serving, pp. 33-70. World Scientific, 1994.
[2] S. Hutchinson, G.D. Hager, and P. Corke, "A Tutorial Introduction to Visual Servo Control," IEEE Trans. Robotics and Automation, vol. 12, no. 5, pp. 651-670, 1996.
[3] N. Papanikolopoulos, P. Khosla, and T. Kanade, "Visual Tracking of a Moving Target by a Camera Mounted on a Robot: A Combination of Control and Vision," IEEE Trans. Robotics and Automation, vol. 9, no. 1, pp. 14-35, 1993.
[4] E. Dickmanns and V. Graefe, "Dynamic Monocular Machine Vision," Machine Vision and Applications, vol. 1, pp. 223-240, 1988.
[5] A. Bobick and A. Wilson, “A State-Based Technique for the Summarization and Recognition of Gesture,” Proc. Fifth Int'l Conf. Computer Vision, pp. 382-388, 1995.
[6] T. Darrell, B. Moghaddam, and A. Pentland, "Active Face Tracking and Pose Estimation in an Interactive Room," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 67-72, 1996.
[7] D. Gavrila and L. Davis, "Tracking Humans in Action: A 3D Model-Based Approach," Proc. Image Understanding Workshop, pp. 737-746, 1996.
[8] R. Howarth and H. Buxton, "Visual Surveillance Monitoring and Watching," Proc. European Conf. Computer Vision, vol. 2, pp. 321-334, 1996.
[9] T. Frank, M. Haag, H. Kollnig, and H.-H. Nagel, "Tracking of Occluded Vehicles in Traffic Scenes," Proc. European Conf. Computer Vision, vol. 2, pp. 485-494, 1996.
[10] R.C. Harrell, D.C. Slaughter, and P.D. Adsit, "A Fruit-Tracking System for Robotic Harvesting," Machine Vision and Applications, vol. 2, pp. 69-80, 1989.
[11] D. Reynard, A. Wildenberg, A. Blake, and J. Marchant, "Learning Dynamics of Complex Motions From Image Sequences," Proc. European Conf. Computer Vision, vol. 1, pp. 357-368, 1996.
[12] E. Bardinet, L. Cohen, and N. Ayache, "Tracking Medical 3D Data With a Deformable Parametric Model," Proc. European Conf. Computer Vision, vol. 1, pp. 317-328, 1996.
[13] P. Shi, G. Robinson, T. Constable, A. Sinusas, and J. Duncan, "A Model-Based Integrated Approach to Track Myocardial Deformation Using Displacement and Velocity Constraints," Proc. Int'l Conf. Computer Vision, pp. 687-692, 1995.
[14] E. Boyer, "Object Models From Contour Sequences," Proc. European Conf. Computer Vision, vol. 2, pp. 109-118, 1996.
[15] L. Shapiro, Affine Analysis of Image Sequences.Cambridge, England: Cambridge Univ. Press, 1995.
[16] C. Tomasi and T. Kanade, "Shape and Motion From Image Streams Under Orthography: A Factorization Method," Int'l J. Computer Vision, vol. 9, no. 2, pp. 137-154, 1992.
[17] P. Hallinan, "A Low-Dimensional Representation of Human Faces for Arbitrary Lighting Conditions," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 1994, pp. 995-999.
[18] R. Epstein, P. Hallinan, and A. Yuille, "5±2 Eigenimages Suffice: An Empirical Investigation of Low-Dimensional Lighting Models," Technical Report 94-11, Harvard Univ., 1994.
[19] P.N. Belhumeur and D.J. Kriegman, "What is the Set of Images of an Object under all Possible Lighting Conditions?," IEEE Proc. Conf. Computer Vision and Pattern Recognition, 1996.
[20] R. Dutter and P. Huber, "Numerical Methods for the Nonlinear Robust Regression Problem," J. Statistical Computer Simulation, vol. 13, no. 2, pp. 79-113, 1981.
[21] B.D. Lucas and T. Kanade, "An Iterative Image Registration Technique With an Application to Stereo Vision," Proc. Int'l Joint Conf. Artificial Intelligence, pp. 674-679, 1981.
[22] P. Anandan, "A Computational Framework and an Algorithm for the Measurement of Structure From Motion," Int'l J. Computer Vision, vol. 2, pp. 283-310, 1989.
[23] J. Shi and C. Tomasi, Good Features to Track Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 593-600, 1994.
[24] J. Rehg and T. Kanade, "Visual Tracking of High DOF Articulated Structures: An Application to Human Hand Tracking," Proc. European Conf. Computer Vision, vol. B, pp. 35-46, 1994.
[25] C. Bregler, “Learning and Recognizing Human Dynamics in Video Sequences,” Proc. Conf. Computer Vision and Pattern Recognition, 1997.
[26] J.M. Rehg and A.P. Witkin, “Visual Tracking with Deformation Models,” Proc. Int'l Conf. Robotics and Automation, pp. 844-850, 1991.
[27] M.J. Black and Y. Yacoob, "Tracking and Recognizing Rigid and Non-Rigid Facial Motions Using Local Parametric Model of Image Motion," Proc. Int'l Conf. Computer Vision, pp. 374-381,Cambridge, Mass., 1995.
[28] H. Murase and S. Nayar, "Visual Learning and Recognition of 3-D Objects From Appearance," Int'l J. Computer Vision, vol. 14, pp. 5-24, 1995.
[29] M. Black and A. Jepson, "Eigentracking: Robust Matching and Tracking of Articulated Objects Using a View-Based Representation," Proc. European Conf. Computer Vision, pp. 329-342, 1996.
[30] M. Isard and A. Blake, "Contour Tracking by Stochastic Propagation of Conditional Density," European Conf. on Computer Vision, vol. 1, pp. 343-356, 1996.
[31] D.G. Lowe, "Robust Model-Based Motion Tracking Through the Integration of Search and Estimation," Int'l J. Computer Vision, vol. 8, no. 2, pp. 113-122, 1992.
[32] D.B. Gennery, "Visual Tracking of Known Three-Dimensional Objects," Int'l J. Computer Vision, vol. 7, no. 3, pp. 243-270, 1992.
[33] A. Blake, R. Curwen, and A. Zisserman, "A Framework for Spatio-Temporal Control in the Tracking of Visual Contour," Int'l J. Computer Vision, vol. 11, no. 2, pp. 127-145, 1993.
[34] B. Horn, Computer Vision.Cambridge, Mass.: MIT Press, 1986.
[35] M. Betke and N. Makris, Fast Object Recognition in Noisy Images Using Simulated Annealing Proc. Fifth Int'l Conf. Computer Vision, pp. 523-530, 1995.
[36] R. Szeliski, “Image Mosaicing for Tele-Reality Applications,” IEEE Computer Graphics and Applications, 1996.
[37] A. Shashua, "Geometry and Photometry in 3D Visual Recognition," PhD thesis, Massachusetts Institute of Tech nology, 1992.
[38] R. Woodham, "Analysing Images of Curved Surfaces," Artificial Intelligence, vol. 17, pp. 117-140, 1981.
[39] P. Huber, Robust Statistics.New York: John Wiley&Sons, 1981.
[40] R.M. Haralick and L.G. Shapiro, Computer and Robot Vision.Reading, Mass.: Addison Wesley, 1993.
[41] G.D. Hager and K. Toyama, "XVision: A Portable Substrate for Real-Time Vision Applications," Computer Vision and Image Understanding, vol. 69, no. 1, pp. 23-37, 1998.
[42] S. McKenna, S. Gong, and J. Collins, "Face Tracking and Pose Representation," British Machine Vision Conf., 1996.
[43] P. Belhumeur and G.D. Hager, "Tracking in 3D: Image Variability Decomposition for Recovering Object Pose and Illumination," Proc. Int'l Conf. Pattern Analysis Applications, 1998. Also available as Yale Computer Science #1141.
[44] S. Ullman and R. Basri, "Recognition by Linear Combinations of Models," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 13, pp. 992-1006, 1991.

Index Terms:
Visual tracking, real-time vision, illumination, motion estimation, robust statistics.
Citation:
Gregory D. Hager, Peter N. Belhumeur, "Efficient Region Tracking With Parametric Models of Geometry and Illumination," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 20, no. 10, pp. 1025-1039, Oct. 1998, doi:10.1109/34.722606
Usage of this product signifies your acceptance of the Terms of Use.