| | This Article | |
| |
| |
| | Share | |
| |
| |
| | Bibliographic References | |
| |
| |
| | Add to: | |
| |
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
| |
| | Search | |
| |
| |
| | |
Monitoring Activities from Multiple Video Streams: Establishing a Common Coordinate Frame
August 2000 (vol. 22 no. 8)
pp. 758-767
Abstract—Monitoring of large sites requires coordination between multiple cameras, which in turn requires methods for relating events between distributed cameras. This paper tackles the problem of automatic external calibration of multiple cameras in an extended scene, that is, full recovery of their 3D relative positions and orientations. Because the cameras are placed far apart, brightness or proximity constraints cannot be used to match static features, so we instead apply planar geometric constraints to moving objects tracked throughout the scene. By robustly matching and fitting tracked objects to a planar model, we align the scene's ground plane across multiple views and decompose the planar alignment matrix to recover the 3D relative camera and ground plane positions. We demonstrate this technique in both a controlled lab setting where we test the effects of errors in the intrinsic camera parameters, and in an uncontrolled, outdoor setting. In the latter, we do not assume synchronized cameras and we show that enforcing geometric constraints enables us to align the tracking data in time. In spite of noise in the intrinsic camera parameters and in the image data, the system successfully transforms multiple views of the scene's ground plane to an overhead view and recovers the relative 3D camera and ground plane positions.
[1] 758 A. Azarbayejani and A.P. Pentland, “Real-Time Self-Calibrating Stereo Person Tracking Using 3D Shape Estimation from Blob Features,” Technical Report 363, Media Laboratory, Massachusetts Inst. Technology, Cambridge, Mass., Jan. 1996.[2] K. Bradshaw, I. Reid, and D. Murray, “The Active Recovery of 3D Motion Trajectories and Their Use in Prediction,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 19, no. 3, pp. 219-233, Mar. 1997.[3] T. Cham and R. Cipolla, “A Statistical Framework for Long-Range Feature Matching in Uncalibrated Image Mosaicing,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 442-447, 1998.[4] O.D. Faugeras, Three-Dimensional Computer Vision: A Geometric Viewpoint.Cambridge, Mass.: MIT Press, 1993.[5] W.E.L. Grimson, L. Lee, R. Romano, and C. Stauffer, “Using Adaptive Tracking to Classify and Monitor Activities in a Site,“ IEEE Proc. Computer Vision and Pattern Recognition, pp. 22-31, 1998.[6] S. Intille and A. Bobick, “Closed-World Tracking,” Proc. Fifth Int'l Conf. Computer Vision, pp 672-678, 1995.[7] M. Irani, B. Rousso, and S. Peleg, "Recovery of Ego-Motion Using Image Stabilization," IEEE Conf. Computer Vision and Pattern Recognition, pp. 454-460,Seattle, June 1994.[8] D.W. Murray and L.S. Shapiro, “Dynamic Updating of Planar Structure and Motion: The Case of Constant Motion,” Computer Vision and Image Understanding, vol. 63, no. 1, pp. 169-181, Jan. 1996.[9] R.M. Murray, Z. Li, and S.S. Sastry, A Mathematical Introduction to Robotic Manipulation. CRC Press, 1994.[10] C. Stauffer and W.E.L. Grimson, Adaptive Background Mixture Models for Real-Time Tracking Proc. IEEE Int'l Conf. Computer Vision and Pattern Recognition, pp. 246-252, 1999.[11] C. Stauffer and W.E.L. Grimson, Learning Patterns of Activity Using Real-Time Tracking IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 22, no. 8, pp. 747-757, Aug. 2000.[12] S. Sull and N. Ahuja, "Estimation of Motion and Structure of Planar Surfaces From a Sequence of Monocular Images," Proc. Conf. Computer Vision and Pattern Recognition, pp. 732-733, 1991.[13] R.Y. Tsai and T.S. Huang, “Analysis of 3D Time Varying Scene,” IBM RC 9,479, IBM Watson Research Center, Yorktown Heights, N.Y., 1982.[14] R. Tsai,T. Huang,, and W. Zhu,“Estimating three-dimensional motion parameters of a rigid planar patch, ii: Singular value decomposition,” IEEE Trans. Acoustics, Speech, and Signal Processing, vol. 30, no. 4, pp. 525-534, 1982.[15] J. Weng, N. Ahuja, and T. Huang, "Motion and Structure From Point Correspondences With Error Estimation: Planar Surfaces," IEEE Trans. Signal Processing, vol. 39, no. 12, pp. 2,691-2,716, 1991.[16] I. Zoghlami, O. Faugeras, and R. Deriche, “Using Geometric Corners to Build a 2D Mosaic from a Set of Images” Proc. Int'l Conf. Computer Vision and Pattern Recognition, June 1997.
Index Terms:
Video surveillance, multiple views, external camera calibration, planar motion, tracking.
Citation:
Lily Lee, Raquel Romano, Gideon Stein, "Monitoring Activities from Multiple Video Streams: Establishing a Common Coordinate Frame," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 22, no. 8, pp. 758-767, Aug. 2000, doi:10.1109/34.868678