loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Learning Patterns of Activity Using Real-Time Tracking
August 2000 (vol. 22 no. 8)
pp. 747-757

Abstract—Our goal is to develop a visual monitoring system that passively observes moving objects in a site and learns patterns of activity from those observations. For extended sites, the system will require multiple cameras. Thus, key elements of the system are motion tracking, camera coordination, activity classification, and event detection. In this paper, we focus on motion tracking and show how one can use observed motion to learn patterns of activity in a site. Motion segmentation is based on an adaptive background subtraction method that models each pixel as a mixture of Gaussians and uses an on-line approximation to update the model. The Gaussian distributions are then evaluated to determine which are most likely to result from a background process. This yields a stable, real-time outdoor tracker that reliably deals with lighting changes, repetitive motions from clutter, and long-term scene changes. While a tracking system is unaware of the identity of any object it tracks, the identity remains the same for the entire tracking sequence. Our system leverages this information by accumulating joint co-occurrences of the representations within a sequence. These joint co-occurrence statistics are then used to create a hierarchical binary-tree classification of the representations. This method is useful for classifying sequences, as well as individual instances of activities in a site.

[1] 747 D. Beymer, P. McLauchlan, B. Coifman, and J. Malik, “A Real-Time Computer Vision System for Measuring Traffic Parameters,” Proc. Computer Vision and Pattern Recognition, June 1997.[2] R. Collins, A. Lipton, and T. Kanade, “A System for Video Surveillance and Monitoring,” Proc. Am. Nuclear Soc. (ANS) Eighth Int'l Topical Meeting Robotic and Remote Systems, Apr. 1999.[3] L. Davis et al., “Visual Surveillance of Human Activity,” Proc. Asian Conf. Computer Vision (ACCV98), Jan. 1998.[4] A Dempster, N. Laird, and D. Rubin, “Maximum Likelihood from Incomplete Data via the EM Algorithm,” J. Royal Statistical Soc., vol. 39(Series B), pp. 1-38, 1977.[5] N. Friedman and S. Russell, “Image Segmentation in Video Sequences: A Probabilistic Approach,” Proc. 13th Conf. Uncertainty in Artificial Intelligence (UAI), Aug. 1997.[6] A. Gersho and R.M. Gray, Vector Quantization and Signal Compression. Boston: Kluwer Academic, 1992.[7] W.E.L. Grimson, L. Lee, R. Romano, and C. Stauffer, “Using Adaptive Tracking to Classify and Monitor Activities in a Site,“ IEEE Proc. Computer Vision and Pattern Recognition, pp. 22-31, 1998.[8] P. Horn and B. Klaus, Robot Vision, MIT Press, Cambridge, Mass., 1986.[9] I. Haritaoglu, D. Harwood, and L.S. Davis, “$W^4$: Who? When? Where? What? A Real Time System for Detecting and Tracking People,” Proc. Third Int'l Conf. Automatic Face and Gesture Recognition, Apr. 1998.[10] I. Horswill and M. Yamamoto, “A $1000 Active Stereo Vision System,” Proc. IEEE/IAP Workshop Visual Behaviors, Aug. 1994.[11] I. Horswill, “Visual Routines and Visual Search: A Real-Time Implementation and Automata-Theoretic Analysis,” Proc. Int'l Joint Conf. AI, 1995.[12] Y. Ivanov, A. Bobick, and J. Liu, “Fast Lighting Independent Background Subtraction,” Technical Report no. 437, MIT Media Laboratory, 1997.[13] N. Johnson and D.C. Hogg, “Learning the Distribution of Object Trajectories for Event Recognition,” Proc. British Machine Vision Conf., D. Pycock, ed., pp. 583-592, Sept. 1995.[14] N. Johnson and D.C. Hogg, “Learning the Distribution of Object Trajectories for Event Recognition,” Image and Vision Computing, vol. 14, no. 8, pp. 609-615, Aug. 1996.[15] T. Kanade, “A Stereo Machine for Video-Rate Dense Depth Mapping and Its New Applications,” Proc. Image Understanding Workshop, pp. 805-811, Feb. 1995.[16] D. Koller, J. Weber, T. Huang, J. Malik, G. Ogasawara, B. Rao, and S. Russell, "Towards robust automatic traffic scene analysis in real-time," Proc. Int'l Conf. Pattern Recognition,Jerusalem, Israel, pp. 126-131, 1994.[17] K. Konolige, “Small Vision Systems: Hardware and Implementation,” Proc. Eighth Int'l Symp. Robotics Research, Oct. 1997.[18] L. Lee, R. Romano, and G. Stein, Monitoring Activities from Multiple Video Streams: Establishing a Common Coordinate Frame IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 22, no. 8, pp. 758-767, Aug. 2000.[19] A.J. Lipton, H. Fujiyoshi, and R.S. Patil, “Moving Target Classification and Tracking from Real Time Video,” Proc. Fourth IEEE Workshop Applications of Computer Vision '98, pp. 8-14, 1998.[20] N. Oliver, B. Rosario, and A. Pentland, “A Bayesian Computer Vision System for Modeling Human Interactions,” Proc. Int'l Conf. Vision Systems '99, Jan. 1999.[21] C. Ridder, O. Munkelt, and H. Kirchner, “Adaptive Background Estimation and Foreground Detection Using Kalman-Filtering,” Proc. Int'l Conf. Recent Advances in Mechatronics, ICRAM '95, pp. 193-199, 1995.[22] J. Shi and J. Malik, “Normalized Cuts and Image Segmenation,” Proc. IEEE CS Conf. Computer Vision and Pattern Recognition, pp. 731-737, June 1997.[23] C. Stauffer and W.E.L. Grimson, Adaptive Background Mixture Models for Real-Time Tracking Proc. IEEE Int'l Conf. Computer Vision and Pattern Recognition, pp. 246-252, 1999.[24] C. Wren, A. Azarbayejani, T. Darrell, and A.P. Pentland, Pfinder: Real-Time Tracking of the Human Body IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 19, no. 7, pp. 780-785, July 1997.

Index Terms:
Real-time visual tracking, adaptive background estimation, activity modeling, co-occurrence clustering, object recognition, video surveillance and monitoring (VSAM).
Citation:
Chris Stauffer, W. Eric L. Grimson, "Learning Patterns of Activity Using Real-Time Tracking," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 22, no. 8, pp. 747-757, Aug. 2000, doi:10.1109/34.868677
Usage of this product signifies your acceptance of the Terms of Use.