A System for Learning Statistical Motion Patterns
September 2006 (vol. 28 no. 9)
pp. 1450-1464
Analysis of motion patterns is an effective approach for anomaly detection and behavior prediction. Current approaches for the analysis of motion patterns depend on known scenes, where objects move in predefined ways. It is highly desirable to automatically construct object motion patterns which reflect the knowledge of the scene. In this paper, we present a system for automatically learning motion patterns for anomaly detection and behavior prediction based on a proposed algorithm for robustly tracking multiple objects. In the tracking algorithm, foreground pixels are clustered using a fast accurate fuzzy K-means algorithm. Growing and prediction of the cluster centroids of foreground pixels ensure that each cluster centroid is associated with a moving object in the scene. In the algorithm for learning motion patterns, trajectories are clustered hierarchically using spatial and temporal information and then each motion pattern is represented with a chain of Gaussian distributions. Based on the learned statistical motion patterns, statistical methods are used to detect anomalies and predict behaviors. Our system is tested using image sequences acquired, respectively, from a crowded real traffic scene and a model traffic scene. Experimental results show the robustness of the tracking algorithm, the efficiency of the algorithm for learning motion patterns, and the encouraging performance of algorithms for anomaly detection and behavior prediction.
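As an illustration of the idea of representing a motion pattern with a chain of Gaussian distributions, the following is a minimal sketch and not the method of the paper. It assumes that the trajectories of one pattern have already been grouped and resampled to the same number of points, that each point in the chain is modeled by an axis-aligned 2D Gaussian, and that a trajectory is flagged as anomalous when its mean squared normalized distance to the chain exceeds a fixed threshold. The function names, the variance floor, and the threshold value are all hypothetical choices made for this example.

```python
def fit_gaussian_chain(trajectories):
    """Fit one axis-aligned 2D Gaussian per time step.

    Assumes every trajectory is a list of (x, y) points of equal length.
    Returns a list of ((mean_x, mean_y), (var_x, var_y)) tuples.
    """
    n_steps = len(trajectories[0])
    chain = []
    for t in range(n_steps):
        xs = [traj[t][0] for traj in trajectories]
        ys = [traj[t][1] for traj in trajectories]
        mx = sum(xs) / len(xs)
        my = sum(ys) / len(ys)
        # Variance floor (assumed value) avoids division by zero later.
        vx = max(sum((x - mx) ** 2 for x in xs) / len(xs), 1e-6)
        vy = max(sum((y - my) ** 2 for y in ys) / len(ys), 1e-6)
        chain.append(((mx, my), (vx, vy)))
    return chain


def mean_sq_distance(chain, trajectory):
    """Mean squared variance-normalized distance of a trajectory to the chain."""
    total = 0.0
    for ((mx, my), (vx, vy)), (x, y) in zip(chain, trajectory):
        total += (x - mx) ** 2 / vx + (y - my) ** 2 / vy
    return total / len(chain)


def is_anomalous(chain, trajectory, threshold=9.0):
    """Flag a trajectory whose distance to the chain exceeds the threshold."""
    return mean_sq_distance(chain, trajectory) > threshold


# Three roughly horizontal training trajectories (made-up data).
training = [
    [(0, 0.0), (1, 0.1), (2, 0.0)],
    [(0, 0.2), (1, 0.0), (2, 0.2)],
    [(0, -0.2), (1, -0.1), (2, -0.2)],
]
chain = fit_gaussian_chain(training)

print(is_anomalous(chain, [(0, 0.1), (1, 0.05), (2, 0.1)]))  # False
print(is_anomalous(chain, [(0, 0.0), (1, 2.0), (2, 4.0)]))   # True
```

A trajectory that stays close to the learned means scores low, while one that drifts away from the pattern scores high. A real system would also need the clustering and temporal alignment steps that this sketch takes as given.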

Index Terms:
Tracking multiple objects, learning statistical motion patterns, anomaly detection, behavior understanding.
Citation:
Weiming Hu, Xuejuan Xiao, Zhouyu Fu, Dan Xie, Tieniu Tan, Steve Maybank, "A System for Learning Statistical Motion Patterns," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 28, no. 9, pp. 1450-1464, Sept. 2006, doi:10.1109/TPAMI.2006.176