This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Statistical Analysis of Dynamic Actions
September 2006 (vol. 28 no. 9)
pp. 1530-1535
Real-world action recognition applications require the development of systems which are fast, can handle a large variety of actions without a priori knowledge of the type of actions, need a minimal number of parameters, and necessitate as short as possible learning stage. In this paper, we suggest such an approach. We regard dynamic activities as long-term temporal objects, which are characterized by spatio-temporal features at multiple temporal scales. Based on this, we design a simple statistical distance measure between video sequences which captures the similarities in their behavioral content. This measure is nonparametric and can thus handle a wide range of complex dynamic actions. Having a behavior-based distance measure between sequences, we use it for a variety of tasks, including: video indexing, temporal segmentation, and action-based video clustering. These tasks are performed without prior knowledge of the types of actions, their models, or their temporal extents.

[1] Z. Bar-Joseph, R. El-Yaniv, D. Lischinski, and M. Werman, “Texture Mixing and Texture Movie Synthesis Using Statistical Learning,” IEEE Trans. Visualization and Computer Graphics, 2001.
[2] M.J. Black and Y. Yacoob, “Recognizing Facial Expressions in Image Sequences Using Local Parametrized Models of Image Motion,” Int'l J. Computer Vision, vol. 25, no. 1, pp. 23-48, 1997.
[3] A. Bobick and J. Davis, “The Representation and Recognition of Action Using Temporal Templates,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 23, no. 3, pp. 257-267, Mar. 2001.
[4] G. Bradski and J. Davis, “Motion Segmentation and Pose Recognition with Motion History Gradients,” Int'l J. Machine Vision and Applications, vol. 13, no. 3, pp. 174-184, 2002.
[5] O. Chomat and J.L. Crowley, “Probabilistic Recognition of Activity Using Local Appearance,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, June 1999.
[6] R. Cutler and L. Davis, “Robust Real-Time Periodic Motion Detection, Analysis, and Applications,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 22, no. 8, pp. 781-796, Aug. 2000.
[7] R.O. Duda, P.E. Hart, and D.G. Stork, Pattern Classification. New York: John Wiley and Sons, Inc., 2001.
[8] A. Efros, A. Berg, G. Mori, and J. Malik, “Recognizing Action at a Distance,” Proc. Int'l Conf. Computer Vision, vol. 2, pp. 726-733, Oct. 2003.
[9] D.M. Gavrila and L.S. Davis, “3-D Model-Based Tracking of Humans in Action: A Multi-View Approach,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, 1996.
[10] M. Irani, B. Rousso, and S. Peleg, “Computing Occluding and Transparent Motions,” Int'l J. Computer Vision, vol. 12, pp. 5-16, Feb. 1994.
[11] S.X. Ju, M.J. Black, and Y. Yacoob, “Cardboard People: A Parametrized Model of Articulated Image Motion,” Proc. Second Int'l Conf. Automatic Face and Gesture Recognition, pp. 38-44, Oct. 1996.
[12] F. Liu and R.W. Picard, “Finding Periodicity in Space and Time,” Proc. Int'l Conf. Computer Vision, Jan. 1998.
[13] C.W. Ngo, T.C. Pong, H. Zhang, and R.T. Chin, “Detection of Gradual Transitions through Temporal Slices Analysis,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 36-41, June 1999.
[14] S.A. Niyogi and E.H. Adelson, “Analyzing and Recognizing Walking Figures in XYT,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, June 1994.
[15] S.A. Niyogi and E.H. Adelson, “Analyzing Gait with Spatiotemporal Surfaces,” Proc. Workshop Non-Rigid Motion and Articulated Objects, Nov. 1994.
[16] R. Polana and R. Nelson, “Recognition of Motion from Temporal Texture,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, June 1992.
[17] R. Polana and R. Nelson, “Detecting Activities,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, June 1993.
[18] A. Ng, M. Jordan, and Y. Weiss, “On Spectral Clustering: Analysis and an Algorithm,” Advances in Neural Information Processing Systems 14, 2001.
[19] Y. Rui and P. Anandan, “Segmenting Visual Actions Based on Spatio-Temporal Motion Patterns,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, June 2000.
[20] P. Saisan, G. Doretto, S. Soatto, and Y.N. Wu, “Dynamic Texture Recognition,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, Dec. 2001.
[21] B. Schiele and J.L. Crowley, “Probabilistic Object Recognition Using Multidimensional Receptive Field Histograms,” Proc. Int'l Conf. Pattern Recognition, Aug. 1996.
[22] M. Szummer and R.W. Picard, “Temporal Texture Modeling,” Proc. Int'l Conf. on Image Processing, Sept. 1996.
[23] Y. Yacoob and M.J. Black, “Parametrized Modeling and Recognition of Activities,” Computer Vision and Image Understanding, vol. 73, no. 2, pp. 232-247, 1999.
[24] L. Zelnik-Manor and M. Irani, “Event-Based Analysis of Video,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. II, pp. 123-130, Dec. 2001.
[25] H. Zhang, A. Kankanhali, and W. Smoliar, “Automatic Partitioning of Full-Motion Video,” Multimedia Systems, 1993.

Index Terms:
Action recognition, video indexing, temporal segmentation.
Citation:
Lihi Zelnik-Manor, Michal Irani, "Statistical Analysis of Dynamic Actions," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 28, no. 9, pp. 1530-1535, Sept. 2006, doi:10.1109/TPAMI.2006.194
Usage of this product signifies your acceptance of the Terms of Use.