2009 Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance (2009)
Sept. 2, 2009 to Sept. 4, 2009
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/AVSS.2009.66
Human action recognition is a significant task in automatic understanding systems for video surveillance. Probabilistic Latent Semantic Analysis (PLSA) model has been used to learn and recognize human actions in videos. Specifically, PLSA employs the expectation maximization (EM) algorithm for parameter estimation during the training. The EM algorithm is an iterative estimation scheme that is guaranteed to find a local maximum of the likelihood function. However its convergence usually takes a large number of iterations. For action recognition with large amount of training data, this would result in long training time. This paper presents an incremental version of EM to speed up the training of PLSA without sacrificing performance accuracy. The proposed algorithm is tested on two challenging human action datasets. Experimental results demonstrate that the proposed algorithm converges with fewer number of full passes compared with the batch EM algorithm. And the trained PLSA models achieve comparable or better recognition accuracies than those using batch EM training.
Incremental EM, PLSA
Y. Wang, G. Ye, B. Zhang, G. Herman, J. Xu and J. Yang, "Incremental EM for Probabilistic Latent Semantic Analysis on Human Action Recognition," 2009 Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance(AVSS), Genova, Italy, 2009, pp. 55-60.