Issue No. 05 - May (2014 vol. 36)
Jiang Wang, EECS Dept., Northwestern Univ., Evanston, IL, USA
Zicheng Liu, Microsoft Res., Redmond, WA, USA
Ying Wu, EECS Dept., Northwestern Univ., Evanston, IL, USA
Junsong Yuan, Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore, Singapore
Human action recognition is an important yet challenging task. Human actions usually involve human-object interactions, highly articulated motions, high intra-class variations, and complicated temporal structures. Recently developed commodity depth sensors open up new possibilities for dealing with this problem by providing 3D depth data of the scene. This information not only facilitates a rather powerful human motion capturing technique, but also makes it possible to efficiently model human-object interactions and intra-class variations. In this paper, we propose to characterize human actions with a novel actionlet ensemble model, which represents the interaction of a subset of human joints. The proposed model is robust to noise, invariant to translational and temporal misalignment, and capable of characterizing both the human motion and the human-object interactions. We evaluate the proposed approach on three challenging action recognition datasets captured by Kinect devices, a multiview action recognition dataset captured with a Kinect device, and a dataset captured by a motion capture system. The experimental evaluations show that the proposed approach achieves superior performance to the state-of-the-art algorithms.
sensors, human computer interaction, learning (artificial intelligence), motion compensation, motion capture system, learning actionlet ensemble, 3D human action recognition, human-object interactions, temporal structures, commodity depth sensors, Kinect device, Joints, Three-dimensional displays, Hidden Markov models, Robustness, Noise, Feature extraction, Gesture, Computer vision, Video analysis, human-object interaction, Action recognition, Kinect, ensemble method, human pose
Jiang Wang, Zicheng Liu, Ying Wu, Junsong Yuan, "Learning Actionlet Ensemble for 3D Human Action Recognition", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 36, no. 5, pp. 914-927, May 2014, doi:10.1109/TPAMI.2013.198