loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR '03) - Volume 2
Expectation Grammars: Leveraging High-Level Expectations for Activity Recognition
Madison, Wisconsin
June 18-June 20
ISBN: 0-7695-1900-8
David Minnen, Georgia Institute of Technology
Irfan Essa, Georgia Institute of Technology
Thad Starner, Georgia Institute of Technology
Video-based recognition and prediction of a temporally extended activity can benefit from a detailed description of high-level expectations about the activity. Stochastic grammars allow for an efficient representation of such expectations and are well-suited for the specification of temporally well-ordered activities. In this paper, we extend stochastic grammars by adding event parameters, state checks, and sensitivity to an internal scene model. We present an implemented system that uses human-specified grammars to recognize a person performing the Towers of Hanoi task from a video sequence by analyzing object interaction events. Experimental results from several videos show robust recognition of the full task and its constituent subtasks even though no appearance models of the objects in the video are provided. These experiments include videos of the task performed with different shaped objects and with distracting and extraneous interactions.
Citation:
David Minnen, Irfan Essa, Thad Starner, "Expectation Grammars: Leveraging High-Level Expectations for Activity Recognition," cvpr, vol. 2, pp.626, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR '03) - Volume 2, 2003
Usage of this product signifies your acceptance of the Terms of Use.