Pattern Recognition, International Conference on (2004)
Aug. 23, 2004 to Aug. 26, 2004
Orkun Alatas, University of Central Florida, Orlando, FL
Omar Javed, University of Central Florida, Orlando, FL
Mubarak Shah, University of Central Florida, Orlando, FL
The contents of a video can be described in terms of the appearance and motion of its scenes. In this paper, we propose a compressed spatio-temporal descriptor that is suitable for video matching and retrieval tasks. We use a modified wavelet-based compression technique that exploits the temporal redundancy of the data using optical flow. To achieve a compact flow representation, a spline-based technique is used. The optical flow field gives the directions along which the gray levels vary regularly over time. Wavelet decomposition along these directions results in fewer coefficients and thus higher compression. We demonstrate that the wavelet coefficients and flow parameters can be used efficiently for 1) video retrieval and matching, and 2) calculating spatio-temporal similarity between articulated objects. The results are demonstrated on several sequences.
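The claim that decomposing along the flow directions yields fewer coefficients can be illustrated with a toy example. The sketch below is not the method of the paper: it assumes a one-level temporal Haar detail band, 1-D frames, and a known integer flow, and the names `temporal_haar_detail` and `count_nonzero` are hypothetical helpers introduced only for illustration.

```python
def temporal_haar_detail(frame0, frame1, flow):
    """One-level temporal Haar detail band between two 1-D frames.

    flow[x] is the (assumed integer) displacement of pixel x from
    frame0 to frame1. Targets outside the frame are clamped to the border.
    """
    n = len(frame0)
    detail = []
    for x in range(n):
        target = min(max(x + flow[x], 0), n - 1)  # clamp to the frame
        detail.append((frame1[target] - frame0[x]) / 2.0)
    return detail


def count_nonzero(coeffs, eps=1e-9):
    """Number of coefficients that would have to be stored."""
    return sum(1 for c in coeffs if abs(c) > eps)


# Toy sequence: a pattern translating right by 2 pixels between frames.
frame0 = [0, 0, 3, 7, 5, 1, 0, 0, 0, 0]
frame1 = [0, 0, 0, 0, 3, 7, 5, 1, 0, 0]

zero_flow = [0] * len(frame0)  # decompose straight along the time axis
true_flow = [2] * len(frame0)  # decompose along the motion direction

plain = temporal_haar_detail(frame0, frame1, zero_flow)
compensated = temporal_haar_detail(frame0, frame1, true_flow)

print(count_nonzero(plain), count_nonzero(compensated))
```

In this toy case the detail band taken along the time axis has nonzero entries wherever the moving pattern overlaps either frame, while the band taken along the flow vanishes because the gray levels are constant along the motion trajectory. Real sequences have sub-pixel, spatially varying flow, so the gain would be smaller and would depend on the quality of the flow estimate.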
M. Shah, O. Javed and O. Alatas, "Compressed Spatio-temporal Descriptors for Video Matching and Retrieval," Pattern Recognition, International Conference on (ICPR), Cambridge, UK, 2004, pp. 882-885.