The Community for Technology Leaders
Green Image
Issue No. 03 - March (2013 vol. 35)
ISSN: 0162-8828
pp: 697-715
R. Chellappa , Center for Autom. Res., Univ. of Maryland, College Park, MD, USA
R. Li , Zickler Group, Harvard Univ., Cambridge, MA, USA
We investigate the problem of spatiotemporal alignment of videos, signals, or feature sequences extracted from them. Specifically, we consider the scenario where the spatiotemporal misalignments can be characterized by parametric transformations. Using a nonlinear analytical structure referred to as an alignment manifold, we formulate the alignment problem as an optimization problem on this nonlinear space. We focus our attention on semantically meaningful videos or signals, e.g., those describing or capturing human motion or activities, and propose a new formalism for temporal alignment accounting for executing rate variations among instances of the same video event. The strategy taken in this effort bridges the family of geometric optimization and the family of stochastic algorithms: We regard the search for optimal alignment parameters as a recursive state estimation problem for a particular dynamic system evolving on the alignment manifold. Subsequently, a Sequential Importance Sampling procedure on the alignment manifold is designed for effective alignment. We further extend the basic Sequential Importance Sampling algorithm into a new version called Stochastic Gradient Sequential Importance Sampling, in which we incorporate a steepest descent structure on the alignment manifold and provide a more efficient particle propagation mechanism. We demonstrate the performance of alignment using manifolds on several types of input data that arise in vision problems.
Manifolds, Cameras, Videos, Optimization, Stochastic processes, Heuristic algorithms, Algorithm design and analysis, geometric methods, Spatiotemporal alignment, video matching, stochastic optimization
R. Chellappa, R. Li, "Spatiotemporal Alignment of Visual Signals on a Special Manifold", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 35, no. , pp. 697-715, March 2013, doi:10.1109/TPAMI.2012.144
193 ms
(Ver 3.3 (11022016))