Computer Vision, IEEE International Conference on (2013)
Sydney, Australia Australia
Dec. 1, 2013 to Dec. 8, 2013
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICCV.2013.330
In many cases, the predictive power of structured models for for complex vision tasks is limited by a trade-off between the expressiveness and the computational tractability of the model. However, choosing this trade-off statically a priori is sub optimal, as images and videos in different settings vary tremendously in complexity. On the other hand, choosing the trade-off dynamically requires knowledge about the accuracy of different structured models on any given example. In this work, we propose a novel two-tier architecture that provides dynamic speed/accuracy trade-offs through a simple type of introspection. Our approach, which we call dynamic structured model selection (DMS), leverages typically intractable features in structured learning problems in order to automatically determine' which of several models should be used at test-time in order to maximize accuracy under a fixed budgetary constraint. We demonstrate DMS on two sequential modeling vision tasks, and we establish a new state-of-the-art in human pose estimation in video with an implementation that is roughly 23× faster than the previous standard implementation.
pose estimation, structured prediction
D. Weiss, B. Sapp and B. Taskar, "Dynamic Structured Model Selection," 2013 IEEE International Conference on Computer Vision (ICCV), Sydney, Australia, 2013, pp. 2656-2663.