The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.11 - Nov. (2012 vol.34)
pp: 2282-2288
M. Eichner , Comput. Vision Lab., ETH Zurich, Zurich, Switzerland
V. Ferrari , IPAB Inst., Univ. of Edinburgh, Edinburgh, UK
ABSTRACT
Most existing techniques for articulated Human Pose Estimation (HPE) consider each person independently. Here we tackle the problem in a new setting, coined Human Pose Coestimation (PCE), where multiple people are in a common, but unknown pose. The task of PCE is to estimate their poses jointly and to produce prototypes characterizing the shared pose. Since the poses of the individual people should be similar to the prototype, PCE has less freedom compared to estimating each pose independently, which simplifies the problem. We demonstrate our PCE technique on two applications. The first is estimating the pose of people performing the same activity synchronously, such as during aerobics, cheerleading, and dancing in a group. We show that PCE improves pose estimation accuracy over estimating each person independently. The second application is learning prototype poses characterizing a pose class directly from an image search engine queried by the class name (e.g., “lotus pose”). We show that PCE leads to better pose estimation in such images, and it learns meaningful prototypes which can be used as priors for pose estimation in novel images.
INDEX TERMS
Prototypes, Estimation, Synchronization, Kinematics, Detectors, Humans, Computational modeling, object detection, Human pose estimation, articulated objects, multiple image correspondence
CITATION
M. Eichner, V. Ferrari, "Human Pose Co-Estimation and Applications", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol.34, no. 11, pp. 2282-2288, Nov. 2012, doi:10.1109/TPAMI.2012.85
REFERENCES
[1] M. Andriluka, S. Roth, and B. Schiele, "Pictorial Structures Revisited: People Detection and Articulated Pose Estimation," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2009.
[2] M. Andriluka, S. Roth, and B. Schiele, "Monocular 3D Pose Estimation and Tracking by Detection," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2010.
[3] T. Berg, A. Berg, J. Edwards, and D. Forsyth, "Who's in the Picture," Proc. Neural Information Processing Systems, 2004.
[4] S. Boyd and L. Vandenberghe, Convex Optimization. Cambridge Univ. Press, 2004.
[5] P. Buehler, M. Everinghan, D. Huttenlocher, and A. Zisserman, "Long Term Arm and Hand Tracking for Continuous Sign Language TV Broadcasts," Proc. British Machine Vision Conf. , 2008.
[6] M. Eichner and V. Ferrari, "Better Appearance Models for Pictorial Structures," Proc. British Machine Vision Conf., 2009.
[7] M. Eichner and V. Ferrari, "We Are Family: Joint Pose Estimation of Multiple Persons," Proc. 11th European Conf. Computer Vision, 2010.
[8] P. Felzenszwalb and D. Huttenlocher, "Pictorial Structures for Object Recognition," Int'l J. Computer Vision, vol. 61, no. 1, pp. 55-79, 2005.
[9] P. Felzenszwalb, D. McAllester, and D. Ramanan, "A Discriminatively Trained Multiscale, Deformable Part Model," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2008.
[10] R. Fergus, P. Perona, and A. Zisserman, "A Visual Category Filter for Google Images," Proc. Eighth European Conf. Computer Vision, May 2004.
[11] V. Ferrari, M. Marin, and A. Zisserman, "Progressive Search Space Reduction for Human Pose Estimation," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2008.
[12] V. Ferrari, M. Marin, and A. Zisserman, "Pose Search: Retrieving People Using Their Pose," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2009.
[13] S. Ioffe and D. Forsyth, "Human Tracking with Mixtures of Trees," Proc. Eighth IEEE Int'l Conf. Computer Vision, 2001.
[14] S. Johnson and M. Everingham, "Clustered Pose and Nonlinear Appearance Models for Human Pose Estimation," Proc. 21st British Machine Vision Conf., 2010.
[15] A. Kanaujia, C. Sminchisescu, and D. Metaxas, "Semi-Supervised Hierarchical Models for 3D Human Pose Reconstruction," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2007.
[16] X. Lan and D. Huttenlocher, "Beyond Trees: Common-Factor Models for 2D Human Pose Recovery," Proc. 10th IEEE Int'l Conf. Computer Vision, vol. 1, 2005.
[17] M. Lee and I. Cohen, "Human Upper Body Pose Estimation in Static Images," Proc. Eighth European Conf. Computer Vision, 2004.
[18] G. Mori, X. Ren, A. Efros, and J. Malik, "Recovering Human Body Configurations: Combining Segmentation and Recognition," Proc. IEEE Conf. Computer Vision and Pattern Recognition , 2004.
[19] D. Ramanan, "Learning to Parse Images of Articulated Bodies," Proc. Advanced in Neural Information Processing Systems, 2006.
[20] D. Ramanan and C. Sminchisescu, "Training Deformable Models for Localization," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2006.
[21] C. Romesburg, Cluster Analysis for Researchers. Krieger Pub. Co., 2004.
[22] B. Sapp, C. Jordan, and B. Taskar, "Adaptive Pose Priors for Pictorial Structures," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2010.
[23] B. Sapp, A. Toshev, and B. Taskar, "Cascaded Models for Articulated Pose Estimation," Proc. Eighth European Conf. Computer Vision, 2010.
[24] F. Schroff, A. Criminisi, and A. Zisserman, "Harvesting Image Databases from the Web," Proc. 11th IEEE Int'l Conf. Computer Vision, 2007.
[25] L. Sigal and M. Black, "Measure Locally, Reason Globally: Occlusion-Sensitive Articulated Pose Estimation," Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 2, pp. 2041-2048, 2006.
[26] L. Sigal and M. Black, "Predicting 3D People from 2D Pictures," Proc. Fourth Int'l Conf. Articulated Motion and Deformable Objects, 2006.
[27] D. Tran and D. Forsyth, "Improved Human Parsing with a Full Relational Model," Proc. 11th European Conf. Computer Vision, 2010.
[28] J. Valmadre and S. Lucey, "Deterministic 3D Human Pose Estimation Using Rigid Structure," Proc. 11th European Conf. Computer Vision, 2010.
[29] P. Viola and M. Jones, "Rapid Object Detection Using a Boosted Cascade of Simple Features," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 511-518, 2001.
[30] Y. Yang and D. Ramanan, "Articulated Pose Estimation with Flexible Mixtures-of-Parts," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2011.
61 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool