The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.04 - April (2010 vol.32)
pp: 604-618
Zhe Lin , Adobe Systems Incorporated, San Jose
Larry S. Davis , University of Maryland, College Park
ABSTRACT
We propose a shape-based, hierarchical part-template matching approach to simultaneous human detection and segmentation combining local part-based and global shape-template-based schemes. The approach relies on the key idea of matching a part-template tree to images hierarchically to detect humans and estimate their poses. For learning a generic human detector, a pose-adaptive feature computation scheme is developed based on a tree matching approach. Instead of traditional concatenation-style image location-based feature encoding, we extract features adaptively in the context of human poses and train a kernel-SVM classifier to separate human/nonhuman patterns. Specifically, the features are collected in the local context of poses by tracing around the estimated shape boundaries. We also introduce an approach to multiple occluded human detection and segmentation based on an iterative occlusion compensation scheme. The output of our learned generic human detector can be used as an initial set of human hypotheses for the iterative optimization. We evaluate our approaches on three public pedestrian data sets (INRIA, MIT-CBCL, and USC-B) and two crowded sequences from Caviar Benchmark and Munich Airport data sets.
INDEX TERMS
Generic human detector, part-template tree, hierarchical part-template matching, pose-adaptive descriptor, occlusion analysis.
CITATION
Zhe Lin, Larry S. Davis, "Shape-Based Human Detection and Segmentation via Hierarchical Part-Template Matching", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol.32, no. 4, pp. 604-618, April 2010, doi:10.1109/TPAMI.2009.204
REFERENCES
[1] C. Papageorgiou, T. Evgeniou, and T. Poggio, "A Trainable Pedestrian Detection System," Proc. Symp. Intelligent Vehicles, pp. 241-246, 1998.
[2] P. Viola and M. Jones, "Rapid Object Detection Using a Boosted Cascade of Simple Features," Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 1, pp. 511-518, 2001.
[3] N. Dalal and B. Triggs, "Histograms of Oriented Gradients for Human Detection," Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 1, pp. 886-893, 2005.
[4] Y. Wu, T. Yu, and G. Hua, "A Statistical Field Model for Pedestrian Detection," Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 1, pp. 1023-1030, 2005.
[5] O. Tuzel, F. Porikli, and P. Meer, "Human Detection via Classification on Riemannian Manifold," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 1-8, 2007.
[6] D.M. Gavrila and V. Philomin, "Real-Time Object Detection for SMART Vehicles," Proc. IEEE Int'l Conf. Computer Vision, vol. 1, pp. 87-93, 1999.
[7] L. Zhao and L.S. Davis, "Closely Coupled Object Detection and Segmentation," Proc. IEEE Int'l Conf. Computer Vision, pp. 454-461, 2005.
[8] D.M. Gavrila, "A Bayesian, Exemplar-Based Approach to Hierarchical Shape Matching," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 29, no. 8, pp. 1408-1421, Aug. 2007.
[9] K. Mikolajczyk, C. Schmid, and A. Zisserman, "Human Detection Based on a Probabilistic Assembly of Robust Part Detectors," Proc. European Conf. Computer Vision, vol. 1, pp. 69-82, 2004.
[10] B. Leibe, E. Seemann, and B. Schiele, "Pedestrian Detection in Crowded Scenes," Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 1, pp. 878-885, 2005.
[11] E. Seemann, B. Leibe, and B. Schiele, "Multi-Aspect Detection of Articulated Objects," Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 2, pp. 1582-1588, 2006.
[12] J. Shotton, A. Blake, and R. Cipolla, "Contour-Based Learning for Object Detection," Proc. IEEE Int'l Conf. Computer Vision, vol. 1, pp. 503-510, 2005.
[13] A. Opelt, A. Pinz, and A. Zisserman, "A Boundary-Fragment-Model for Object Detection," Proc. European Conf. Computer Vision, vol. 2, pp. 575-588, 2006.
[14] V. Ferrari, T. Tuytelaars, and L.V. Gool, "Object Detection by Contour Segment Networks," Proc. European Conf. Computer Vision, vol. 3, pp. 14-28, 2006.
[15] V. Ferrari, L. Fevrier, F. Jurie, and C. Schmid, "Groups of Adjacent Contour Segments for Object Detection," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 30, no. 1, pp. 36-51, Jan. 2008.
[16] B. Wu and R. Nevatia, "Detection of Multiple, Partially Occluded Humans in a Single Image by Bayesian Combination of Edgelet Part Detectors," Proc. IEEE Int'l Conf. Computer Vision, pp. 90-97, 2005.
[17] A. Mohan, C. Papageorgiou, and T. Poggio, "Example-Based Object Detection in Images by Components," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 23, no. 4, pp. 349-361, Apr. 2001.
[18] V.D. Shet, J. Neumann, V. Ramesh, and L.S. Davis, "Bilattice-Based Logical Reasoning for Human Detection," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 1-8, 2007.
[19] B. Wu and R. Nevatia, "Simultaneous Object Detection and Segmentation by Boosting Local Shape Feature Based Classifier," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 1-8, 2007.
[20] R. Fergus, P. Perona, and A. Zisserman, "Object Class Recognition by Unsupervised Scale Invariant Learning," Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 2, pp. 264-271, 2003.
[21] H. Schneiderman and T. Kanade, "Object Detection Using Statistics of Parts," Int'l J. Computer Vision, vol. 56, no. 3, pp. 151-177, 2004.
[22] P.F. Felzenszwalb and D.P. Huttenlocher, "Pictorial Structures for Object Recognition," Int'l J. Computer Vision, vol. 61, no. 1, pp. 55-79, 2005.
[23] Y. Amit and A. Trouve, "POP: Patchwork of Parts Models for Object Recognition," Int'l J. Computer Vision, vol. 75, no. 2, pp. 267-282, 2007.
[24] P.F. Felzenszwalb, D. McAllester, and D. Ramanan, "A Discriminatively Trained, Multiscale, Deformable Part Model," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 1-8, 2008.
[25] M.P. Kumar, P.H.S. Torr, and A. Zisserman, "Obj Cut," Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 1, pp. 18-25, 2005.
[26] J. Winn and J. Shotton, "The Layout Consistent Random Field for Recognizing and Segmenting Partially Occluded Objects," Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 1, pp. 37-44, 2006.
[27] P. Viola, M. Jones, and D. Snow, "Detecting Pedestrians Using Patterns of Motion and Appearance," Proc. IEEE Int'l Conf. Computer Vision, pp. 734-741, 2003.
[28] N. Dalal, B. Triggs, and C. Schmid, "Human Detection Using Oriented Histograms of Flow and Appearance," Proc. European Conf. Computer Vision, vol. 2, pp. 428-441, 2006.
[29] V. Sharma and J.W. Davis, "Integrating Appearance and Motion Cues for Simultaneous Detection and Segmentation of Pedestrians," Proc. IEEE Int'l Conf. Computer Vision, pp. 1-8, 2007.
[30] Q. Zhu, S. Avidan, M.-C. Yeh, and K.-T. Cheng, "Fast Human Detection Using a Cascade of Histograms of Oriented Gradients," Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 2, pp. 1491-1498, 2006.
[31] S. Maji, A.C. Berg, and J. Malik, "Classification Using Intersection Kernel Support Vector Machines Is Efficient," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 1-8, 2008.
[32] P. Sabzmeydani and G. Mori, "Detecting Pedestrians by Learning Shapelet Features," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 1-8, 2007.
[33] B. Wu and R. Nevatia, "Optimizing Discrimination-Efficiency Tradeoff in Integrating Heterogeneous Local Features for Object Detection," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 1-8, 2008.
[34] J. Pang, Q. Huang, and S. Jiang, "Multiple Instance Boost Using Graph Embedding Based Decision Stump for Pedestrian Detection," Proc. European Conf. Computer Vision, vol. 4, pp. 541-552, 2008.
[35] H. Tao, H. Sawhney, and R. Kumar, "A Sampling Algorithm for Detecting and Tracking Multiple Objects," Proc. IEEE Int'l Conf. Computer Vision Workshop Vision Algorithms, pp. 53-68, 1999.
[36] M. Isard and J. MacCormick, "BraMBLe: A Bayesian Multiple-Blob Tracker," Proc. IEEE Int'l Conf. Computer Vision, vol. 1, pp. 34-41, 2001.
[37] T. Zhao and R. Nevatia, "Tracking Multiple Humans in Crowded Environment," Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 2, pp. 406-413, 2004.
[38] K. Smith, D.G. Perez, and J.M. Odobez, "Using Particles to Track Varying Numbers of Interacting People," Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 1, pp. 962-969, 2005.
[39] J. Rittscher, P.H. Tu, and N. Krahnstoever, "Simultaneous Estimation of Segmentation and Shape," Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 2, pp. 486-493, 2005.
[40] Q. Zhao, J. Kang, H. Tao, and W. Hua, "Part Based Human Tracking in a Multiple Cues Fusion Framework," Proc. Int'l Conf. Pattern Recognition, vol. 1, pp. 450-455, 2006.
[41] P. Dollar, B. Babenko, S. Belongie, P. Perona, and Z. Tu, "Multiple Component Learning for Object Detection," Proc. European Conf. Computer Vision, vol. 2, pp. 211-224, 2008.
[42] Z. Lin, L.S. Davis, D. Doermann, and D. DeMenthon, "Hierarchical Part-Template Matching for Human Detection and Segmentation," Proc. IEEE Int'l Conf. Computer Vision, pp. 1-8, 2007.
[43] D. Tran and D.A. Forsyth, "Configuration Estimates Improve Pedestrian Finding," Proc. Conf. Advances in Neural Information Processing Systems, 2007.
[44] Z. Lin and L.S. Davis, "A Pose Invariant Descriptor for Human Detection and Segmentation," Proc. European Conf. Computer Vision, vol. 4, pp. 423-436, 2008.
[45] D. Hoiem, A. Efros, and M. Hebert, "Putting Objects in Perspective," Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 2, pp. 2137-2144, 2006.
[46] C.-C. Chang and C.-J. Lin, LIBSVM: A Library for Support Vector Machines, http://www.csie.ntu.edu.tw/~cjlinlibsvm, 2001.
[47] S. Tran and L.S. Davis, "Robust Object Tracking with Regional Affine Invariant Features," Proc. IEEE Int'l Conf. Computer Vision, pp. 1-8, 2007.
[48] S. Tran, Z. Lin, D. Harwood, and L.S. Davis, "UMD_VDT, an Integration of Detection and Tracking Methods for Multiple Human Tracking," Proc. CLEAR Workshop, pp. 179-190, 2007.
28 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool