The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.10 - October (2008 vol.30)
pp: 1713-1727
Oncel Tuzel , Rutgers University, Piscataway
Fatih Porikli , Mitsubishi Electric Research Labs., Cambridge
Peter Meer , Rutgers University, Piscataway
ABSTRACT
We present a new algorithm to detect pedestrian in still images utilizing covariance matrices as object descriptors. Since the descriptors do not form a vector space, well known machine learning techniques are not well suited to learn the classifiers. The space of d-dimensional nonsingular covariance matrices can be represented as a connected Riemannian manifold. The main contribution of the paper is a novel approach for classifying points lying on a connected Riemannian manifold using the geometry of the space. The algorithm is tested on INRIA and DaimlerChrysler pedestrian datasets where superior detection rates are observed over the previous approaches.
INDEX TERMS
Object recognition, Scene Analysis, Image Processing and Computer Vision, Computing Methodologies, Vision and Scene Understanding, Machine learning
CITATION
Oncel Tuzel, Fatih Porikli, Peter Meer, "Pedestrian Detection via Classification on Riemannian Manifolds", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol.30, no. 10, pp. 1713-1727, October 2008, doi:10.1109/TPAMI.2008.75
REFERENCES
[1] S. Agarwal and D. Roth, “Learning a Sparse Representation for Object Detection,” Proc. European Conf. Computer Vision (ECCV '03), pp. 113-127, 2003.
[2] E. Begelfor and M. Werman, “Affine Invariance Revisited,” Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR '06), vol. 2, pp. 2087-2094, 2006.
[3] S. Belongie, J. Malik, and J. Puzicha, “Shape Matching and Object Recognition Using Shape Contexts,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 24, no. 4, pp. 509-522, Apr. 2002.
[4] A. Berg, T. Berg, and J. Malik, “Shape Matching and Object Recognition Using Low Distortion Correspondence,” Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR '05), pp. 26-33, 2005.
[5] W.M. Boothby, An Introduction to Differentiable Manifolds and Riemannian Geometry. Academic Press, 2002.
[6] D. Comaniciu and P. Meer, “Mean Shift: A Robust Approach toward Feature Space Analysis,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 24, pp. 603-619, 2002.
[7] C. Cortes and V. Vapnik, “Support Vector Networks,” Machine Learning, vol. 20, pp. 273-297, 1995.
[8] N. Dalal, “Finding People in Images and Videos,” PhD dissertation, Inst. Nat'l Polytechnique de Gre noble, July 2006.
[9] N. Dalal and B. Triggs, “Histograms of Oriented Gradients for Human Detection,” Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR '05), vol. 1, pp. 886-893, 2005.
[10] G. Dorkó and C. Schmid, “Selection of Scale-Invariant Parts for Object Class Recognition,” Proc. Ninth Int'l Conf. Computer Vision (ICCV '03), pp. 634-640, 2003.
[11] P.F. Felzenszwalb and D.P. Huttenlocher, “Pictorial Structures for Object Recognition,” Int'l J. Computer Vision, vol. 61, no. 1, pp. 55-79, 2005.
[12] R. Fergus, P. Perona, and A. Zisserman, “Object Class Recognition by Unsupervised Scale-Invariant Learning,” Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR '03), pp. 264-271, 2003.
[13] P.T. Fletcher and S. Joshi, “Riemannian Geometry for the Statistical Analysis of Diffusion Tensor Data,” Signal Processing, vol. 87, no. 2, pp. 250-262, 2007.
[14] W. Förstner and E. Gülch, “A Fast Operator for Detection and Precise Location of Distinct Points, Corners and Centres of Circular Features,” Proc. Intercommission Conf. Fast Processing of Photogrammetric Data, pp. 281-305, 1987.
[15] W. Förstner and B. Moonen, “A Metric for Covariance Matrices,” technical report, Dept. of Geodesy and Geoinformatics, Stuttgart Univ., 1999.
[16] J. Friedman, T. Hastie, and R. Tibshirani, “Additive Logistic Regression: A Statistical View of Boosting,” Annals of Statistics, vol. 28, no. 2, pp. 337-407, 2000.
[17] D. Gavrila and V. Philomin, “Real-Time Object Detection for Smart Vehicles,” Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR '99), pp. 87-93, 1999.
[18] C. Harris and M. Stephens, “A Combined Corner and Edge Detector,” Proc. Fourth Alvey Vision Conf., pp. 147-151, 1988.
[19] S. Haykin, Neural Networks: A Comprehensive Foundation, second ed. Prentice Hall, 1998.
[20] S. Ioffe and D.A. Forsyth, “Probabilistic Methods for Finding People,” Int'l J. Computer Vision, vol. 43, no. 1, pp. 45-68, 2001.
[21] T. Kadir and M. Brady, “Scale, Saliency and Image Description,” Int'l J. Computer Vision, vol. 45, no. 2, pp. 83-105, 2001.
[22] H. Karcher, “Riemannian Center of Mass and Mollifier Smoothing,” Comm. Pure and Applied Math., vol. 30, pp. 509-541, 1977.
[23] B. Leibe, E. Seemann, and B. Schiele, “Pedestrian Detection in Crowded Scenes,” Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR '05), vol. 1, pp. 878-885, 2005.
[24] Y. Li, Y. Tsin, Y. Genc, and T. Kanade, “Object Detection Using 2D Spatial Ordering Constraints,” Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR '05), pp. 711-718, 2005.
[25] D.G. Lowe, “Distinctive Image Features from Scale-Invariant Keypoints,” Int'l J. Computer Vision, vol. 60, no. 2, pp. 91-110, 2004.
[26] K. Mikolajczyk, B. Leibe, and B. Schiele, “Multiple Object Class Detection with a Generative Model,” Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR '06), vol. 1, pp. 26-36, 2006.
[27] K. Mikolajczyk and C. Schmid, “Indexing Based on Scale Invariant Interest Points,” Proc. Eighth Int'l Conf. Computer Vision (ICCV '01), pp. 525-531, 2001.
[28] K. Mikolajczyk, C. Schmid, and A. Zisserman, “Human Detection Based on a Probabilistic Assembly of Robust Part Detectors,” Proc. Eighth European Conf. Computer Vision (ECCV '04), vol. 1, pp. 69-81, 2004.
[29] M. Moakher, “A Differential Geometric Approach to the Geometric Mean of Symmetric Positive-Definite Matrices,” SIAM J. Matrix Analysis and Applications, vol. 26, pp. 735-747, 2005.
[30] A. Mohan, C. Papageorgiou, and T. Poggio, “Example-Based Object Detection in Images by Components,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 23, no. 4, pp. 349-360, Apr. 2001.
[31] S. Munder and D.M. Gavrila, “An Experimental Study on Pedestrian Classification,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 28, pp. 1863-1868, 2006.
[32] A. Opelt, M. Fussenegger, A. Pinz, and P. Auer, “Weak Hypotheses and Boosting for Generic Object Detection and Recognition,” Proc. Eighth European Conf. Computer Vision (ECCV '04), pp. 71-84, 2004.
[33] A. Opelt, A. Pinz, and A. Zisserman, “Incremental Learning of Object Detectors Using a Visual Shape Alphabet,” Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR '06), vol. 1, pp. 3-10, 2006.
[34] P. Papageorgiou and T. Poggio, “A Trainable System for Object Detection,” Int'l J. Computer Vision, vol. 38, no. 1, pp. 15-33, 2000.
[35] X. Pennec, P. Fillard, and N. Ayache, “A Riemannian Framework for Tensor Computing,” Int'l J. Computer Vision, vol. 66, no. 1, pp.41-66, 2006.
[36] F. Porikli, “Integral Histogram: A Fast Way to Extract Histograms in Cartesian Spaces,” Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR '05), vol. 1, pp. 829-836, 2005.
[37] F. Porikli, O. Tuzel, and P. Meer, “Covariance Tracking Using Model Update Based on Lie Algebra,” Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR '06), vol. 1, pp. 728-735, 2006.
[38] R. Ronfard, C. Schmid, and B. Triggs, “Learning to Parse Pictures of People,” Proc. Seventh European Conf. Computer Vision (ECCV '02), vol. 4, pp. 700-714, 2002.
[39] H. Rowley, S. Baluja, and T. Kanade, “Neural Network-Based Face Detection,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 20, pp. 22-38, 1998.
[40] P. Sabzmeydani and G. Mori, “Detecting Pedestrians by Learning Shapelet Features,” Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR), 2007.
[41] R.E. Schapire, “The Boosting Approach to Machine Learning, an Overview,” Proc. MSRI Workshop Nonlinear Estimation and Classification, 2002.
[42] P. Simard, L. Bottou, P. Haffner, and Y. LeCun, “Boxlets: A Fast Convolution Algorithm for Signal Processing and Neural Networks,” Proc. Conf. Advances in Neural Information Processing Systems II, pp. 571-577, 1998.
[43] L. Sirovitch and M. Kirby, “Low-Dimensional Procedure for the Characterization of Human Faces,” J. Optical Soc. of America, vol. 2, pp. 519-524, 1987.
[44] R. Subbarao and P. Meer, “Nonlinear Mean Shift for Clustering over Analytic Manifolds,” Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR '06), vol. 1, pp. 1168-1175, 2006.
[45] K. Sung and T. Poggio, “Example-Based Learning for View-Based Human Face Detection,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 20, pp. 39-51, 1998.
[46] A. Torralba, K. Murphy, and W. Freeman, “Sharing Features: Efficient Boosting Procedures for Multiclass Object Detection,” Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR '04), pp. 762-769, 2004.
[47] M. Turk and A. Pentland, Face Recognition Using Eigenfaces, pp.586-591, 1991.
[48] O. Tuzel, F. Porikli, and P. Meer, “Region Covariance: A Fast Descriptor for Detection and Classification,” Proc. Ninth European Conf. Computer Vision (ECCV '06), vol. 2, pp. 589-600, 2006.
[49] O. Tuzel, R. Subbarao, and P. Meer, “Simultaneous Multiple 3D Motion Estimation via Mode Finding on Lie Groups,” Proc. 10th Int'l Conf. Computer Vision (ICCV '05), vol. 1, pp. 18-25, 2005.
[50] P. Viola and M. Jones, “Rapid Object Detection Using a Boosted Cascade of Simple Features,” Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR '01), vol. 1, pp. 511-518, 2001.
[51] P. Viola, M. Jones, and D. Snow, “Detecting Pedestrians Using Patterns of Motion and Appearance,” Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR '03), vol. 1, pp. 734-741, 2003.
[52] M. Weber, M. Welling, and P. Perona, “Unsupervised Learning of Models for Recognition,” Proc. Sixth European Conf. Computer Vision (ECCV '00), pp. 18-32, 2000.
[53] Q. Zhu, S. Avidan, M.C. Yeh, and K.T. Cheng, “Fast Human Detection Using a Cascade of Histograms of Oriented Gradients,” Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR '06), vol. 2, pp. 1491-1498, 2006.
17 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool