This Article 
 Bibliographic References 
 Add to: 
An Experimental Study on Pedestrian Classification
November 2006 (vol. 28 no. 11)
pp. 1863-1868
S. Munder, Dept. of Machine Perception, DaimlerChrysler Res. & Dev., Ulm
D.M. Gavrila, Dept. of Machine Perception, DaimlerChrysler Res. & Dev., Ulm
Detecting people in images is key for several important application domains in computer vision. This paper presents an in-depth experimental study on pedestrian classification; multiple feature-classifier combinations are examined with respect to their ROC performance and efficiency. We investigate global versus local and adaptive versus nonadaptive features, as exemplified by PCA coefficients, Haar wavelets, and local receptive fields (LRFs). In terms of classifiers, we consider the popular support vector machines (SVMs), feedforward neural networks, and k-nearest neighbor classifier. Experiments are performed on a large data set consisting of 4,000 pedestrian and more than 25,000 nonpedestrian (labeled) images captured in outdoor urban environments. Statistically meaningful results are obtained by analyzing performance variances caused by varying training and test sets. Furthermore, we investigate how classification performance and training sample size are correlated. Sample size is adjusted by increasing the number of manually labeled training data or by employing automatic bootstrapping or cascade techniques. Our experiments show that the novel combination of SVMs with LRF features performs best. A boosted cascade of Haar wavelets can, however, reach quite competitive results, at a fraction of computational cost. The data set used in this paper is made public, establishing a benchmark for this important problem

[1] D.M. Gavrila, “The Visual Analysis of Human Movement: A Survey,” Computer Vision and Image Understanding, vol. 73, no. 1, pp. 82-98, 1999.
[2] “Face Recognition Vendor Test,”, 2004.
[3] C. Wöhler and J. Anlauf, “An Adaptable Time-Delay Neural-Network Algorithm for Image Sequence Analysis,” IEEE Trans. Intelligent Transportation, vol. 10, no. 6, pp. 1531-1536, Nov. 1999.
[4] L. Zhao and C. Thorpe, “Stereo- and Neural Network-Based Pedestrian Detection,” IEEE Trans. Intelligent Transportation Systems, vol. 1, no. 3, 2000.
[5] C. Papageorgiou and T. Poggio, “A Trainable System for Object Detection,” Int'l J. Computer Vision, vol. 38, no. 1, pp. 15-33, Sept. 2000.
[6] H. Elzein, S. Lakshmanan, and P. Watta, “A Motion and Shape-Based Pedestrian Detection Algorithm,” Proc. IEEE Intelligent Vehicle Symp., pp.500-504, 2003.
[7] P. Viola, M. Jones, and D. Snow, “Detecting Pedestrians Using Patterns of Motion and Appearance,” Proc. Int'l Conf. Computer Vision, pp. 734-741, 2003.
[8] A. Shashua, Y. Gdalyaha, and G. Hayun, “Pedestrian Detection for Driving Assistance Systems: Single-Frame Classification and System Level Performance,” Proc. IEEE Intelligent Vehicle Symp., 2004.
[9] A. Mohan, C. Papageorgiou, and T. Poggio, “Example-Based Object Detection in Images by Components,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 23, no. 4, pp. 349-361, Apr. 2001.
[10] A. Jain, R. Duin, and J. Mao, “Statistical Pattern Recognition: A Review,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 22, no. 1, pp. 4-37, Jan. 2000.
[11] K. Fukushima, S. Miyake, and T. Ito, “Neocognitron: A Neural Network Model for a Mechanism of Visual Pattern Recognition,” IEEE Trans. Systems, Man, and Cybernetics, vol. 13, pp. 826-834, 1983.
[12] V.N. Vapnik, Statistical Learning Theory. New York: John Wiley, 1998.
[13] K.K. Sung and T. Poggio, “Example Based Learning for View-Based Human Face Detection,” Technical Report CBCL-112, Artificial Intelligence Laboratory, Massachusettes Inst. of Tech nology, Jan. 1995.
[14] Y. Freund and R.E. Schapire, “A Decision-Theoretic Generalization of Online Learning and an Application to Boosting,” Proc. European Conf. Computational Learning Theory, pp. 23-37, 1995.
[15] “Intel Open Source Computer Vision Library,” 2004. opencv/.
[16] D.M. Gavrila and V. Philomin, “Real-Time Object Detection for ‘Smart’ Vehicles,” Proc. Int'l Conf. Computer Vision, pp. 87-93, 1999.
[17] G. Borgefors, “Distance Transformations in Digital Images,” Computer Vision, Graphics, and Image Processing, vol. 34, no. 3, pp. 344-371, June 1986.

Index Terms:
Application software,Computer vision,Principal component analysis,Support vector machines,Support vector machine classification,Neural networks,Feedforward neural networks,Analysis of variance,Performance analysis,Testing,performance analysis.,Pedestrian classification,feature evaluation,classifier evaluation
S. Munder, D.M. Gavrila, "An Experimental Study on Pedestrian Classification," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 28, no. 11, pp. 1863-1868, Nov. 2006, doi:10.1109/TPAMI.2006.217
Usage of this product signifies your acceptance of the Terms of Use.