The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.06 - June (2013 vol.35)
pp: 1423-1436
Bongjin Jun , Pohang University of Science and Technology, Pohang
Inho Choi , Pohang University of Science and Technology, Pohang
Daijin Kim , Pohang University of Science and Technology, Pohang
ABSTRACT
We propose two novel local transform features: local gradient patterns (LGP) and binary histograms of oriented gradients (BHOG). LGP assigns one if the neighboring gradient of a given pixel is greater than its average of eight neighboring gradients and zero otherwise, which makes the local intensity variations along the edge components robust. BHOG assigns one if the histogram bin has a higher value than the average value of the total histogram bins, and zero otherwise, which makes the computation time fast due to no further postprocessing and SVM classification. We also propose a hybrid feature that combines several local transform features by means of the AdaBoost method, where the best feature having the lowest classification error is sequentially selected until we obtain the required classification performance. This hybridization makes face and human detection robust to global illumination changes by LBP, local intensity changes by LGP, and local pose changes by BHOG, which considerably improves detection performance. We apply the proposed features to face detection using the MIT+CMU and FDDB databases and human detection using the INRIA and Caltech databases. Our experimental results indicate that the proposed LGP and BHOG feature attain accurate detection performance and fast computation time, respectively, and the hybrid feature improves face and human detection performance considerably.
INDEX TERMS
Transforms, Face, Feature extraction, Histograms, Humans, Robustness, Face recognition,face and human detection, Local binary pattern, local gradient pattern, binary histograms of oriented gradients, feature hybridization
CITATION
Bongjin Jun, Inho Choi, Daijin Kim, "Local Transform Features and Hybridization for Accurate Face and Human Detection", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol.35, no. 6, pp. 1423-1436, June 2013, doi:10.1109/TPAMI.2012.219
REFERENCES
[1] M. Swain and D. Ballard, "Color Indexing," Int'l J. Computer Vision, vol. 7, no. 1, pp. 11-32, 1991.
[2] T. Randen and J. Husoy, "Filtering for Texture Classification: A Comparative Study," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 21, no. 4, pp. 291-310, Apr. 1999.
[3] C. Papageorgiou and T. Poggio, "A Trainable System for Object Detection," Int'l J. Computer Vision, vol. 38, no. 1, pp. 15-33, 2000.
[4] P. Viola and M. Jones, "Robust Real-Time Face Detection," Int'l J. Computer Vision, vol. 57, no. 2, pp. 137-154, 2004.
[5] P. Viola, M. Jones, and D. Snow, "Detecting Pedestrians Using Patterns of Motion and Appearance," Int'l J. Computer Vision, vol. 63, no. 2, pp. 153-161, 2005.
[6] X.C.S. Yan, S. Shan, and W. Gao, "Locally Assembled Binary (LAB) Feature with Feature-Centric Cascade for Fast and Accurate Face Detection," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 1-7, 2008.
[7] T. Ojala, M. Pietikainen, and D. Harwood, "A Comparative Study of Texture Measures with Classification Based on Feature Distributions," Pattern Recognition, vol. 29, no. 1, pp. 51-59, 1996.
[8] R. Zabih and J. Woodfill, "Non-Parametric Local Transforms for Computing Visual Correspondence," Proc. European Conf. Computer Vision, pp. 151-158, 1994.
[9] L. Zhang, R. Chu, S. Xiang, S. Liao, and S. Li, "Face Detection Based on Multi-Block LBP Representation," Proc. Int'l Conf. Biometrics, pp. 11-18, 2007.
[10] H. Jin, H.L.Q. Liu, and X. Tong, "Face Detection Using Improved LBP under Bayesian Framework," Proc. Int'l Conf. Image and Graphics, pp. 306-309. 2004.
[11] T. Ahonen, A. Hadid, and M. Pietikainen, "Face Description with Local Binary Patterns: Application to Face Recognition," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 28, no. 12, pp. 2037-2041, Dec. 2006.
[12] W. Zhang, S. Shan, W. Gao, X. Chen, and H. Zhang, "Local Gabor Binary Pattern Histogram Sequence (LGBPHS): A Novel Non-Statistical Model for Face Representation and Recognition," Proc. IEEE Int'l Conf. Computer Vision, pp. 786-791, 2005.
[13] X. Feng, M. Pietikainen, and A. Hadid, "Facial Expression Recognition with Local Binary Patterns and Linear Programming," Pattern Recognition and Image Analysis, vol. 15, no. 2, pp. 546-548, 2005.
[14] C. Shan, S. Gong, and P. McOwan, "Facial Expression Recognition Based on Local Binary Patterns: A Comprehensive Study," Image and Vision Computing, vol. 27, pp. 803-816, 2009.
[15] N. Sun, W. Zheng, C. Sun, C. Zou, and L. Zhao, "Gender Classification Based on Boosting Local Binary Pattern," Proc. Int'l Symp. Neural Networks, pp. 194-201, 2006.
[16] G. Heusch, Y. Rodriguez, and S. Marcel, "Local Binary Patterns as an Image Preprocessing for Face Authentication," Proc. Int'l Conf. Automatic Face and Gesture Recognition, pp. 9-14, 2006.
[17] V. Kellokumpu, G. Zhao, S. Li, and M. Pietikainen, "Dynamic Texture Based Gait Recognition," Proc. Int'l Conf. Biometrics, pp. 1000-1009, 2009.
[18] V. Takala, T. Ahonen, and M. Pietikainen, "Block-Based Methods for Image Retrieval Using Local Binary Patterns," Proc. Scandinavian Conf. Image Analysis, pp. 882-891, 2005.
[19] D. Grimes and R. Rao, "A Bilinear Model for Sparse Coding," Proc. Neural Information Processing Systems, vol. 15, pp. 1287-1294, 2003.
[20] T. Ojala, M. Pietikainen, and T. Maenpaa, "Multiresolution Grayscale and Rotation Invariant Texture Classification with Local Binary Patterns," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 24, no. 7, pp. 971-987, July 2002.
[21] X. Huang, S.Z. Li, and Y. Wang, "Shape Localization Based on Statistical Method Using Extended Local Binary Pattern," Proc. Int'l Conf. Image and Graphics, pp. 184-187, 2004.
[22] M. Heikkila, M. Pietikainen, and J. Heikkila, "A Texture-Based Method for Detecting Moving Objects," Proc. British Machine Vision Conf., pp. 187-196, 2004.
[23] D. Lowe, "Distinctive Image Features from Scale Invariant Keypoints," Int'l J. Computer Vision, vol. 60, no. 2, pp. 91-110, 2004.
[24] Y. Ke and R. Sukthankar, "PCA-Sift: A More Distinctive Representation for Local Image Descriptors," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 511-517, 2004.
[25] H. Bay, A. Ess, T. Tuytelaars, and L. Gool, "Surf: Speeded Up Robust Features," Computer Vision and Image Understanding, vol. 110, no. 3, pp. 346-359, 2008.
[26] N. Dalal and B. Triggs, "Histograms of Oriented Gradients for Human Detection," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 886-893, 2005.
[27] Q. Zhu, S. Avidan, M. Yeh, and K. Cheng, "Fast Human Detection Using a Cascade of Histograms of Oriented Gradients," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 1491-1498. 2006.
[28] P. Dollar, S. Belongie, and P. Perona, "The Fastest Pedestrian Detector in the West," Proc. British Machine Vision Conf., pp. 1-11, 2010.
[29] O. Deniza, G. Buenoa, and J. Salido, "Face Recognition Using Histograms of Oriented Gradients," Pattern Recognition Letters, vol. 32, no. 12, pp. 1598-1603, 2011.
[30] P. Felzenszwalb, R. Girshick, D. McAllester, and D. Ramanan, "Object Detection with Discriminatively Trained Part Based Models," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 32, no. 9, pp. 1627-1645, Sept. 2010.
[31] P. Felzenszwalb, R. Girshick, and D. McAllester, "Cascade Object Detection with Deformable Part Models," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 2241-2248, 2010.
[32] M. Dahmane and J. Meunier, "Emotion Recognition Using Dynamic Grid-Based Hog Features," Proc. IEEE Int'l Conf. Automatic Face and Gesture Recognition, pp. 884-888, 2011.
[33] V. Shet, J. Neumann, V. Ramesh, and L. Davis, "Bilattice-Based Logical Reasoning for Human Detection," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 1-8, 2007.
[34] F. Porkili, "Integral Histogram: A Fast Way to Extract Histograms in Cartesian Spaces," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 829-836, 2005.
[35] B. Froba and A. Ernst, "Face Detection with the Modified Census Transform," Proc. IEEE Int'l Conf. Automatic Face and Gesture Recognition, pp. 91-96, 2004.
[36] H.A. Rowley, "Neural Network-Based Face Detection." PhD thesis, Carnegie Mellon Univ., Pitsburgh, 1999.
[37] V. Jain and E. Learned-Miller, "FDDB: A Benchmark for Face Detection in Unconstrained Settings," Technical Report UM-CS-2010-009, Univ. of Massachusetts, Amherst, 2010.
[38] H. Rowley, S. Baluja, and T. Kanade, "Neural Network-Based Face Detection," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 20, no. 1, pp. 23-38, Jan. 1998.
[39] K. Mikolajczyk, C. Schmid, and A. Zisserman, "Human Detection Based on a Probabilistic Assembly of Robust Part Detectors," Proc. European Conf. Computer Vision, pp. 69-82, 2004.
[40] V. Subburaman and S. Marcel, "Fast Bounding Box Estimation Based Face Detection," Proc. ECCV Workshop Face Detection: Where We Are and What Next?, 2010.
[41] J. Li, T. Wang, and Y. Zhang, "Face Detection Using Surf Cascade," Proc. IEEE Int'l Conf. Computer Vision Workshops, pp. 2183-2190, 2011.
[42] V. Jain and E. Learned-Miller, "Online Domain Adaptation of a Pre-Trained Cascade of Classifiers," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 577-584, 2011.
[43] P. Dollar, C. Wojek, B. Schiele, and P. Perona, "Pedestrian Detection: An Evaluation of the State of the Art," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 34, no. 4, pp. 743-761, Apr. 2012.
[44] S. Walk, N. Majer, K. Schindler, and B. Schiele, "New Features and Insights for Pedestrian Detection," Proc. IEEE Computer Vision and Pattern Recognition, pp. 1030-1037, 2010.
[45] P. Dollar, Z. Tu, P. Perona, and S. Belongie, "Integral Channel Features," Proc. British Machine Vision Conf., pp. 1-11, 2004.
[46] P. Dollar, S. Belongie, and P. Perona, "The Fastest Pedestrian Detector in the West," Proc. British Machine Vision Conf., pp. 1-11, 2010.
[47] A. Bar-Hillel, D. Levi, E. Krupka, and C. Goldberg, "Part-Based Feature Synthesis for Human Detection," Proc. European Conf. Computer Vision, pp. 127-142, 2010.
[48] W. Schwartz, A. Kembhavi, D. Harwood, and L. Davis, "Human Detection Using Partial Least Squares Analysis," Proc. IEEE Int'l Conf. Computer Vision, pp. 24-31, 2009.
[49] P.F. Felzenszwalb, R.B. Girshick, D. McAllester, and D. Ramanan, "Object Detection with Discriminatively Trained Part Based Models," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 32, no. 9, pp. 1627-1645, Sept. 2010.
[50] C. Wojek and B. Schiele, "A Performance Evaluation of Single and Multi-Feature People Detection," Proc. IEEE Int'l Conf. Computer Vision, pp. 24-31, 2009.
[51] X. Wang, T.X. Han, and S. Yan, "An HOG-LBP Human Detector with Partial Occlusion Handling," Proc. DAGM Symp. Pattern Recognition, pp. 82-91, 2008.
[52] S. Maji, A. Berg, and J. Malik, "Classification Using Intersection Kernel SVMs Is Efficient," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 1-8, 2008.
[53] P. Dollar, Z. Tu, H. Tao, and S. Belongie, "Feature Mining for Image Classification," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 1-8, 2007.
[54] P. Felzenszwalb, D. McAllester, and D. Ramanan, "A Discriminatively Trained, Multiscale, Deformable Part Model," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 1-8, 2008.
[55] Z. Lin and L.S. Davis, "A Pose-Invariant Descriptor for Human Detection and Segmentation," Proc. European Conf. Computer Vision, pp. 423-436, 2008.
[56] P. Sabzmeydani and G. Mori, "Detecting Pedestrians by Learning Shapelet Features," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 1-8, 2007.
[57] V. Prisacariu and I. Reid, "FastHOG—a Real-Time GPU Implementation of HOG," Technical Report 2310/09, Dept. of Eng. Science, Oxford Univ., 2009.
49 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool