The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.05 - May (2013 vol.35)
pp: 1149-1163
B. Martinez , Dept. of Comput., Imperial Coll. London, London, UK
M. F. Valstar , Mixed Reality Lab., Univ. of Nottingham, Nottingham, UK
X. Binefa , Dept. of Inf. Technol. & Telecommun., Univ. Pompeu Fabra, Barcelona, Spain
M. Pantic , Dept. of Comput., Imperial Coll. London, London, UK
ABSTRACT
We propose a new algorithm to detect facial points in frontal and near-frontal face images. It combines a regression-based approach with a probabilistic graphical model-based face shape model that restricts the search to anthropomorphically consistent regions. While most regression-based approaches perform a sequential approximation of the target location, our algorithm detects the target location by aggregating the estimates obtained from stochastically selected local appearance information into a single robust prediction. The underlying assumption is that by aggregating the different estimates, their errors will cancel out as long as the regressor inputs are uncorrelated. Once this new perspective is adopted, the problem is reformulated as how to optimally select the test locations over which the regressors are evaluated. We propose to extend the regression-based model to provide a quality measure of each prediction, and use the shape model to restrict and correct the sampling region. Our approach combines the low computational cost typical of regression-based approaches with the robustness of exhaustive-search approaches. The proposed algorithm was tested on over 7,500 images from five databases. Results showed significant improvement over the current state of the art.
INDEX TERMS
Shape, Face, Prediction algorithms, Training, Vectors, Support vector machines, Feature extraction,support vector regression, Facial point detection, object detection, probabilistic graphical networks
CITATION
B. Martinez, M. F. Valstar, X. Binefa, M. Pantic, "Local Evidence Aggregation for Regression-Based Facial Point Detection", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol.35, no. 5, pp. 1149-1163, May 2013, doi:10.1109/TPAMI.2012.205
REFERENCES
[1] S. Avidan, "Support Vector Tracking," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 26, no. 8, pp. 1064-1072, Aug. 2004.
[2] P.N. Belhumeur, D.W. Jacobs, D.J. Kriegman, and N. Kumar, "Localizing Parts of Faces Using a Consensus of Exemplars," Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2011.
[3] C.M. Bishop, Pattern Recognition and Machine Learning. Springer, 2006.
[4] S. Coşar and M. Çetin, "A Graphical Model Based Solution to the Facial Feature Point Tracking Problem," Image Vision Computing, vol. 29, no. 5, pp. 335-350, 2011.
[5] T. Cootes, G. Edwards, and C. Taylor, "Active Appearance Models," Proc. European Conf. Computer Vision, vol. 2, pp. 484-498, 1998.
[6] D. Cristinacce and T. Cootes, "Boosted Regression Active Shape Models," Proc. British Machine Vision Conf., pp. 880-889, 2007.
[7] D. Cristinacce and T. Cootes, "Automatic Feature Localisation with Constrained Local Models," Pattern Recognition, vol. 41, pp. 3054-3067, 2008.
[8] H. Dibeklioglu, A. Salah, and T. Gevers, "A Statistical Method for 2-D Facial Landmarking," IEEE Trans. Image Processing, vol. 21, no. 2, pp. 844 -858, Feb. 2012.
[9] L. Ding and A.M. Martinez, "Features Versus Context: An Approach for Precise and Detailed Detection and Delineation of Faces and Facial Features," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 32, no. 11, pp. 2022-2038, Nov. 2010.
[10] P. Dollar, P. Welinder, and P. Perona, "Cascaded Pose Regression," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 1078-1085, 2010.
[11] H. Drucker, "Improving Regressors Using Boosting Techniques," Proc. Int'l Workshop Machine Learning, pp. 107-115, 1997.
[12] B. Efraty, C. Huang, S. Shah, and I. Kakadiaris, "Facial Landmark Detection in Uncontrolled Conditions," Proc. Int'l Joint Conf. Biometrics, 2011.
[13] P. Ekman, W.V. Friesen, and J.C. Hager, FACS Manual. A Human Face, May 2002.
[14] P.F. Felzenszwalb and D.P. Huttenlocher, "Pictorial Structures for Object Recognition," Int'l J. Computer Vision, vol. 61, no. 1, pp. 55-79, 2005.
[15] R. Gross, I. Matthews, J. Cohn, T. Kanade, and S. Baker, "multiPie," Image and Vision Computing, vol. 28, no. 5, pp. 807-813, 2010.
[16] M. Hall, "Correlation-Based Feature Selection for Machine Learning," PhD thesis, The Univ. of Waikato, 1999.
[17] Y. Hu, Z. Zeng, L. Yin, X. Wei, X. Zhou, and T.S. Huang, "multiView Facial Expression Recognition," Proc. IEEE Int'l Conf. Automatic Face and Gesture Recognition, pp. 1-6, 2008.
[18] O. Jesorsky, K. Kirchberg, and R. Frischholz, "Robust Face Detection Using the Hausdorff Distance," Proc. Third Int'l Conf. Audio- and Video-Based Biometric Person Authentication, pp. 90-95, 2001.
[19] T. Kozakaya, T. Shibata, M. Yuasa, and O. Yamaguchi, "Facial Feature Localization Using Weighted Vector Concentration Approach," Image and Vision Computing, vol. 28, no. 5, pp. 772-780, 2010.
[20] B. Leibe, A. Leonardis, and B. Schiele, "Robust Object Detection with Interleaved Categorization and Segmentation," Int'l J. Computer Vision, vol. 77, nos. 1-3, pp. 259-289, 2008.
[21] L. Liang, F. Wen, Y.-Q. Xu, X. Tang, and H.-Y. Shum, "Accurate Face Alignment Using Shape Constrained Markov Network," Computer Vision Pattern Recognition, vol. 1, pp. 1313-1319, 2006.
[22] S. Liwicki, S. Zafeiriou, and M. Pantic, "Fast and Robust Appearance-Based Tracking," Proc. IEEE Int'l Conf. Automatic Face and Gesture Recognition, pp. 507-513, 2011.
[23] G. McKeown, M. Valstar, R. Cowie, M. Pantic, and M. Schröder, "The Semaine Database: Annotated Multimodal Records of Emotionally Coloured Conversations Between a Person and a Limited Agent," IEEE Trans. Affective Computing, vol. 3, no. 1, pp. 5-17, Jan.-Mar. 2012.
[24] K. Messer, J. Matas, J. Kittler, J. Luettin, and G. Maitre, "XM2VTSbd: The Extended M2VTS Database," Proc. Conf. Audio and Video-Base Biometric Personal Verification, 1999.
[25] S. Milborrow and F. Nicolls, "Locating Facial Features with an Extended Active Shape Model," Proc. IEEE European Conf. Computer Vision, pp. 504-513, 2008.
[26] T. Ojala, M. Pietikainen, and D. Harwood, "A Comparative Study of Texture Measures with Classification Based on Featured Distributions," Pattern Recognition, vol. 29, no. 1, pp. 51-59, 1996.
[27] T. Ojala, M. Pietikainen, and T. Maenpaa, "Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 24, no. 7, pp. 971-987, July 2002.
[28] V. Ojansivu and J. Heikkilä, "Blur Insensitive Texture Classification Using Local Phase Quantization," Proc. Third Int'l Conf. Image and Signal Processing, pp. 1-8, 2008.
[29] M. Pantic and L. Rothkrantz, "Expert System for Automatic Analysis of Facial Expressions," Image and Vision Computing J., vol. 18, no. 11, pp. 881-905, 2000.
[30] I. Patras and E.R. Hancock, "Coupled Prediction Classification for Robust Visual Tracking," IEEE Trans. Pattern Analysis Machine Intelligence, vol. 32, no. 9, pp. 1553-1567, Sept. 2010.
[31] J. Pearl, Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Morgan Kaufmann Publishers Inc., 1988.
[32] P. Phillips, H. Wechsler, J. Huang, and P. Rauss, "The FERET Database and Evaluation Procedure for Face-Recognition Algorithms," Image and Vision Computing, vol. 16, no. 5, pp. 295-306, 1998.
[33] V. Rapp, T. Senechal, K. Bailly, and L. Prevost, "Multiple Kernel Learning SVM and Statistical Validation for Facial Landmark Detection," Proc. IEEE Int'l Conf. Automatic Face and Gesture Recognition, pp. 265-271, 2011.
[34] O. Rudovic, I. Patras, and M. Pantic, "Coupled Gaussian Process Regression for Pose-Invariant Facial Expression Recognition," Proc. European Conf. Computer Vision, pp. 350-363, 2010.
[35] J.M. Saragih, S. Lucey, and J.F. Cohn, "Deformable Model Fitting by Regularized Landmark Mean-Shift," Int'l J. Computer Vision, vol. 91, no. 2, pp. 200-215, 2011.
[36] T. Senechal, V. Rapp, H. Salam, R. Seguier, K. Bailly, and L. Prevost, "Combining AAM Coefficients with LGBP Histograms in the multiKernel SVM Framework to Detect Facial Action Units," Proc. IEEE Int'l Conf. Automatic Face and Gesture Recognition, pp. 1-6, 2011.
[37] P.A. Tresadern, H. Bhaskar, S.A. Adeshina, C.J. Taylor, and T.F. Cootes, "Combining Local and Global Shape Models for Deformable Object Matching," British Machine Vision Assoc., 2009.
[38] M. Valstar and M. Pantic, "Induced Disgust, Happiness and Surprise: An Addition to the MMI Facial Expression Database," Proc. Int'l Conf. Language Resources and Evaluation, Workshop EMOTION, pp. 65-70, 2010.
[39] M. Valstar and M. Pantic, "Fully Automatic Recognition of the Temporal Phases of Facial Actions," IEEE Tarns. Systems, Man, and Cybernetics, Part B, vol. 42, no. 1, pp. 28-43, Feb. 2011.
[40] M.F. Valstar, B. Martinez, X. Binefa, and M. Pantic, "Facial Point Detection Using Boosted Regression and Graph Models," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 2729-2736, 2010.
[41] P. Viola and M. Jones, "Robust Real-Time Object Detection," Int'l J. Computer Vision, vol. 57, no. 2, pp. 137-154, 2002.
[42] D. Vukadinovic and M. Pantic, "Fully Automatic Facial Feature Point Detection Using Gabor Feature Based Boosted Classifiers," Proc. IEEE Int'l Conf. Systems, Man, and Cybernetics, vol. 2, pp. 1692-1698, 2005.
[43] O.M.C. Williams, A. Blake, and R. Cipolla, "Sparse Bayesian Learning for Efficient Visual Tracking," IEEE Trans. Pattern Analysis Machine Intelligence, vol. 27, no. 8, pp. 1292-1304, Aug. 2005.
[44] S.K. Zhou and D. Comaniciu, "Shape Regression Machine," Proc. 20th Int'l Conf. Information Processing in Medical Imaging, pp. 13-25, 2007.
51 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool