The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.04 - April (2009 vol.31)
pp: 607-626
Erik Murphy-Chutorian , Google Inc., Mountain View
Mohan Manubhai Trivedi , University of California, San Diego, La Jolla
ABSTRACT
The capacity to estimate the head pose of another person is a common human ability that presents a unique challenge for computer vision systems. Compared to face detection and recognition, which have been the primary foci of face-related vision research, identity-invariant head pose estimation has fewer rigorously evaluated systems or generic solutions. In this paper, we discuss the inherent difficulties in head pose estimation and present an organized survey describing the evolution of the field. Our discussion focuses on the advantages and disadvantages of each approach and spans 90 of the most innovative and characteristic papers that have been published on this topic. We compare these systems by focusing on their ability to estimate coarse and fine head pose, highlighting approaches that are well suited for unconstrained environments.
INDEX TERMS
Introductory and Survey, Computer vision, Modeling and recovery of physical attributes, Human-centered computing, Vision I/O, Face and gesture recognition, Evaluation/methodology
CITATION
Erik Murphy-Chutorian, Mohan Manubhai Trivedi, "Head Pose Estimation in Computer Vision: A Survey", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol.31, no. 4, pp. 607-626, April 2009, doi:10.1109/TPAMI.2008.106
REFERENCES
[1] S. Ba and J.-M. Odobez, “A Probabilistic Framework for Joint Head Tracking and Pose Estimation,” Proc. 17th Int'l Conf. Pattern Recognition, pp. 264-267, 2004.
[2] S. Ba and J.-M. Odobez, “From Camera Head Pose to 3D Global Room Head Pose Using Multiple Camera Views,” Proc. Int'l Workshop Classification of Events, Activities and Relationships, 2007.
[3] S. Baker, I. Matthews, J. Xiao, R. Gross, T. Kanade, and T. Ishikawa, “Real-Time Non-Rigid Driver Head Tracking for Driver Mental State Estimation,” Proc. 11th World Congress Intelligent Transportation Systems, 2004.
[4] V. Balasubramanian, J. Ye, and S. Panchanathan, “Biased Manifold Embedding: A Framework for Person-Independent Head Pose Estimation,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2007.
[5] S. Basu, T. Choudhury, B. Clarkson, and A. Pentland, “Towards Measuring Human Interactions in Conversational Settings,” Proc. IEEE Int'l Workshop Cues in Comm., 2001.
[6] M. Belkin and P. Niyogi, “Laplacian Eigenmaps for Dimensionality Reduction and Data Representation,” Neural Computation, vol. 15, no. 6, pp. 1373-1396, 2003.
[7] D. Beymer, “Face Recognition Under Varying Pose,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 756-761, 1994.
[8] C. Bishop, Neural Networks for Pattern Recognition. Oxford Univ. Press, 1995.
[9] K. Bowyer and S. Sarkar, “USF HumanID 3D Face Data Set,” 2001.
[10] L. Brown and Y.-L. Tian, “Comparative Study of Coarse Head Pose Estimation,” Proc. IEEE Workshop Motion and Video Computing, pp. 125-130, 2002.
[11] J. Bruske, E. Abraham-Mumm, J. Pauli, and G. Sommer, “Head-Pose Estimation from Facial Images with Subspace Neural Networks,” Proc. Int'l Conf. Neural Networks and Brain, pp. 528-531, 1998.
[12] C. Canton-Ferrer, J. Casas, and M. Pardàs, “Head Pose Detection Based on Fusion of Multiple Viewpoint Information,” Multimodal Technologies for Perception of Humans: Proc. First Int'l Workshop Classification of Events, Activities and Relationships, R. Stiefelhagen and J. Garofolo, eds., pp. 305-310, 2007.
[13] C. Canton-Ferrer, J. Casas, and M. Pardàs, “Head Orientation Estimation Using Particle Filtering in Multiview Scenarios,” Proc. Int'l Workshop Classification of Events, Activities and Relationships, 2007.
[14] M.L. Cascia, S. Sclaroff, and V. Athitsos, “Fast, Reliable Head Tracking Under Varying Illumination: An Approach Based on Registration of Texture-Mapped 3D Models,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 22, no. 4, pp. 322-336, Apr. 2000.
[15] I. Chen, L. Zhang, Y. Hu, M. Li, and H. Zhang, “Head Pose Estimation Using Fisher Manifold Learning,” Proc. IEEE Int'l Workshop Analysis and Modeling of Faces and Gestures, pp. 203-207, 2003.
[16] S. Cheng, S. Park, and M. Trivedi, “Multi-Spectral and Multi-Perspective Video Arrays for Driver Body Tracking and Activity Analysis,” Computer Vision and Image Understanding, vol. 106, nos.2-3, 2006.
[17] T. Cootes, C. Taylor, D. Cooper, and J. Graham, “Active Shape Models—Their Training and Application,” Computer Vision and Image Understanding, vol. 61, no. 1, pp. 38-59, 1995.
[18] T. Cootes, K. Walker, and C. Taylor, “View-Based Active Appearance Models,” Proc. IEEE Int'l Conf. Automatic Face and Gesture Recognition, pp. 227-232, 2000.
[19] T. Cootes, G. Edwards, and C. Taylor, “Active Appearance Models,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 23, no. 6, pp. 681-685, June 2001.
[20] M. Cordea, E. Petriu, N. Georganas, D. Petriu, and T. Whalen, “Real-Time 2(1/2)-D Head Pose Recovery for Model-Based Video-Coding,” IEEE Trans. Instrumentation and Measurement, vol. 50, no. 4, pp. 1007-1013, 2001.
[21] D. DeCarlo and D. Metaxas, “Optical Flow Constraints on Deformable Models with Applications to Face Tracking,” Int'l J. Computer Vision, vol. 38, no. 2, pp. 231-238, 2000.
[22] F. Dornaika and F. Davoine, “Head and Facial Animation Tracking Using Appearance-Adaptive Models and Particle Filters,” Proc. IEEE Conf. Computer Vision and Pattern Recognition Workshop, pp. 153-162, 2004.
[23] R. Duda, P. Hart, and D. Stork, Pattern Classification, second ed. John Wiley & Sons, 2001.
[24] G.J. Edwards, A. Lanitis, C.J. Taylor, and T.F. Cootes, “Statistical Models of Face Images—Improving Specificity,” Image Vision Computing, vol. 16, no. 3, pp. 203-211, 1998.
[25] B. Fasel and J. Luettin, “Automatic Facial Expression Analysis: A Survey,” Pattern Recognition, vol. 36, no. 1, pp. 259-275, 2003.
[26] V.F. Ferrario, C. Sforza, G. Serrao, G. Grassi, and E. Mossi, “Active Range of Motion of the Head and Cervical Spine: A Three-Dimensional Investigation in Healthy Young Adults,” J. Orthopaedic Research, vol. 20, no. 1, pp. 122-129, 2002.
[27] Y. Fu and T. Huang, “Graph Embedded Analysis for Head Pose Estimation,” Proc. IEEE Int'l Conf. Automatic Face and Gesture Recognition, pp. 3-8, 2006.
[28] Y. Fu and T.S. Huang, “hMouse: Head Tracking Driven Virtual Computer Mouse,” Proc. Eighth IEEE Workshop Applications of Computer Vision, pp. 30-35, 2007.
[29] W. Gao, B. Cao, S. Shan, X. Zhang, and D. Zhou, “The CAS-PEAL Large-Scale Chinese Face Database and Baseline Evaluations,” Technical Report JDL-TR-04-FR-001, Joint Research and Development Laboratory, 2004.
[30] A. Gee and R. Cipolla, “Determining the Gaze of Faces in Images,” Image and Vision Computing, vol. 12, no. 10, pp. 639-647, 1994.
[31] A. Gee and R. Cipolla, “Fast Visual Tracking by Temporal Consensus,” Image and Vision Computing, vol. 14, no. 2, pp. 105-114, 1996.
[32] R. Gonzalez and R. Woods, Digital Image Processing, second ed., pp. 582-584. Prentice-Hall, 2002.
[33] N. Gourier, D. Hall, and J. Crowley, “Estimating Face Orientation from Robust Detection of Salient Facial Structures,” Proc. ICPR Workshop Visual Observation of Deictic Gestures, pp. 17-25, 2004.
[34] N. Gourier, J. Maisonnasse, D. Hall, and J. Crowley, “Head Pose Estimation on Low Resolution Images,” Multimodal Technologies for Perception of Humans: Proc. First Int'l Workshop Classification of Events, Activities and Relationships, R. Stiefelhagen and J. Garofolo, eds., pp. 270-280, 2007.
[35] Z. Gui and C. Zhang, “3D Head Pose Estimation Using Non-Rigid Structure-from-Motion and Point Correspondence,” Proc. IEEE Region 10 Conf., pp. 1-3, 2006.
[36] Z. Guo, H. Liu, Q. Wang, and J. Yang, “A Fast Algorithm Face Detection and Head Pose Estimation for Driver Assistant System,” Proc. Eighth Int'l Conf. Signal Processing, vol. 3, 2006.
[37] M. Harville, A. Rahimi, T. Darrell, G. Gordon, and J. Woodfill, “3D Pose Tracking with Linear Depth and Brightness Constraints,” Proc. IEEE Int'l Conf. Computer Vision, pp. 206-213, 1999.
[38] S. Haykin, Adaptive Filter Theory, fourth ed. Prentice-Hall, 2002.
[39] X. He, S. Yan, Y. Hu, and H.J. Zhang, “Learning a Locality Preserving Subspace for Visual Recognition,” Proc. IEEE Int'l Conf. Computer Vision, pp. 385-392, 2003.
[40] J. Heinzmann and A. Zelinsky, “3-D Facial Pose and Gaze Point Estimation Using a Robust Real-Time Tracking Paradigm,” Proc. IEEE Int'l Conf. Automatic Face and Gesture Recognition, pp. 142-147, 1998.
[41] E. Hjelmås and B. Low, “Face Detection: A Survey,” Computer Vision and Image Understanding, vol. 83, no. 3, pp. 236-274, 2001.
[42] T. Horprasert, Y. Yacoob, and L. Davis, “Computing 3-D Head Orientation from a Monocular Image Sequence,” Proc. IEEE Int'l Conf. Automatic Face and Gesture Recognition, pp. 242-247, 1996.
[43] T. Horprasert, Y. Yacoob, and L. Davis, “An Anthropometric Shape Model for Estimating Head Orientation,” Proc. Third Int'l Workshop Visual Form, pp. 247-256, 1997.
[44] C. Hu, J. Xiao, I. Matthews, S. Baker, J. Cohn, and T. Kanade, “Fitting a Single Active Appearance Model Simultaneously to Multiple Images,” Proc. British Machine Vision Conf., pp. 437-446, 2004.
[45] N. Hu, W. Huang, and S. Ranganath, “Head Pose Estimation by Non-Linear Embedding and Mapping,” Proc. IEEE Int'l Conf. Image Processing, vol. 2, pp. 342-345, 2005.
[46] Y. Hu, L. Chen, Y. Zhou, and H. Zhang, “Estimating Face Pose by Facial Asymmetry and Geometry,” Proc. IEEE Int'l Conf. Automatic Face and Gesture Recognition, pp. 651-656, 2004.
[47] J. Huang, X. Shao, and H. Wechsler, “Face Pose Discrimination Using Support Vector Machines (SVM),” Proc. 14th Int'l Conf. Pattern Recognition, pp. 154-156, 1998.
[48] K. Huang and M. Trivedi, “Video Arrays for Real-Time Tracking of Person, Head, and Face in an Intelligent Room,” Machine Vision and Applications, vol. 14, no. 2, pp. 103-111, 2003.
[49] K. Huang and M. Trivedi, “Robust Real-Time Detection, Tracking, and Pose Estimation of Faces in Video Streams,” Proc. 17th Int'l Conf. Pattern Recognition, pp. 965-968, 2004.
[50] K. Huang, M. Trivedi, and T. Gandhi, “Driver's View and Vehicle Surround Estimation Using Omnidirectional Video Stream,” Proc. IEEE Intelligent Vehicles Symp., pp. 444-449, 2003.
[51] M. Isard and A. Blake, “CONDENSATION—Conditional Density Propagation for Visual Tracking,” Int'l J. Computer Vision, vol. 29, no. 1, pp. 5-28, 1998.
[52] T. Jebara and A. Pentland, “Parametrized Structure from Motion for 3D Adaptive Feedback Tracking of Faces,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 144-150, 1997.
[53] M. Jones and P. Viola, “Fast Multi-View Face Detection,” Technical Report 096, Mitsubishi Electric Research Laboratories, 2003.
[54] H. Kobayasi and S. Kohshima, “Unique Morphology of the Human Eye,” Nature, vol. 387, no. 6635, pp. 767-768, 1997.
[55] N. Krüger, M. Pötzsch, and C. von der Malsburg, “Determination of Face Position and Pose with a Learned Representation Based on Labeled Graphs,” Image and Vision Computing, vol. 15, no. 8, pp.665-673, 1997.
[56] V. Krüger and G. Sommer, “Gabor Wavelet Networks for Efficient Head Pose Estimation,” Image and Vision Computing, vol. 20, nos.9-10, pp. 665-672, 2002.
[57] M. Lades, J.C. Vorbrüggen, J. Buhmann, J. Lange, C. von der Malsburg, R.P. Würtz, and W. Konen, “Distortion Invariant Object Recognition in the Dynamic Link Architecture,” IEEE Trans. Computers, vol. 42, pp. 300-311, 1993.
[58] S. Langton and V. Bruce, “You Must See the Point: Automatic Processing of Cues to the Direction of Social Attention,” J.Experimental Psychology: Human Perception and Performance, vol. 26, no. 2, pp. 747-757, 2000.
[59] S. Langton, H. Honeyman, and E. Tessler, “The Influence of Head Contour and Nose Angle on the Perception of Eye-Gaze Direction,” Perception and Psychophysics, vol. 66, no. 5, pp. 752-771, 2004.
[60] A. Lanitis, C. Taylor, and T. Cootes, “Automatic Interpretation of Human Faces and Hand Gestures Using Flexible Models,” Proc. IEEE Int'l Conf. Automatic Face and Gesture Recognition, pp. 98-103, 1995.
[61] A. Lanitis, C. Taylor, and T. Cootes, “Automatic Interpretation and Coding of Face Images Using Flexible Models,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 19, no. 7, pp. 743-756, July 1997.
[62] Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, “Gradient-Based Learning Applied to Document Recognition,” Proc. IEEE, vol. 86, no. 11, pp. 2278-2324, 1998.
[63] S. Li, Q. Fu, L. Gu, B. Scholkopf, Y. Cheng, and H. Zhang, “Kernel Machine Based Learning for Multi-View Face Detection and Pose Estimation,” Proc. IEEE Int'l Conf. Computer Vision, pp. 674-679, 2001.
[64] Y. Li, S. Gong, and H. Liddell, “Support Vector Regression and Classification Based Multi-View Face Detection and Recognition,” Proc. IEEE Int'l Conf. Automatic Face and Gesture Recognition, pp.300-305, 2000.
[65] Y. Li, S. Gong, J. Sherrah, and H. Liddell, “Support Vector Machine Based Multi-View Face Detection and Recognition,” Image and Vision Computing, vol. 22, no. 5, p. 2004, 2004.
[66] Z. Li, Y. Fu, J. Yuan, T. Huang, and Y. Wu, “Query Driven Localized Linear Discriminant Models for Head Pose Estimation,” Proc. IEEE Int'l Conf. Multimedia and Expo, pp. 1810-1813, 2007.
[67] D. Little, S. Krishna, J. Black, and S. Panchanathan, “A Methodology for Evaluating Robustness of Face Recognition Algorithms with Respect to Variations in Pose Angle and Illumination Angle,” Proc. IEEE Int'l Conf. Acoustics, Speech, and Signal Processing, vol. 2, pp. 89-92, 2005.
[68] D. Lowe, “Distinctive Image Features from Scale-Invariant Keypoints,” Int'l J. Computer Vision, vol. 60, no. 2, pp. 91-110, 2004.
[69] B. Ma, W. Zhang, S. Shan, X. Chen, and W. Gao, “Robust Head Pose Estimation Using LGBP,” Proc. 18th Int'l Conf. Pattern Recognition, pp. 512-515, 2006.
[70] Y. Ma, Y. Konishi, K. Kinoshita, S. Lao, and M. Kawade, “Sparse Bayesian Regression for Head Pose Estimation,” Proc. 18th Int'l Conf. Pattern Recognition, pp. 507-510, 2006.
[71] M. Malciu and F. Preteux, “A Robust Model-Based Approach for 3D Head Tracking in Video Sequences,” Proc. IEEE Int'l Conf. Automatic Face and Gesture Recognition, pp. 169-174, 2000.
[72] I. Matthews and S. Baker, “Active Appearance Models Revisited,” Int'l J. Computer Vision, vol. 60, no. 2, pp. 135-164, 2004.
[73] T. Maurer and C. von der Malsburg, “Tracking and Learning Graphs and Pose on Image Sequences of Faces,” Proc. IEEE Int'l Conf. Automatic Face and Gesture Recognition, pp. 176-181, 1996.
[74] I. McCowan, D. Gatica-Perez, S. Bengio, G. Lathoud, M. Barnard, and D. Zhang, “Automatic Analysis of Multimodal Group Actions in Meetings,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 27, no. 3, pp. 305-317, Mar. 2005.
[75] S. McKenna and S. Gong, “Real-Time Face Pose Estimation,” Real-Time Imaging, vol. 4, no. 5, pp. 333-347, 1998.
[76] T. Moeslund, A. Hilton, and V. Krüger, “A Survey of Computer Vision-Based Human Motion Capture,” Computer Vision and Image Understanding, vol. 81, no. 3, pp. 231-268, 2001.
[77] T. Moeslund, A. Hilton, and V. Krüger, “A Survey of Advances in Vision-Based Human Motion Capture and Analysis,” Computer Vision and Image Understanding, vol. 104, no. 2, pp. 90-126, 2006.
[78] H. Moon and M. Miller, “Estimating Facial Pose from a Sparse Representation,” Proc. IEEE Int'l Conf. Image Processing, pp. 75-78, 2004.
[79] M. Morales, P. Mundy, C. Delgado, M. Yale, R. Neal, and H. Schwartz, “Gaze Following, Temperament, and Language Development in 6-Month-Olds: A Replica and Extension,” Infant Behavior and Development, vol. 23, no. 2, pp. 231-236, 2000.
[80] L.-P. Morency, A. Rahimi, N. Checka, and T. Darrell, “Fast Stereo-Based Head Tracking for Interactive Environments,” Proc. IEEE Int'l Conf. Automatic Face and Gesture Recognition, pp. 375-380, 2002.
[81] L.-P. Morency, A. Rahimi, and T. Darrell, “Adaptive View-Based Appearance Models,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 803-810, 2003.
[82] L.-P. Morency, P. Sundberg, and T. Darrell, “Pose Estimation Using 3D View-Based Eigenspaces,” Proc. IEEE Int'l Workshop Analysis and Modeling of Faces and Gestures, pp. 45-52, 2003.
[83] L.-P. Morency, C. Christoudias, and T. Darrell, “Recognizing Gaze Aversion Gestures in Embodied Conversational Discourse,” Proc. Int'l Conf. Multimodal Interfaces, pp. 287-294, 2006.
[84] L.-P. Morency, C. Sidner, C. Lee, and T. Darrell, “Head Gestures for Perceptual Interfaces: The Role of Context in Improving Recognition,” Artificial Intelligence, vol. 171, nos. 8-9, pp. 568-585, 2007.
[85] E. Murphy-Chutorian and M. Trivedi, “Hybrid Head Orientation and Position Estimation (HyHOPE): A System and Evaluation for Driver Support,” Proc. IEEE Intelligent Vehicles Symp., 2008.
[86] E. Murphy-Chutorian and M. Trivedi, “Head Pose Estimation for Driver Assistance Systems: A Robust Algorithm and Experimental Evaluation,” Proc. 10th Int'l IEEE Conf. Intelligent Transportation Systems, pp. 709-714, 2007.
[87] R. Newman, Y. Matsumoto, S. Rougeaux, and A. Zelinsky, “Real-Time Stereo Tracking for Head Pose and Gaze Estimation,” Proc. IEEE Int'l Conf. Automatic Face and Gesture Recognition, pp. 122-128, 2000.
[88] J. Ng and S. Gong, “Composite Support Vector Machines for Detection of Faces Across Views and Pose Estimation,” Image and Vision Computing, vol. 20, nos. 5-6, pp. 359-368, 2002.
[89] J. Ng and S. Gong, “Multi-View Face Detection and Pose Estimation Using a Composite Support Vector Machine Across the View Sphere,” Proc. IEEE Int'l Workshop Recognition, Analysis, and Tracking of Faces and Gestures in Real-Time Systems, pp. 14-21, 1999.
[90] A. Nikolaidis and I. Pitas, “Facial Feature Extraction and Pose Determination,” Pattern Recognition, vol. 33, no. 11, pp. 1783-1791, 2000.
[91] S. Niyogi and W. Freeman, “Example-Based Head Tracking,” Proc. IEEE Int'l Conf. Automatic Face and Gesture Recognition, pp.374-378, 1996.
[92] J.-M. Odobez and S. Ba, “A Cognitive and Unsupervised Map Adaptation Approach to the Recognition of the Focus of Attention from Head Pose,” Proc. IEEE Int'l Conf. Multimedia and Expo, pp.1379-1382, 2007.
[93] S. Ohayon and E. Rivlin, “Robust 3D Head Tracking Using Camera Pose Estimation,” Proc. 18th Int'l Conf. Pattern Recognition, pp. 1063-1066, 2006.
[94] K. Oka, Y. Sato, Y. Nakanishi, and H. Koike, “Head Pose Estimation System Based on Particle Filtering with Adaptive Diffusion Control,” Proc. IAPR Conf. Machine Vision Applications, pp. 586-589, 2005.
[95] R. Osadchy, M. Miller, and Y. LeCun, “Synergistic Face Detection and Pose Estimation with Energy-Based Model,” Proc. Advances in Neural Information Processing Systems, pp. 1017-1024, 2004.
[96] R. Osadchy, M. Miller, and Y. LeCun, “Synergistic Face Detection and Pose Estimation with Energy-Based Models,” J. Machine Learning Research, vol. 8, pp. 1197-1215, 2007.
[97] E. Osuna, R. Freund, and F. Girosi, “Training Support Vector Machines: An Application to Face Detection,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 130-136, 1997.
[98] R. Pappu and P. Beardsley, “A Qualitative Approach to Classifying Gaze Direction,” Proc. IEEE Int'l Conf. Automatic Face and Gesture Recognition, pp. 160-165, 1998.
[99] A. Pentland and T. Choudhury, “Face Recognition for Smart Environments,” Computer, vol. 33, no. 2, pp. 50-55, Feb. 2000.
[100] R. Rae and H. Ritter, “Recognition of Human Head Orientation Based on Artificial Neural Networks,” IEEE Trans. Neural Networks, vol. 9, no. 2, pp. 257-265, 1998.
[101] B. Raytchev, I. Yoda, and K. Sakaue, “Head Pose Estimation by Nonlinear Manifold Learning,” Proc. 17th Int'l Conf. Pattern Recognition, pp. 462-466, 2004.
[102] S. Roweis and L. Saul, “Nonlinear Dimensionality Reduction by Locally Linear Embedding,” Science, vol. 290, no. 5500, pp. 2323-2326, 2000.
[103] H. Rowley, S. Baluja, and T. Kanade, “Rotation Invariant Neural Network-Based Face Detection,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 38-44, 1998.
[104] H. Rowley, S. Baluja, and T. Kanade, “Neural Network-Based Face Detection,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 20, no. 1, pp. 23-38, Jan. 1998.
[105] T. Rueda-Domingo, P. Lardelli-Claret, J.L. del Castillo, J. Jiménez-Moleón, M. García-Martín, and A. Bueno-Cavanillas, “The Influence of Passengers on the Risk of the Driver Causing a Car Collision in Spain,” Accident Analysis & Prevention, vol. 36, no. 3, pp. 481-489, 2004.
[106] B. Schiele and A. Waibel, “Gaze Tracking Based on Face-Color,” Proc. IEEE Int'l Conf. Automatic Face and Gesture Recognition, pp.344-349, 1995.
[107] A. Schödl, A. Haro, and I. Essa, “Head Tracking Using a Textured Polygonal Model,” Proc. Workshop Perceptual User Interfaces, 1998.
[108] E. Seemann, K. Nickel, and R. Stiefelhagen, “Head Pose Estimation Using Stereo Vision for Human-Robot Interaction,” Proc. IEEE Int'l Conf. Automatic Face and Gesture Recognition, pp. 626-631, 2004.
[109] J. Sherrah and S. Gong, “Fusion of Perceptual Cues for Robust Tracking of Head Pose and Position,” Pattern Recognition, vol. 34, no. 8, pp. 1565-1572, 2001.
[110] J. Sherrah, S. Gong, and E.-J. Ong, “Understanding Pose Discrimination in Similarity Space,” Proc. British Machine Vision Conf., pp. 523-532, 1999.
[111] J. Sherrah, S. Gong, and E.-J. Ong, “Face Distributions in Similarity Space under Varying Head Pose,” Image and Vision Computing, vol. 19, no. 12, pp. 807-819, 2001.
[112] T. Sim, S. Baker, and M. Bsat, “The CMU Pose, Illumination, and Expression Database,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 25, no. 12, pp. 1615-1618, Dec. 2003.
[113] HOIP Face Database, Softopia, http://www.softopia.or.jp/en/rdfacedb.html , 2008.
[114] S. Srinivasan and K. Boyer, “Head Pose Estimation Using View Based Eigenspaces,” Proc. 16th Int'l Conf. Pattern Recognition, pp.302-305, 2002.
[115] R. Stiefelhagen, J. Yang, and A. Waibel, “Modeling Focus of Attention for Meeting Indexing Based on Multiple Cues,” IEEE Trans. Neural Networks, vol. 13, no. 4, pp. 928-938, 2002.
[116] R. Stiefelhagen, “Tracking Focus of Attention in Meetings,” Proc. Int'l Conf. Multimodal Interfaces, pp. 273-280, 2002.
[117] R. Stiefelhagen, “Estimating Head Pose with Neural Networks—Results on the Pointing04 ICPR Workshop Evaluation Data,” Proc. ICPR Workshop Visual Observation of Deictic Gestures, 2004.
[118] R. Stiefelhagen, K. Bernardin, R.B.J. Garofolo, D. Mostefa, and P. Soundararajan, “The CLEAR 2006 Evaluation,” Multimodal Technologies for Perception of Humans: Proc. First Int'l Workshop Classification of Events, Activities and Relationships, R. Stiefelhagen and J. Garofolo, eds., pp. 1-44, 2007.
[119] J. Tenenbaum, V. de Silva, and J. Langford, “A Global Geometric Framework for Nonlinear Dimensionality Reduction,” Science, vol. 290, pp. 2319-2323, 2000.
[120] Y.-L. Tian, L. Brown, J. Connell, S. Pankanti, A. Hampapur, A. Senior, and R. Bolle, “Absolute Head Pose Estimation from Overhead Wide-Angle Cameras,” Proc. IEEE Int'l Workshop Analysis and Modeling of Faces and Gestures, pp. 92-99, 2003.
[121] K. Toyama, ““look, ma—no hands!” Hands-Free Cursor Control with Real-Time 3D Face Tracking,” Proc. Workshop Perceptual User Interfaces, pp. 49-54, 1998.
[122] M. Trivedi, “Human Movement Capture and Analysis in Intelligent Environments,” Machine Vision and Applications, vol. 14, no. 4, pp. 215-217, 2003.
[123] M. Trivedi, K. Huang, and I. Mikić, “Dynamic Context Capture and Distributed Video Arrays for Intelligent Spaces,” IEEE Trans. Systems, Man, and Cybernetics, Part A, vol. 35, no. 1, pp. 145-163, 2005.
[124] J. Tu, T. Huang, and H. Tao, “Accurate Head Pose Tracking in Low Resolution Video,” Proc. IEEE Int'l Conf. Automatic Face and Gesture Recognition, pp. 573-578, 2006.
[125] J. Tu, Y. Fu, Y. Hu, and T. Huang, “Evaluation of Head Pose Estimation for Studio Data,” Multimodal Technologies for Perception of Humans: Proc. First Int'l Workshop Classification of Events, Activities and Relationships, R. Stiefelhagen and J. Garofolo, eds., pp. 281-290, 2007.
[126] P. Viola and M. Jones, “Rapid Object Detection Using a Boosted Cascade of Simple Features,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 511-518, 2001.
[127] M. Voit, CLEAR 2007 Evaluation Plan: Head Pose Estimation, http://isl.ira.uka.de/~mvoit/clear07CLEAR07_HEADPOSE_ 2007-03-26.doc , 2007.
[128] M. Voit, K. Nickel, and R. Stiefelhagen, “A Bayesian Approach for Multi-View Head Pose Estimation,” Proc. IEEE Int'l Conf. Multisensor Fusion and Integration for Intelligent Systems, pp. 31-34, 2006.
[129] M. Voit, K. Nickel, and R. Stiefelhagen, “Neural Network-Based Head Pose Estimation and Multi-View Fusion,” Multimodal Technologies for Perception of Humans: Proc. First Int'l Workshop Classification of Events, Activities and Relationships, R. Stiefelhagen and J. Garofolo, eds., pp. 291-298, 2007.
[130] M. Voit, K. Nickel, and R. Stiefelhagen, “Head Pose Estimation in Single- and Multi-View Environments Results on the CLEAR'07 Benchmarks,” Proc. Int'l Workshop Classification of Events, Activities and Relationships, 2007.
[131] A. Waibel, T. Schultz, M. Bett, M. Denecke, R. Malkin, I. Rogina, R. Stiefelhagen, and J. Yang, “SMaRT: The Smart Meeting Room Task at ISL,” Proc. IEEE Int'l Conf. Acoustics, Speech, and Signal Processing, vol. 4, pp. 752-755, 2003.
[132] J.-G. Wang and E. Sung, “EM Enhancement of 3D Head Pose Estimated by Point at Infinity,” Image and Vision Computing, vol. 25, no. 12, pp. 1864-1874, 2007.
[133] H. Wilson, F. Wilkinson, L. Lin, and M. Castillo, “Perception of Head Orientation,” Vision Research, vol. 40, no. 5, pp. 459-472, 2000.
[134] W.H. Wollaston, “On the Apparent Direction of Eyes in a Portrait,” Philosophical Trans. Royal Soc. of London, vol. 114, pp.247-256, 1824.
[135] J.-W. Wu and M. Trivedi, “Visual Modules for Head Gesture Analysis in Intelligent Vehicle Systems,” Proc. IEEE Intelligent Vehicles Symp., pp. 13-18, 2006.
[136] J. Wu and M. Trivedi, “A Two-Stage Head Pose Estimation Framework and Evaluation,” Pattern Recognition, vol. 41, no. 3, pp.1138-1158, 2008.
[137] J. Wu, J. Pedersen, D. Putthividhya, D. Norgaard, and M.M. Trivedi, “A Two-Level Pose Estimation Framework Using Majority Voting of Gabor Wavelets and Bunch Graph Analysis,” Proc. ICPR Workshop Visual Observation of Deictic Gestures, 2004.
[138] Y. Wu and K. Toyama, “Wide-Range, Person- and Illumination-Insensitive Head Orientation Estimation,” Proc. IEEE Int'l Conf. Automatic Face and Gesture Recognition, pp. 183-188, 2000.
[139] J. Xiao, T. Moriyama, T. Kanade, and J. Cohn, “Robust Full-Motion Recovery of Head by Dynamic Templates and Re-Registration Techniques,” Int'l J. Imaging Systems and Technology, vol. 13, no. 1, pp. 85-94, 2003.
[140] J. Xiao, S. Baker, I. Matthews, and T. Kanade, “Real-Time Combined $2{\rm D}+3{\rm D}$ Active Appearance Models,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 2, pp. 535-542, 2004.
[141] Y. Xiong and F. Quek, “Meeting Room Configuration and Multiple Cameras Calibration in Meeting Analysis,” Proc. Int'l Conf. Multimodal Interfaces, pp. 1-8, 2005.
[142] S. Yan, Z. Zhang, Y. Fu, Y. Hu, J. Tu, and T. Huang, “Learning a Person-Independent Representation for Precise 3D Pose Estimation,” Proc. Int'l Workshop Classification of Events, Activities and Relationships, 2007.
[143] M.-H. Yang, D. Kriegman, and N. Ahuja, “Detecting Faces in Images: A Survey,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 24, no. 1, pp. 34-58, Jan. 2002.
[144] R. Yang and Z. Zhang, “Model-Based Head Pose Tracking with Stereovision,” Proc. IEEE Int'l Conf. Automatic Face and Gesture Recognition, pp. 242-247, 2002.
[145] P. Yao, G. Evans, and A. Calway, “Using Affine Correspondence to Estimate 3-D Facial Pose,” Proc. IEEE Int'l Conf. Image Processing, pp. 919-922, 2001.
[146] Z. Zhang, Y. Hu, M. Liu, and T. Huang, “Head Pose Estimation in Seminar Room Using Multi View Face Detectors,” Multimodal Technologies for Perception of Humans: Proc. First Int'l Workshop Classification of Events, Activities and Relationships, R. Stiefelhagen and J. Garofolo, eds., pp. 299-304, 2007.
[147] G. Zhao, L. Chen, J. Song, and G. Chen, “Large Head Movement Tracking Using SIFT-Based Registration,” Proc. ACM Int'l Conf. Multimedia, pp. 807-810, 2007.
[148] L. Zhao, G. Pingali, and I. Carlbom, “Real-Time Head Orientation Estimation Using Neural Networks,” Proc. IEEE Int'l Conf. Image Processing, pp. 297-300, 2002.
[149] W. Zhao, R. Chellappa, P. Phillips, and A. Rosenfeld, “Face Recognition: A Literature Survey,” ACM Computing Surveys, vol. 35, no. 4, pp. 399-458, 2003.
[150] Y. Zhu and K. Fujimura, “3D Head Pose Estimation with Optical Flow and Depth Constraints,” Proc. Fourth Int'l Conf. 3-D Digital Imaging and Modeling, pp. 211-216, 2003.
[151] Y. Zhu and K. Fujimura, “Head Pose Estimation for Driver Monitoring,” Proc. IEEE Intelligent Vehicles Symp., pp. 501-506, 2004.
57 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool