Issue No.11 - November (2011 vol.33)
pp: 2174-2187
Daozheng Chen , University of Maryland, College Park
Mustafa Bilgic , Illinois Institute of Technology, Chicago
Lise Getoor , University of Maryland, College Park
David Jacobs , University of Maryland, College Park
ABSTRACT
Large stores of digital video pose severe computational challenges to existing video analysis algorithms. In applying these algorithms, users must often trade off processing speed for accuracy, as many sophisticated and effective algorithms require large computational resources that make it impractical to apply them throughout long videos. One can save considerable effort by applying these expensive algorithms sparingly, directing their application using the results of more limited processing. We show how to do this for retrospective video analysis by modeling a video using a chain graphical model and performing inference both to analyze the video and to direct processing. We apply our method to problems in background subtraction and face detection, and show in experiments that this leads to significant improvements over baseline algorithms.
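The idea of using inference on a chain model to decide where expensive processing should go can be illustrated with a small sketch. It is not the authors' algorithm. It assumes a two-state hidden Markov chain over frames (for example, "no face" versus "face"), per-frame likelihoods from a cheap detector, and a simple greedy rule that spends a fixed budget on the frames whose posterior is most uncertain. The function names, the transition matrix, and the likelihood values are all hypothetical.

```python
import math

def forward_backward(prior, trans, likelihoods):
    """Posterior marginals P(state_t | all cheap observations) on a chain."""
    n, k = len(likelihoods), len(prior)

    def normalize(v):
        z = sum(v)
        return [x / z for x in v]

    # Forward pass: alpha[t][s] is proportional to P(state_t = s, obs_0..t).
    alpha = [normalize([prior[s] * likelihoods[0][s] for s in range(k)])]
    for t in range(1, n):
        alpha.append(normalize([
            likelihoods[t][s] * sum(alpha[t - 1][r] * trans[r][s] for r in range(k))
            for s in range(k)
        ]))

    # Backward pass: beta[t][s] is proportional to P(obs_t+1..n-1 | state_t = s).
    beta = [[1.0] * k for _ in range(n)]
    for t in range(n - 2, -1, -1):
        beta[t] = normalize([
            sum(trans[s][r] * likelihoods[t + 1][r] * beta[t + 1][r] for r in range(k))
            for s in range(k)
        ])

    # Combine the two passes into per-frame posteriors.
    return [normalize([alpha[t][s] * beta[t][s] for s in range(k)]) for t in range(n)]

def entropy(p):
    """Shannon entropy (in nats) of a discrete distribution."""
    return -sum(x * math.log(x) for x in p if x > 0)

def pick_frames(posteriors, budget):
    """Greedy heuristic (an assumption, not the paper's rule):
    send the `budget` most uncertain frames to the expensive algorithm."""
    ranked = sorted(range(len(posteriors)),
                    key=lambda t: entropy(posteriors[t]), reverse=True)
    return sorted(ranked[:budget])

# Hypothetical inputs: five frames, states 0 = "no face", 1 = "face".
prior = [0.5, 0.5]
trans = [[0.9, 0.1], [0.1, 0.9]]           # states tend to persist across frames
likelihoods = [[0.9, 0.1], [0.9, 0.1],     # cheap detector says "no face"
               [0.5, 0.5],                 # cheap detector is uninformative
               [0.1, 0.9], [0.1, 0.9]]     # cheap detector says "face"

posteriors = forward_backward(prior, trans, likelihoods)
chosen = pick_frames(posteriors, budget=1)
print(chosen)
```

In this toy setting the budget goes to the frame where the cheap evidence and its temporal neighbors leave the state ambiguous. An uncertainty ranking like this ignores how much an expensive observation at one frame would reduce uncertainty at neighboring frames, which is what a value-of-information formulation would account for.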
INDEX TERMS
Video processing, resource allocation, graphical models, optimization, background subtraction, face detection, dynamic programming.
CITATION
Daozheng Chen, Mustafa Bilgic, Lise Getoor, David Jacobs, "Dynamic Processing Allocation in Video," IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 33, no. 11, pp. 2174-2187, November 2011, doi:10.1109/TPAMI.2011.55