Issue No.08 - August (2011 vol.33)
pp: 1619-1632
Boris Babenko, University of California, San Diego, La Jolla
Ming-Hsuan Yang, University of California, Merced, Merced
Serge Belongie, University of California, San Diego, La Jolla
ABSTRACT
In this paper, we address the problem of tracking an object in a video given its location in the first frame and no other information. Recently, a class of tracking techniques called “tracking by detection” has been shown to give promising results at real-time speeds. These methods train a discriminative classifier in an online manner to separate the object from the background. This classifier bootstraps itself by using the current tracker state to extract positive and negative examples from the current frame. Slight inaccuracies in the tracker can therefore lead to incorrectly labeled training examples, which degrade the classifier and can cause drift. In this paper, we show that using Multiple Instance Learning (MIL) instead of traditional supervised learning avoids these problems and can therefore lead to a more robust tracker with fewer parameter tweaks. We propose a novel online MIL algorithm for object tracking that achieves superior results with real-time performance. We present thorough experimental results (both qualitative and quantitative) on a number of challenging video clips.
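The difference between labeling individual patches and labeling bags of patches can be illustrated with a short sketch. It is not the authors' implementation: the function names, the sampling radii, and the noisy-OR bag model are assumptions made here for illustration, since the abstract does not specify any of them. The sketch gathers candidate patch locations around the current tracker estimate into a positive bag and a negative bag, and scores a bag as positive when at least one of its instances is likely positive.

```python
import math
from typing import List, Tuple

Point = Tuple[int, int]


def sample_bag(center: Point, inner: float, outer: float) -> List[Point]:
    """Return integer grid locations whose distance d from `center`
    satisfies inner <= d < outer (hypothetical sampling scheme)."""
    cx, cy = center
    r = int(math.ceil(outer))
    bag = []
    for dx in range(-r, r + 1):
        for dy in range(-r, r + 1):
            d = math.hypot(dx, dy)
            if inner <= d < outer:
                bag.append((cx + dx, cy + dy))
    return bag


def noisy_or(instance_probs: List[float]) -> float:
    """Bag-level probability under a noisy-OR model (an assumption here):
    the bag is positive if at least one instance is positive."""
    prob_all_negative = 1.0
    for p in instance_probs:
        prob_all_negative *= 1.0 - p
    return 1.0 - prob_all_negative


# Tracker estimate for the current frame (illustrative value).
location = (10, 10)

# Positive bag: every patch location close to the estimate. Only the bag
# carries the positive label, so a slightly misaligned estimate does not
# force a wrong label onto any single patch.
positive_bag = sample_bag(location, inner=0.0, outer=2.0)

# Negative examples: patch locations in a ring farther from the estimate.
negative_bag = sample_bag(location, inner=3.0, outer=5.0)

# Two instances with probability 0.5 each give a bag probability of 0.75.
bag_probability = noisy_or([0.5, 0.5])
```

In a supervised setup, only the single patch at `location` would be labeled positive, so any error in the estimate becomes a mislabeled training example. Grouping nearby patches into one labeled bag leaves the learner to decide which instance is the best-aligned one.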
INDEX TERMS
Visual tracking, multiple instance learning, online boosting.
CITATION
Boris Babenko, Ming-Hsuan Yang, Serge Belongie, "Robust Object Tracking with Online Multiple Instance Learning", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 33, no. 8, pp. 1619-1632, August 2011, doi:10.1109/TPAMI.2010.226