The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.03 - March (2013 vol.35)
pp: 541-554
V. Mahadevan , Yahoo! Labs., Bangalore, India
N. Vasconcelos , Electr. & Comput. Eningeering Dept., Univ. of California, San Diego, La Jolla, CA, USA
ABSTRACT
A biologically inspired discriminant object tracker is proposed. It is argued that discriminant tracking is a consequence of top-down tuning of the saliency mechanisms that guide the deployment of visual attention. The principle of discriminant saliency is then used to derive a tracker that implements a combination of center-surround saliency, a spatial spotlight of attention, and feature-based attention. In this framework, the tracking problem is formulated as one of continuous target-background classification, implemented in two stages. The first, or learning stage, combines a focus of attention (FoA) mechanism, and bottom-up saliency to identify a maximally discriminant set of features for target detection. The second, or detection stage, uses a feature-based attention mechanism and a target-tuned top-down discriminant saliency detector to detect the target. Overall, the tracker iterates between learning discriminant features from the target location in a video frame and detecting the location of the target in the next. The statistics of natural images are exploited to derive an implementation which is conceptually simple and computationally efficient. The saliency formulation is also shown to establish a unified framework for classifier design, target detection, automatic tracker initialization, and scale adaptation. Experimental results show that the proposed discriminant saliency tracker outperforms a number of state-of-the-art trackers in the literature.
INDEX TERMS
Target tracking, Biology, Visualization, Object detection, Robustness, Adaptation models, video modeling, Object tracking, discriminant tracking, saliency, attention, motion saliency, automatic target initialization, scale adaptive tracking, discriminant center-surround architecture
CITATION
V. Mahadevan, N. Vasconcelos, "Biologically Inspired Object Tracking Using Center-Surround Saliency Mechanisms", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol.35, no. 3, pp. 541-554, March 2013, doi:10.1109/TPAMI.2012.98
REFERENCES
[1] http://homepages.inf.ed.ac.uk/rbfcaviar, 2011.
[2] http://www.svcl.ucsd.edu/projects/tracking results.html, 2012.
[3] A. Adam, E. Rivlin, and I. Shimshoni, "Robust Fragments-Based Tracking Using the Integral Histogram," IEEE Conf. Computer Vision and Pattern Recognition, pp. 798-805, 2006.
[4] E.H. Adelson and J.R. Bergen, "Spatiotemporal Energy Models for the Perception of Motion," J. Optical Soc. of Am. A, vol. 2, no. 2, pp. 284-299, 1985.
[5] R. Allen, P. Mcgeorge, D. Pearson, and A.B. Milne, "Attention and Expertise in Multiple Target Tracking," Applied Cognitive Psychology, vol. 18, no. 3, pp. 337-347, 2004.
[6] S. Avidan, "Ensemble Tracking," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 29, no. 2, pp. 261-271, Feb. 2007.
[7] B. Babenko, M.-H. Yang, and S. Belongie, "Visual Tracking with Online Multiple Instance Learning," IEEE Conf. Computer Vision and Pattern Recognition, pp. 983-990, 2009.
[8] S. Birchfield and S. Rangarajan, "Spatiograms versus Histograms for Region-Based Tracking," IEEE Conf. Computer Vision and Pattern Recognition, pp. 1158-1163, 2005.
[9] M. Black and A. Jepson, "Eigentracking: Robust Matching and Tracking of Articulated Objects Using a View-Based Representation," Int'l J. Computer Vision, vol. 26, no. 1, pp. 63-84, 1998.
[10] E. Blaser, Z. Pylyshyn, and A.O. Holcombe, "Tracking an Object through Feature-Space," Nature, vol. 408, pp. 196-199, 2000.
[11] L. Bretzner and T. Lindeberg, "Feature Tracking with Automatic Selection of Spatial Scales," Computer Vision and Image Understanding, vol. 71, no. 3, pp. 385-392, 1998.
[12] R. Buccigrossi and E. Simoncelli, "Image Compression via Joint Statistical Characterization in the Wavelet Domain," IEEE Trans. Image Processing, vol. 8, no. 12, pp. 1688-1701, Dec. 1999.
[13] P. Cavanagh, "Attention-Based Motion Perception," Science, vol. 257, no. 5076, pp. 1563-1565, 1992.
[14] P. Cavanagh and G.A. Alvarez, "Tracking Multiple Targets with Multifocal Attention," Trends in Cognitive Sciences, vol. 9, no. 7, pp. 349-354, 2005.
[15] J. Cavanaugh, W. Bair, and J. Movshon, "Nature and Interaction of Signals from the Receptive Field Center and Surround in Macaque V1 Neurons," J. Neurophysiology, vol. 88, pp. 2530-2546, 2002.
[16] H. Chernoff, "On the Distribution of the Likelihood Ratio," The Annals of Math. Statistics, vol. 25, no. 3, pp. 573-578, 1954.
[17] R. Collins, "Mean-Shift Blob Tracking through Scale Space," Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 2, 2003.
[18] R. Collins, Y. Liu, and M. Leordeanu, "Online Selection of Discriminative Tracking Features," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 27, no. 10, pp. 1631-1643, Oct. 2005.
[19] D. Comaniciu, V. Ramesh, and P. Meer, "Kernel-Based Object Tracking," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 25, no. 5, pp. 564-577, May 2003.
[20] N. Dalal and B. Triggs, "Histograms of Oriented Gradients for Human Detection," Proc. IEEE Conf. Computer Vision and Pattern Recognition, June 2005.
[21] M.N. Do and M. Vetterli, "Wavelet-Based Texture Retrieval Using Generalized Gaussian Density and Kullback-Leibler Distance," IEEE Trans. Image Processing, vol. 11, no. 2, pp. 146-158, Feb. 2002.
[22] M. Everingham, L. Van Gool, C.K.I. Williams, J. Winn, and A. Zisserman, "The PASCAL Visual Object Classes Challenge (VOC '07) Results," http://www.pascal-network.org/ challenges/ VOC/voc2007/workshopindex.html, 2007.
[23] J. Feldman and P.D. Tremoulet, "Individuation of Visual Objects over Time," Cognition, vol. 99, no. 2, pp. 131-165, 2006.
[24] Y. Freund and R.E. Schapire, "1997, A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting," European Conf. Computational Learning Theory, pp. 23-37, 1995.
[25] D. Gao, S. Han, and N. Vasconcelos, "Discriminant Saliency, the Detection of Suspicious Coincidences, and Applications to Visual Recognition," IEEE Trans. Pattern Anaysis and Machine Intelligence, vol. 31, no. 6, p. 989, June 2009.
[26] D. Gao, V. Mahadevan, and N. Vasconcelos, "On the Plausibility of the Discriminant Center-Surround Hypothesis for Visual Saliency," J. Vision, vol. 8, no. 7, pp. 1-18, 2008.
[27] D. Gao and N. Vasconcelos, "Discriminant Saliency for Visual Recognition from Cluttered Scenes," Proc. Advances in Neural Information Processing Systems, 2005.
[28] D. Gao and N. Vasconcelos, "Bottom-Up Saliency Is a Discriminant Process," Proc. IEEE Int'l Conf. Computer Vision, 2007.
[29] D. Gao and N. Vasconcelos, "Discriminant Interest Points Are Stable," Proc. IEEE Conf. Computer Vision and Pattern Recognition, June 2007.
[30] D. Gao and N. Vasconcelos, "Decision-Theoretic Saliency: Computational Principle, Biological Plausibility, and Implications for Neurophysiology and Psychophysics," Neural Computation, vol. 21, pp. 239-271, Jan. 2009.
[31] H. Grabner and H. Bischof, "On-Line Boosting and Vision," Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 1, pp. 260-267, 2006.
[32] H. Grabner, C. Leistner, and H. Bischof, "Semi-Supervised On-Line Boosting for Robust Tracking," Proc. European Conf. Computer Vision, pp. 234-247, 2008.
[33] B. Han and L. Davis, "On-Line Density-Based Appearance Modeling for Object Tracking," Proc. 10th IEEE Int'l Conf. Computer Vision, pp. 1492-1499, 2005.
[34] S. Han and N. Vasconcelos, "Biologically Plausible Saliency Mechanisms Improve Feedforward Object Recognition," Vision Research, vol. 50, pp. 2295-2307, 2010.
[35] D. Heeger, "Optical Flow from Spatiotemporal Filters," Int'l J. Computer Vision, vol. 1, no. 4, pp. 279-302, 1988.
[36] J. Ho, K. Lee, M. Yang, and D. Kriegman, "Visual Tracking Using Learned Linear Subspaces," Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 1, 2004.
[37] J. Huang and D. Mumford, "Statistics of Natural Images and Models," Proc. IEEE Conf. Computer Vision and Pattern Recogition, pp. 541-547, 1999.
[38] J. Intriligator and P. Cavanagh, "The Spatial Resolution of Visual Attention," Cognitive Psychology, vol. 43, pp. 171-216, 1997.
[39] M. Isard and A. Blake, "Condensation Conditional Density Propagation for Visual Tracking," Int'l J. Computer Vision, vol. 29, pp. 5-28, 1998.
[40] L. Itti and C. Koch, "A Saliency-Based Search Mechanism for Overt and Covert Shifts of Visual Attention," Vision Research, vol. 40, pp. 1489-1506, 2000.
[41] L. Itti, C. Koch, and E. Niebur, "A Model of Saliency-Based Visual Attention for Rapid Scene Analysis," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 20, no. 11, pp. 1254-1259, Nov. 1998.
[42] A.D. Jepson, D.J. Fleet, and T.F. El-Maraghi, "Robust Online Appearance Models for Visual Tracking," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 25, no. 10, pp. 1296-1311, Oct. 2003.
[43] T. Kadir and M. Brady, "Scale, Saliency and Image Description," Int'l J. Computer Vision, vol. 45, pp. 83-105, Nov. 2001.
[44] B. Keane and Z. Pylyshyn, "Is Motion Extrapolation Employed in Multiple Object Tracking? Tracking as a Low-Level, Non-Predictive Function," Cognitive Psychology, vol. 52, no. 4, pp. 346-368, 2006.
[45] R. Lin, D. Ross, J. Lim, and M. Yang, "Adaptive Discriminative Generative Model and Its Applications," Proc. Advances in Neural Information Processing Systems, pp. 801-808, 2004.
[46] E. Maggio and A. Cavallaro, "Hybrid Particle Filter and Mean Shift Tracker with Adaptive Transition Model," Proc. IEEE Int'l Conf. Acoustics, Speech, and Signal Processing, 2005.
[47] V. Mahadevan and N. Vasconcelos, "Background Subtraction in Highly Dynamic Scenes," Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 1, 2008.
[48] V. Mahadevan and N. Vasconcelos, "Spatiotemporal Saliency in Dynamic Scenes," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 32, no. 1, pp. 171-177, Jan. 2010.
[49] T. Makovski and Y. Jiang, "Feature Binding in Attentive Tracking of Distinct Objects," Visual Cognition, vol. 17, no. 1, pp. 180-194, 2009.
[50] H. Masnadi-Shirazi, V. Mahadevan, and N. Vasconcelos, "On the Design of Robust Classifiers for Computer Vision," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 779-786, 2010.
[51] H. Nguyen and A. Smeulders, "Robust Tracking Using Foreground-Background Texture Discrimination," Int'l J. Computer Vision, vol. 69, no. 3, pp. 277-293, 2006.
[52] H.C. Nothdurft, "Texture Segmentation and Pop-Out from Orientation Contrast," Vision Research, vol. 31, no. 6, pp. 1073-1078, 1991.
[53] S.E. Palmer, Vision Science: Photons to Phenomenology. MIT Press, 1999.
[54] M. Posner, C. Snyder, and B. Davidson, "Attention and the Detection of Signals," J. Experimental Psychology: General, vol. 109, no. 2, pp. 160-174, 1980.
[55] Z.W. Pylyshyn and R.W. Storm, "Tracking Multiple Independent Targets: Evidence for a Parallel Tracking Mechanism," Spatial Vision, vol. 3, no. 3, pp. 179-197, 1988.
[56] D. Ramanan, D. Forsyth, and A. Zisserman, "Tracking People by Learning Their Appearance," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 29, no. 1, pp. 65-81, Jan. 2007.
[57] D. Ross, J. Lim, R. Lin, and M. Yang, "Incremental Learning for Robust Visual Tracking," Int'l J. Computer Vision, vol. 77, nos. 1-3, pp. 125-141, May 2008.
[58] A.B. Sekuler and R. Sekuler, "Collisions between Moving Visual Targets: What Controls Alternative Ways of Seeing an Ambiguous Display?" Perception, vol. 28, no. 4, pp. 415-432, 1999.
[59] T. Serre, L. Wolf, S. Bileschi, M. Riesenhuber, and T. Poggio, "Robust Object Recognition with Cortex-Like Mechanisms," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 29, no. 3, pp. 411-426, Mar. 2007.
[60] K. Sharifi and A. Leon-Garcia, "Estimation of Shape Parameter for Generalized Gaussian Distributions in Subband Decompositions of Video," IEEE Trans. Circuits and Systems for Video Technology, vol. 5, no. 1, pp. 52-56, Feb. 1995.
[61] A. Srivastava, A. Lee, E. Simoncelli, and S. Zhu, "On Advances in Statistical Modeling of Natural Images," J. Math. Imaging and Vision, vol. 18, no. 1, pp. 17-33, 2003.
[62] C. Stauffer and W. Grimson, "Adaptive Background Mixture Models for Real-Time Tracking," Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 2, pp. 246-252, 1999.
[63] B. Tatler, "The Central Fixation Bias in Scene Viewing: Selecting an Optimal Viewing Position Independently of Motor Biases and Image Feature Distributions," J. Vision, vol. 7, no. 14, pp. 1-17, 2007.
[64] K. Toyama and G.D. Hager, "Incremental Focus of Attention for Robust Visual Tracking," Int'l J. Computer Vision, pp. 189-195, 1996.
[65] K. Toyama and Y. Wu, "Bootstrap Initialization of Nonparametric Texture Models for Tracking," Proc. European Conf. Computer Vision, 2000.
[66] S. Treue and J. Trujillo, "Feature-Based Attention Influences Motion Processing Gain in Macaque Visual Cortex," Nature, vol. 399, no. 6736, pp. 575-579, 1999.
[67] P. Tseng, R. Carmi, I. Cameron, D. Munoz, and L. Itti, "Quantifying Center Bias of Observers in Free Viewing of Dynamic Natural Scenes," J. Vision, vol. 9, no. 7,article 4, 2009.
[68] M. Vasconcelos and N. Vasconcelos, "Natural Image Statistics and Low-Complexity Feature Selection," IEEE Trans. Pattern Analysis and Machine Intelligent, vol. 31, no. 2, pp. 228-244, Feb. 2008.
[69] N. Vasconcelos, "Feature Selection by Maximum Marginal Diversity," Proc. Advances in Neural Information Processing Systems, 2002.
[70] F.A.J. Verstraten, P. Cavanagh, and A.T. Labianca, "Limits of Attentive Tracking Reveal Temporal Properties of Attention," Vision Research, vol. 40, no. 26, pp. 3651-3664, 2000.
[71] P. Viola and M. Jones, "Rapid Object Detection Using a Boosted Cascade of Simple Features," Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 1, pp. 511-518, 2001.
[72] A. Yarbus, Eye Movements and Vision. Plenum, 1967.
[73] A. Yilmaz, "Object Tracking by Asymmetric Kernel Mean Shift with Automatic Scale and Orientation Selection," Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 1-6, 2007.
[74] A. Yilmaz, O. Javed, and M. Shah, "Object Tracking: A Survey," ACM Computing Surveys, vol. 38, no. 4,article 13, 2006.
[75] Y. Zhong, A. Jain, and M. Dubuisson-Jolly, "Object Tracking Using Deformable Templates," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 22, no. 5, pp. 544-549, May 2000.
37 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool