This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Data- and Model-Driven Gaze Control for an Active-Vision System
December 2001 (vol. 23 no. 12)
pp. 1415-1429

Models of visual attention provide a general approach to control the activities of active vision systems. We will introduce a new model of attentional control that differs in important aspects from conventional ones. We divide the selection into two stages, which is more suitable for the system as well as explaining different phenomena found in natural visual attention, such as the dispute between early and late selection. The proposed model is especially designed for use in dynamic scenes. Our approach aims at modeling as much of a general active vision system as possible and designing clean interfaces for the integration of the remaining specific aspects needed in order to solve specific problems.

[1] S.-I. Amari, “Dynamics of Pattern Formation in Lateral Inhibition Type Neural Field,” Biological Cybernetic, vol. 27, pp. 77-87, 1977.
[2] G. Backer and B. Mertsching, “Integrating Time and Depth into the Attentional Control of an Active Vision System,” Proc. Dynamische Perzeption Workshop der GI-Fachgruppe 1.0.4, G. Baratoff and H. Neumann, eds., pp. 69-74, 2000.
[3] D.H. Ballard, “Animate Vision,” Artificial Intelligence, vol. 48, pp. 57-86, 1991.
[4] G. Baylis and J. Driver, “Visual Attention and Objects: Evidence for Hierarchical Coding of Locations,” J. Experimental Psychology: Human Perception and Performance, vol. 19, no. 3, pp. 451-470, 1993.
[5] E. Blaser, Z. Pylyshyn, and A. Holcombe, “Tracking an Object through Feature Space,” Nature, vol. 408, pp. 196-199, Nov. 2000.
[6] H.-J. Boehme, A. Brakensiek, U.-D. Braumann, M. Krabbes, and H.-M. Gross, “Neural Networks for Gesture-Based Remote Control of a Mobile Robot,” Proc. IEEE World Congress Computational Intelligence (WCCI '98 - IJCNN '98), vol. 1, pp. 372-377, 1998.
[7] M. Bollmann, “Entwicklung einer Aufmerksamkeitssteuerung für ein aktives Sehsystem,” PhD thesis, Univ. of Hamburg, 2000.
[8] M. Bollmann, C. Justkowski, and B. Mertsching, “Utilizing Color Information for the Gaze Control of an Active Vision System,” Proc. Fourth Workshop Farbbildverarbeitung, V. Rehrmann, ed., pp. 73-79, 1998.
[9] K. Brunnström, J.-O. Eklundh, T. Uhlin, “Active Fixation for Scene Exploration,” Int'l J. Computer Vision, vol. 17, pp. 137-162, 1994.
[10] G. Deco, “A Neurodynamical Model of Visual Attention: Feedback Enhancement of Spatial Resolution in a Hierarchical System,” Dynamische Perzeption, G. Baratoff and H. Neumann, eds., pp. 45-50, 2000.
[11] J. Ditterich, T. Eggert, and A. Straube, “The Role of the Attention Focus in the Visual Information Processing Underlying Saccadic Adaptation,” Vision Research, vol. 40, pp. 1125-1134, 2000.
[12] C.W. Eriksen and J.S. James, “Visual Attention within and Around the Focus of Attention: A Zoom Lens Model,” Perception and Psychophysics, vol. 40, no. 4, pp. 225-240, 1986.
[13] C.W. Eriksen and Y. Y. Yeh, “Allocation of Attention in the Visual Field,” J. Experimental Psychology: Human Perception and Performance, vol. 11, no. 5, pp. 583–597, 1985.
[14] G.-J. Giefing, H. Janßen, and H. Mallot, “Saccadic Object Recognition with an Active Vision System,” Proc. 10th European Conf. Artificial Intelligence, pp. 803-805, 1992.
[15] G.H. Granlund, H. Knutsson, C.-J. Westelius, and J. Wiklund, “Issues in Robot Vision,” Image and Vision Computing, vol. 12, no. 3, pp. 131-148, 1994.
[16] F.H. Hamker, “Distributed Competition in Directed Attention,” Dynamische Perzeption, G. Baratoff and H. Neumann, eds., pp. 39-44, 2000.
[17] R. Hübner and G. Backer, “Perceiving Spatially Inseparable Objects: Evidence for Feature-Based Object Selection Not Mediated by Location,” J. Experimental Psychology: Human Perception and Performance, vol. 23, no. 4, pp. 948-961, 1999.
[18] R. Hunt, Measuring Colour. Chichester: Ellis Horwood, 1987.
[19] L. Itti and C. Koch, “A Saliency-Based Search Mechanism for Overt and Covert Shifts of Visual Attention,” Vision Research, vol. 10, no. 12, pp. 1489–1506, 2000.
[20] C. Jenning and D. Murray, “Gesture Recognition for Robot Control,” Proc. Conf. Computer Vision and Pattern Recognition, 1997.
[21] D. Kahneman, A. Treisman, and B. Gibbs, “The Reviewing of Object Files: Object-Specific Integration of Information,” Cognitive Psychology, vol. 24, no. 2, pp. 175-210, 1992.
[22] L. Kaufman and W. Richards, “Spontaneous Fixation Tendencies for Visual Forms,” Perception and Psychophysics, vol. 5, no. 2, pp. 85-88, 1969.
[23] C. Koch and S. Ullman, “Shifts in Selective Visual Attention: Towards the Underlying Neural Circuitry,” Human Neurobiology, vol. 4, pp. 219-227, 1985.
[24] K. Kopecz, “Neural Field Dynamics Provide Robust Control of Attentional Resources,” Aktives Sehen in Technischen und Natürlichen Systemen, B. Mertsching, ed., 1996.
[25] A. Kramer and S. Hahn, “Splitting the Beam: Distribution of Attention over Noncontiguous Regions of the Visual Field,” Psychological Science, vol. 6, no. 6, pp. 381-386, 1995.
[26] D. LaBerge, R.L. Carlson, J.K. Williams, and B.G. Bunney, “Shifting Attention in Visual Space: Tests of Moving-Spotlight Models versus an Activity-Distribution Model,” J. Experimental Psychology: Human Perception and Performance, vol. 23, no. 5, pp. 1380-1392, 1997.
[27] A. Maki, P. Nordlund, and J.-O. Eklundh, “A Computational Model of Depth-Based Attention,” Proc. 13th Int'l Conf. Pattern Recognition, vol. 4, pp. 734-738, 1996.
[28] A. Maki, P. Nordlund, and J.-O. Eklundh, “Attentional Scene Segmentation: Integrating Depth and Motion,” Computer Vision and Image Understanding, vol. 78, pp. 351-373, 2000.
[29] B. Mertsching, M. Bollmann, R. Hoischen, and S. Schmalz, “The Neural Active Vision System Navis,” Handbook of Computer Vision and Applications Vol. 3 (Systems and Applications), B. Jähne, H. Haußecke, and P. Geißler, eds., pp. 543-568, 1999.
[30] B. Mertsching, M. Bollmann, A. Massad, and S. Schmalz, “Recognition of Complex Objects with an Active Vision System,” Proc. Int'l ICSC/IFAC Symp. Neural Computation (NC '98), pp. 469-475, 1998.
[31] R. Milanese, “Detecting Salient Regions in an Image: From Biological Evidence to Computer Implementation,” PhD thesis, Univ. of Geneva, 1993.
[32] M. Miyahara and Y. Yoshida, “Mathematical Transform of (R, G, B) Color Data to Munsell (H, V, C) Color Data,” Visual Comm. and Image Processing, vol. 1001, pp. 650-657, 1988.
[33] B. Olshausen, C. Anderson, and C.V. Essen, “A Multiscale Dynamic Routing Circuit for Forming Size- and Position-Invariant Object Representations,” J. Computational Neuroscience, vol. 2, no. 1, 45-62, 1995.
[34] K. Pahlavan, “Designing an Anthropomorphic Head-Eye System,” Visual Attention and Vision, W. Zangenmeister, H. Stiehl, and C. Freksa, eds., 1996.
[35] H. Pashler and P. Badgio, “Visual Attention and Stimulus Identification,” J. Experimental Psychology: Human Perception and Performance, vol. 11, no. 2, pp. 105-121, Apr. 1985.
[36] M. Posner, C. Snyder, and B. Davidson, “Attention and the Detection of Signals,” J. Experimental Psychology, vol. 109, no. 2, pp. 160-174, 1980.
[37] E. Postma, “SCAN: A Neural Model of Covert Attention,” PhD thesis, Rijksuniversiteit Limburg, Wageningen, 1994.
[38] Z. Pylyshyn, J. Burkell, B. Fisher, C. Sears, W. Schmidt, and L. Trick, “Mulitple Parallel Access in Visual Attention,” Canadian J. Experimental Psychology, vol. 48, no. 2, pp. 260-283, 1994.
[39] Z.W. Pylyshyn and R. Storm, “Tracking Multiple Independent Targets: Evidence for a Parallel Tracking Mechanism,” Spatial Vision, vol. 3, no. 3, 1988.
[40] D. Reisfeld, H. Wolfsow, and Y. Yeshurun, “Context-Free Attentional Operators: the Generalized Symmetry Transform,” Int'l J. Computer Vision, vol. 14, pp. 119-130, 1995.
[41] A. Rosenfeld and A.C. Kak,Digital Picture Processing. Academic Press, 2nd ed., 1982
[42] H. Rowley, S. Baluja, and T. Kanade, “Human Face Detection in Visual Scenes,” Technical Report CMU-CS95-158, Carnegie Mellon Univ., 1995.
[43] G. Schöner, M. Dose, and C. Engels, “Dynamics of Behavior: Theory and Applications for Autonomous Robot Architectures,” Robotics and Autonomous Systems, vol. 16, pp. 213-245, 1996.
[44] A. Takeuchi and S. Amari, “Formation of Topographic Maps in Columnar Microstructures in Nerve Fields,” Biological Cybernetics, vol. 35, pp. 63-72, 1979.
[45] S.P. Tipper, J. Driver, and B. Weaver, “Object-Centered Inhibition of Return of Visual Attention,” Quarterly J. Experimental Psychology, vol. 43A, no. 2, pp. 289-298, May 1991.
[46] J.K. Tsotsos, S.M. Culhane, W.Y.K. Wai, Y. Lai, N. Davis, and F. Nuflo, “Modeling Visual Attention via Selective Tuning,” Artificial Intelligence, vol. 78, pp. 507-545, 1995.
[47] S.P. Vecera and M. Behrmann, “Spatial Attention Does Not Require Preattentive Grouping,” Neuropsychology, vol. 11, no. 1, pp. 30-43, 1997.
[48] T. Vieville, E. Clergue, R. Enriso, and H. Mathieu, “Experimenting with 3-D Vision on a Robotic Head,” Robotics and Autonomous Systems, vol. 14, no. 1, 1-27, 1995.
[49] N. Voss and B. Mertsching, “Design and Implementation of an Accelerated Gabor Filter Bank Using Parallel Hardware,” Proc. 11th Int'l Conf. Field-Programmable Logic and Applications (FPL '01), 2001.
[50] J.M. Wolfe, “Guided Search 2.0: A Revised Model of Visual Search,” Psychonomic Bull. and Rev., vol. 1, no. 2, pp. 202-238, 1994.
[51] J.M. Wolfe, K.R. Cave, and S.L. Franzel, “Guided Search: An Alternative to the Feature Integration Model for Visual Search,” J. Experimental Psychology: Human Perception and Performance, vol. 15, pp. 419-433, 1989.
[52] C. Wren, A. Azarbayejani, T. Darrell, and A.P. Pentland, Pfinder: Real-Time Tracking of the Human Body IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 19, no. 7, pp. 780-785, July 1997.

Index Terms:
Visual attention, gaze control, visual exploration, active vision
Citation:
G. Backer, B. Mertsching, M. Bollmann, "Data- and Model-Driven Gaze Control for an Active-Vision System," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 23, no. 12, pp. 1415-1429, Dec. 2001, doi:10.1109/34.977565
Usage of this product signifies your acceptance of the Terms of Use.