Issue No. 02 - February (2006 vol. 28)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TPAMI.2006.28
A visual search is required when applying a recognition process on a scene containing multiple objects. In such cases, we would like to avoid an exhaustive sequential search. This work proposes a dynamic visual search framework based mainly on inner-scene similarity. Given a number of candidates (e.g., subimages), we hypothesize is that more visually similar candidates are more likely to have the same identity. We use this assumption for determining the order of attention. Both deterministic and stochastic approaches, relying on this hypothesis, are considered. Under the deterministic approach, we suggest a measure similar to Kolmogorov's epsilon-covering that quantifies the difficulty of a search task. We show that this measure bounds the performance of all search algorithms and suggest a simple algorithm that meets this bound. Under the stochastic approach, we model the identity of the candidates as a set of correlated random variables and derive a search procedure based on linear estimation. Several experiments are presented in which the statistical characteristics, search algorithm, and bound are evaluated and verified.
Index Terms- Computer vision, scene analysis, feature representation, similarity measures, performance evaluation of algorithms and systems, object recognition, visual search, attention.
Tamar Avraham, Michael Lindenbaum, "Attention-Based Dynamic Visual Search Using Inner-Scene Similarity: Algorithms and Bounds", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 28, no. , pp. 251-264, February 2006, doi:10.1109/TPAMI.2006.28