The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.11 - November (2011 vol.17)
pp: 1624-1636
Kristian Hildebrand , Technische Universität Berlin, Berlin
Mathias Eitz , Technische Universität Berlin, Berlin
Marc Alexa , Technische Universität Berlin, Berlin
ABSTRACT
We introduce a benchmark for evaluating the performance of large-scale sketch-based image retrieval systems. The necessary data are acquired in a controlled user study where subjects rate how well given sketch/image pairs match. We suggest how to use the data for evaluating the performance of sketch-based image retrieval systems. The benchmark data as well as the large image database are made publicly available for further studies of this type. Furthermore, we develop new descriptors based on the bag-of-features approach and use the benchmark to demonstrate that they significantly outperform other descriptors in the literature.
INDEX TERMS
Image/video retrieval, image databases, benchmarking.
CITATION
Kristian Hildebrand, Mathias Eitz, Marc Alexa, "Sketch-Based Image Retrieval: Benchmark and Bag-of-Features Descriptors", IEEE Transactions on Visualization & Computer Graphics, vol.17, no. 11, pp. 1624-1636, November 2011, doi:10.1109/TVCG.2010.266
REFERENCES
[1] R. Datta, D. Joshi, J. Li, and J. Wang, “Image Retrieval: Ideas, Influences, and Trends of the New Age,” ACM Computing Surveys, vol. 40, no. 2, pp. 1-60, 2008.
[2] A. Oliva and A. Torralba, “Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope,” Int'l J. Computer Vision, vol. 42, no. 3, pp. 145-175, 2001.
[3] D.G. Lowe, “Distinctive Image Features from Scale-Invariant Keypoints,” Int'l J. Computer Vision, vol. 60, no. 2, pp. 91-110, 2004.
[4] D. Squire, W. Mueller, H. Mueller, and J. Raki, “Content-Based Query of Image Databases, Inspirations from Text Retrieval: Inverted Files, Frequency-Based Weights and Relevance Feedback,” Proc. Scandinavian Conf. Image Analysis, pp. 7-11, 1999.
[5] J. Sivic and A. Zisserman, “Video Google: A Text Retrieval Approach to Object Matching in Videos,” Proc. IEEE Int'l Conf. Computer Vision, pp. 1470-1477, 2003.
[6] H. Jégou, M. Douze, and C. Schmid, “Hamming Embedding and Weak Geometric Consistency for Large Scale Image Search,” Proc. European Conf. Computer Vision, pp. 304-317, 2008.
[7] Z. Wu, Q. Ke, M. Isard, and J. Sun, “Bundling Features for Large Scale Partial-Duplicate Web Image Search,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 25-32, 2009.
[8] D.A. Forsyth, “Benchmarks for Storage and Retrieval in Multimedia Databases,” Proc. Storage and Retrieval for Media Databases, pp. 240-247, 2002.
[9] M. Eitz, K. Hildebrand, T. Boubekeur, and M. Alexa, “An Evaluation of Descriptors for Large-Scale Image Retrieval from Sketched Feature Lines,” Computers & Graphics, vol. 34, no. 5, pp. 482-498, 2010.
[10] R. Likert, “A Technique for the Measurement of Attitudes,” Archives of Psychology, vol. 22, no. 140, pp. 1-55, 1932.
[11] T.J. Andrews and D.M. Coppola, “Idiosyncratic Characteristics of Saccadic Eye Movements when Viewing Different Visual Environments,” Vision Research, vol. 39, no. 17, pp. 2947-2953, 1999.
[12] F.A. Wichmann, J. Drewes, P. Rosas, and K.R. Gegenfurtner, “Animal Detection in Natural Scenes: Critical Features Revisited,” J. Vision, vol. 10, no. 4, pp. 1-27, 2010.
[13] M.G. Kendall, “A New Measure of Rank Correlation,” Biometrika, vol. 30, nos. 1/2, pp. 81-93, June 1938.
[14] M. Kendall and J. Gibbons, Rank Correlation Methods, fifth ed. Griffin, 1990.
[15] N.V. Shirahatti and K. Barnard, “Evaluating Image Retrieval,” Proc. IEEE CS Conf. Computer Vision and Pattern Recognition, pp. 955-961, 2005.
[16] J. Zobel and A. Moffat, “Inverted Files for Text Search Engines,” ACM Computing Surveys, vol. 38, no. 2, pp. 1-56, 2006.
[17] E. Nowak, F. Jurie, and B. Triggs, “Sampling Strategies for Bag-of-Features Image Classification,” Proc. European Conf. Computer Vision, pp. 490-503, 2006.
[18] I. Witten, A. Moffat, and T. Bell, Managing Gigabytes: Compressing and Indexing Documents and Images. Morgan Kaufmann, 1999.
[19] S. Belongie, J. Malik, and J. Puzicha, “Shape Matching and Object Recognition Using Shape Contexts,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 24, no. 4, pp. 509-522, Apr. 2002.
[20] N. Dalal and B. Triggs, “Histograms of Oriented Gradients for Human Detection,” Proc. IEEE CS Conf. Computer Vision and Pattern Recognition, pp. 886-893, 2005.
[21] J. Canny, “A Computational Approach to Edge Detection,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. PAMI-8, no. 6, pp. 679-698, Nov. 1986.
[22] A. Chalechale, G. Naghdy, and A. Mertins, “Sketch-Based Image Matching Using Angular Partitioning,” IEEE Trans. Systems, Man and Cybernetics, Part A, vol. 35, no. 1, pp. 28-41, Jan. 2005.
[23] B. Manjunath, P. Salembier, and T. Sikora, Introduction to MPEG-7: Multimedia Content Description Interface. John Wiley & Sons, Inc., 2002.
[24] P. Phillips, H. Moon, S. Rizvi, and P. Rauss, “The FERET Evaluation Methodology for Face-Recognition Algorithms,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 22, no. 10, pp. 1090-1104, Oct. 2000.
[25] D. DeCarlo, A. Finkelstein, S. Rusinkiewicz, and A. Santella, “Suggestive Contours for Conveying Shape,” ACM Trans. Graphics, vol. 22, no. 3, pp. 848-855, 2003.
[26] T. Judd, F. Durand, and E.H. Adelson, “Apparent Ridges for Line Drawing,” ACM Trans. Graphics, vol. 26, no. 3, 2007.
[27] R. Dony, J. Mateer, J. Robinson, and M. Day, “Iconic versus Naturalistic Motion Cues in Automated Reverse Storyboarding,” Proc. IEEE European Conf. Visual Media Production, pp. 17-25, 2005.
[28] D.B. Goldman, B. Curless, S.M. Seitz, and D. Salesin, “Schematic Storyboarding for Video Visualization and Editing,” ACM Trans. Graphics, vol. 25, no. 3, pp. 862-871, 2006.
[29] J.P. Collomosse, G. McNeill, and Y. Qian, “Storyboard Sketches for Content Based Video Retrieval,” Proc. IEEE Int'l Conf. Computer Vision, pp. 245-252, 2009.
[30] G.A. Miller, “The Magical Number Seven, Plus or Minus Two: Some Limits on Our Capacity for Processing Information,” Psychological Rev., vol. 63, no. 2, pp. 81-97, 1956.
[31] J. Dawes, “Do Data Characteristics Change According to the Number of Scale Points Used?,” Int'l J. Market Research, vol. 50, no. 1, pp. 61-77, 2008.
18 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool