The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.02 - February (2010 vol.32)
pp: 371-377
Ondř;ej Chum , Czech Technical University, Prague
Jiř;í Matas , Czech Technical University, Prague
ABSTRACT
We propose a randomized data mining method that finds clusters of spatially overlapping images. The core of the method relies on the min-Hash algorithm for fast detection of pairs of images with spatial overlap, the so-called cluster seeds. The seeds are then used as visual queries to obtain clusters which are formed as transitive closures of sets of partially overlapping images that include the seed. We show that the probability of finding a seed for an image cluster rapidly increases with the size of the cluster. The properties and performance of the algorithm are demonstrated on data sets with 10^4, 10^5, and 5 \times 10^6 images. The speed of the method depends on the size of the database and the number of clusters. The first stage of seed generation is close to linear for databases sizes up to approximately 2^{34} \approx 10^{10} images. On a single 2.4 GHz PC, the clustering process took only 24 minutes for a standard database of more than 100,000 images, i.e., only 0.014 seconds per image.
INDEX TERMS
minHash, image clustering, image retrieval, bag of words.
CITATION
Ondř;ej Chum, Jiř;í Matas, "Large-Scale Discovery of Spatially Related Images", IEEE Transactions on Pattern Analysis & Machine Intelligence, vol.32, no. 2, pp. 371-377, February 2010, doi:10.1109/TPAMI.2009.166
REFERENCES
[1] http://books.google.com/help/mapsstreetview /, 2009.
[2] http:/www.panoramio.com/, 2009.
[3] http:/www.flickr.com/, 2009.
[4] N. Snavely, S. Seitz, and R. Szeliski, “Photo Tourism: Exploring Photo Collections in 3D,” Proc. ACM SIGGRAPH, pp. 835-846, 2006.
[5] D. Nister and H. Stewenius, “Scalable Recognition with a Vocabulary Tree,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2006.
[6] J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman, “Object Retrieval with Large Vocabularies and Fast Spatial Matching,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2007.
[7] H. Jegou, H. Harzallah, and C. Schmid, “A Contextual Dissimilarity Measure for Accurate and Efficient Image Search,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2007.
[8] O. Chum, J. Philbin, J. Sivic, M. Isard, and A. Zisserman, “Total Recall: Automatic Query Expansion with a Generative Feature Model for Object Retrieval,” Proc. IEEE Int'l Conf. Computer Vision, 2007.
[9] G. Schindler, M. Brown, and R. Szeliski, “City-Scale Location Recognition,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2007.
[10] F. Schaffalitzky and A. Zisserman, “Multi-View Matching for Unordered Image Sets, or ‘How Do I Organize My Holiday Snaps?’” Proc. European Conf. Computer Vision, pp. 414-431, 2002.
[11] J. Sivic and A. Zisserman, “Video Google: A Text Retrieval Approach to Object Matching in Videos,” Proc. IEEE Int'l Conf. Computer Vision, 2003.
[12] K. Mikolajczyk, T. Tuytelaars, C. Schmid, A. Zisserman, J. Matas, F. Schaffalitzky, T. Kadir, and L.V. Gool, “A Comparison of Affine Region Detectors,” Int'l J. Computer Vision, vol. 65, pp. 43-72, 2005.
[13] D. Lowe, “Distinctive Image Features from Scale-Invariant Keypoints,” Int'l J. Computer Vision, vol. 60, pp. 91-110, 2004.
[14] J. Sivic and A. Zisserman, “Video Data Mining Using Configurations of Viewpoint Invariant Regions,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2004.
[15] J. Philbin and A. Zisserman, “Object Mining Using a Matching Graph on Very Large Image Collections,” Proc. Indian Conf. Computer Vision, Graphics and Image Processing, 2008.
[16] F. Fraundorfer, H. Stewénius, and D. Nistér, “A Binning Scheme for Fast Hard Drive Based Image Search,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2007.
[17] T. Quack, V. Ferrari, and L.V. Gool, “Video Mining with Frequent Itemset Configurations,” Proc. Int'l Conf. Image and Video Retrieval, 2006.
[18] T. Quack, B. Leibe, and L.V. Gool, “World-Scale Mining of Objects and Events from Community Photo Collections,” Proc. Int'l Conf. Image and Video Retrieval, 2008.
[19] X. Li, C. Wu, C. Zach, S. Lazebnik, and J.M. Frahm, “Modeling and Recognition of Landmark Image Collections Using Iconic Scene Graphs,” Proc. European Conf. Computer Vision, 2008.
[20] A. Oliva and A. Torralba, “Building the Gist of a Scene: The Role of Global Image Features in Recognition,” Visual Perception, Progress in Brain Research, vol. 155, pp. 23-36, 2006.
[21] J. Sivic, B. Russell, A. Efros, A. Zisserman, and W. Freeman, “Discovering Object Categories in Image Collections,” Technical Report A.I. Memo 2005-005, Massachusetts Inst. of Tech nology, 2005.
[22] T. Hofmann, “Probabilistic Latent Semantic Indexing,” Proc. ACM SIGIR, 1999.
[23] D. Blei, A. Ng, and M. Jordan, “Latent Dirichlet Allocation,” J. Machine Learning Research, vol. 3, pp. 993-1022, 2003.
[24] B.C. Russell, A.A. Efros, J. Sivic, W.T. Freeman, and A. Zisserman, “Using Multiple Segmentations to Discover Objects and Their Extent in Image Collections,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2006.
[25] J. Philbin, J. Sivic, and A. Zisserman, “Geometric LDA: A Generative Model for Particular Object Discovery,” Proc. British Machine Vision Conf., 2008.
[26] A. Broder, “On the Resemblance and Containment of Documents,” Proc. SEQS: Sequences '91, 1998.
[27] O. Chum, J. Philbin, M. Isard, and A. Zisserman, “Scalable Near Identical Image and Shot Detection,” Proc. Int'l Conf. Image and Video Retrieval, 2007.
[28] P. Indyk and R. Motwani, “Approximate Nearest Neighbors: Towards Removing the Curse of Dimensionality,” Proc. Symp. Theory of Computing, 1998.
[29] M.A. Fischler and R.C. Bolles, “Random Sample Consensus: A Paradigm for Model Fitting with Applications to Image Analysis and Automated Cartography,” Comm. ACM, vol. 24, pp. 381-395, 1981.
[30] O. Chum, J. Philbin, and A. Zisserman, “Near Duplicate Image Detection: min-Hash and TF-IDF Weighting,” Proc. British Machine Vision Conf., 2008.
[31] O. Chum, J. Matas, and Š. Obdržálek, “Enhancing RANSAC by Generalized Model Optimization,” Proc. Asian Conf. Computer Vision, 2004.
[32] O. Chum, T. Werner, and J. Matas, “Epipolar Geometry Estimation Unaffected by the Dominant Plane,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2005.
[33] J.M. Frahm and M. Pollefeys, “RANSAC for (Quasi-) Degenerate Data (QDEGSAC),” Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2006.
[34] http://www.robots.ox.ac.uk/~vgg/dataoxbuildings /, 2009.
[35] M. Perdoch, O. Chum, and J. Matas, “Efficient Representation of Local Geometry for Large Scale Object Retrieval,” Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2009.
47 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool