Proceedings 17th International Conference on Data Engineering (2001)
Apr. 2, 2001 to Apr. 6, 2001
Hakan Ferhatosmanoglu , University of California, Santa Barbara
Ertem Tuncel , University of California, Santa Barbara
Divyakant Agrawal , University of California, Santa Barbara
Amr El Abbadi , University of California, Santa Barbara
Abstract: In this paper, we develop a general framework for approximate nearest neighbor queries. We categorize the current approaches for nearest neighbor query processing based on either their ability to reduce the data set that needs to be examined, or their ability to reduce the representation size of each data object. We first propose modifications to well-known techniques to support the progressive processing of approximate nearest neighbor queries. A user may therefore stop the retrieval process once enough information has been returned. We then develop a new technique based on clustering that merges the benefits of the two general classes of approaches. Our cluster-based approach allows a user to progressively explore the approximate results with increasing accuracy. We propose a new metric for evaluation of approximate nearest neighbor searching techniques. Using both the proposed and the traditional metrics, we analyze and compare several techniques with a detailed performance evaluation. We demonstrate the feasibility and efficiency of approximate nearest neighbor searching. We perform experiments on several real data sets and establish the superiority of the proposed cluster-based technique over the existing techniques for approximate nearest neighbor searching.
H. Ferhatosmanoglu, E. Tuncel, D. Agrawal and A. E. Abbadi, "Approximate Nearest Neighbor Searching in Multimedia Databases," Proceedings 17th International Conference on Data Engineering(ICDE), Heidelberg, Germany, 2001, pp. 0503.