loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
17th International Conference on Data Engineering (ICDE'01)
High Dimensional Similarity Search With Space Filling Curves
Heidelberg, Germany
April 02-April 06
ISBN: 0-7695-1001-9
Swanwa Liao, University of Denver
Mario A. Lopez, University of Denver
Scott T. Leutenegger, University of Denver
Abstract: We present a new approach for approximate nearest neighbor queries for sets of high dimensional points under any Lt-metric, t = 1; : : : ;1. The proposed algorithm is efficient and simple to implement. The algorithm uses multiple shifted copies of the data points and stores them in up to (d +1) B-trees where d is the dimensionality of the data, sorted according to their position along a space filling curve. This is done in a way that allows us to guarantee that a neighbor within an O(d 1+1=t ) factor of the exact nearest, can be returned with at most (d + 1)log p n page accesses, where p is the branching factor of the B-trees. In practice, for real data sets, our approximate technique finds the exact nearest neighbor between 87% and 99% of the time and a point no farther than the third nearest neighbor between 98% and 100% of the time. Our solution is dynamic, allowing insertion or deletion of points in O(d log p n) page accesses and generalizes easily to find approximate k-nearest neighbors.
Citation:
Swanwa Liao, Mario A. Lopez, Scott T. Leutenegger, "High Dimensional Similarity Search With Space Filling Curves," icde, pp.0615, 17th International Conference on Data Engineering (ICDE'01), 2001
Usage of this product signifies your acceptance of the Terms of Use.