The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.04 - July/August (2002 vol.14)
pp: 792-808
ABSTRACT
<p>In this paper, we present a clustering and indexing paradigm (called Clindex) for high-dimensional search spaces. The scheme is designed for approximate similarity searches, where one would like to find many of the data points near a target point, but where one can tolerate missing a few near points. For such searches, our scheme can find near points with high recall in very few IOs and perform significantly better than other approaches. Our scheme is based on finding clusters and, then, building a simple but efficient index for them. We analyze the trade-offs involved in clustering and building such an index structure, and present extensive experimental results.</p>
INDEX TERMS
Approximate search, clustering, high-dimensional index, similarity search.
CITATION
Chen Li, Edward Chang, Hector Garcia-Molina, Gio Wiederhold, "Clustering for Approximate Similarity Search in High-Dimensional Spaces", IEEE Transactions on Knowledge & Data Engineering, vol.14, no. 4, pp. 792-808, July/August 2002, doi:10.1109/TKDE.2002.1019214
37 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool