Issue No.01 - January/February (2001 vol.13)
pp: 79-95
ABSTRACT
Metric databases are databases where a metric distance function is defined for pairs of database objects. In such databases, similarity queries in the form of range queries or k-nearest-neighbor queries are the most important query types. In traditional query processing, single queries are issued independently by different users. In many data mining applications, however, the database is typically explored by iteratively issuing similarity queries for the answers of previous similarity queries. In this paper, we introduce a generic scheme for such data mining algorithms and we investigate two orthogonal approaches, one reducing I/O cost and the other reducing CPU cost, to speed up the processing of multiple similarity queries. The proposed techniques apply to any type of similarity query and to implementations based either on an index or on a sequential scan. Parallelization yields a substantial additional speed-up. An extensive performance evaluation confirms the efficiency of our approach.
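The two cost reductions named in the abstract can be illustrated with a small sketch. It is not the paper's algorithm: it assumes a sequential scan, range queries with one shared radius, and Euclidean distance as the metric, and the names `multiple_range_queries`, `euclidean`, and `computed` are hypothetical. I/O is shared by reading each object once for all queries; CPU work is saved by using the triangle inequality on precomputed query-to-query distances to skip distance computations that cannot produce an answer.

```python
import math


def euclidean(a, b):
    """Euclidean distance, used here as an example metric (an assumption)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def multiple_range_queries(data, queries, radius, dist=euclidean):
    """Answer several range queries in a single pass over `data`.

    Returns (results, computed): results[i] lists the objects within
    `radius` of queries[i]; computed counts object-to-query distance
    evaluations actually performed.
    """
    m = len(queries)
    # Distances between query objects, computed once up front.
    qq = [[dist(queries[i], queries[j]) for j in range(m)] for i in range(m)]
    results = [[] for _ in queries]
    computed = 0

    for obj in data:  # single scan: each object is read once for all queries
        known = {}  # query index -> distance from that query to obj
        for i in range(m):
            # Triangle inequality: dist(q_i, obj) >= |dist(q_j, obj) - dist(q_i, q_j)|
            lower = max(
                (abs(d - qq[i][j]) for j, d in known.items()),
                default=0.0,
            )
            if lower > radius:
                continue  # obj cannot lie within radius of q_i; skip the computation
            d = dist(queries[i], obj)
            computed += 1
            known[i] = d
            if d <= radius:
                results[i].append(obj)
    return results, computed
```

Calling `multiple_range_queries(points, [q1, q2], 1.5)` returns the same answer sets as evaluating each query separately, while `computed` shows how many distance evaluations were needed. The saving depends on the data: when query objects are far apart relative to the radius, many evaluations can be skipped.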
INDEX TERMS
Knowledge discovery in databases, data mining, similarity search, efficient query processing, high-dimensional indexing.
CITATION
Bernhard Braunmüller, Martin Ester, Hans-Peter Kriegel, Jörg Sander, "Multiple Similarity Queries: A Basic DBMS Operation for Mining in Metric Databases", IEEE Transactions on Knowledge & Data Engineering, vol.13, no. 1, pp. 79-95, January/February 2001, doi:10.1109/69.908982