Issue No. 05 - May (2011 vol. 23)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TKDE.2010.157
Stavros Papadopoulos , Hong Kong University of Science and Technology, Hong Kong
Lixing Wang , Hong Kong University of Science and Technology, Hong Kong
Yin Yang , Hong Kong University of Science and Technology, Hong Kong
Dimitris Papadias , Hong Kong University of Science and Technology, Hong Kong
Panagiotis Karras , National University of Singapore, Singapore
Multistep processing is commonly used for nearest neighbor (NN) and similarity search in applications involving high-dimensional data and/or costly distance computations. Today, many such applications require a proof of result correctness. In this setting, clients issue NN queries to a server that maintains a database signed by a trusted authority. The server returns the NN set along with supplementary information that permits result verification using the data set signature. An adaptation of the multistep NN algorithm incurs prohibitive network overhead due to the transmission of false hits, i.e., records that are not in the NN set, but are nevertheless necessary for its verification. In order to alleviate this problem, we present a novel technique that reduces the size of each false hit. Moreover, we generalize our solution for a distributed setting, where the database is horizontally partitioned over several servers. Finally, we demonstrate the effectiveness of the proposed solutions with real data sets of various dimensionalities.
Query authentication, multistep nearest neighbors, similarity search.
Y. Yang, S. Papadopoulos, D. Papadias, L. Wang and P. Karras, "Authenticated Multistep Nearest Neighbor Search," in IEEE Transactions on Knowledge & Data Engineering, vol. 23, no. , pp. 641-654, 2010.