Sixth IEEE International Conference on Peer-to-Peer Computing (P2P'06)
Cost-Aware Processing of Similarity Queries in Structured Overlays
Cambridge, United Kingdom
September 6-8, 2006
ISBN: 0-7695-2679-9
Marcel Karnstedt, Technische Universität Ilmenau, Germany
Kai-Uwe Sattler, Technische Universität Ilmenau, Germany
Manfred Hauswirth, École Polytechnique Fédérale de Lausanne (EPFL), Switzerland
Roman Schmidt, École Polytechnique Fédérale de Lausanne (EPFL), Switzerland
Large-scale distributed data management with P2P systems requires similarity operators for queries: we cannot assume that all users agree on exactly the same schema and value representations, and data quality suffers from spelling errors and typos. In this paper, we present an approach for efficient processing of similarity selections and joins in a structured overlay. We show that there are several possible strategies that exploit DHT features (i.e., key organization, routing, multicasting) to different extents. The choice of the best operator implementation in a given situation (selectivity, data distribution, load) should therefore be based on cost information that allows the system to estimate the computation and communication costs of query execution plans. Hence, we present a cost model for similarity operations on structured data in a DHT and demonstrate the efficiency of our proposal by experimental results from a large-scale PlanetLab deployment.
Citation:
Marcel Karnstedt, Kai-Uwe Sattler, Manfred Hauswirth, Roman Schmidt, "Cost-Aware Processing of Similarity Queries in Structured Overlays," p2p, pp.81-89, Sixth IEEE International Conference on Peer-to-Peer Computing (P2P'06), 2006