|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
| ASCII Text | x | ||
| Davide Martinenghi, Marco Tagliasacchi, "Cost-Aware Rank Join with Random and Sorted Access," IEEE Transactions on Knowledge and Data Engineering, vol. 24, no. 12, pp. 2143-2155, Dec., 2012. | |||
| BibTex | x | ||
| @article{ 10.1109/TKDE.2011.161, author = {Davide Martinenghi and Marco Tagliasacchi}, title = {Cost-Aware Rank Join with Random and Sorted Access}, journal ={IEEE Transactions on Knowledge and Data Engineering}, volume = {24}, number = {12}, issn = {1041-4347}, year = {2012}, pages = {2143-2155}, doi = {http://doi.ieeecomputersociety.org/10.1109/TKDE.2011.161}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - JOUR JO - IEEE Transactions on Knowledge and Data Engineering TI - Cost-Aware Rank Join with Random and Sorted Access IS - 12 SN - 1041-4347 SP2143 EP2155 EPD - 2143-2155 A1 - Davide Martinenghi, A1 - Marco Tagliasacchi, PY - 2012 KW - Aggregates KW - Upper bound KW - Nickel KW - Context KW - Optimization KW - Search engines KW - Relational databases KW - random access KW - Top-k KW - rank join KW - sorted access VL - 24 JA - IEEE Transactions on Knowledge and Data Engineering ER - | |||
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TKDE.2011.161
Web Extra: View Supplemental Material(PDF)
In this paper, we address the problem of joining ranked results produced by two or more services on the web. We consider services endowed with two kinds of access that are often available: 1) sorted access, which returns tuples sorted by score; 2) random access, which returns tuples matching a given join attribute value. Rank join operators combine objects of two or more relations and output the k combinations with the highest aggregate score. While the past literature has studied suitable bounding schemes for this setting, in this paper we focus on the definition of a pulling strategy, which determines the order of invocation of the joined services. We propose the Cost-Aware with Random and Sorted access (CARS) pulling strategy, which is derived at compile-time and is oblivious of the query-dependent score distributions. We cast CARS as the solution of an optimization problem based on a small set of parameters characterizing the joined services. We validate the proposed strategy with experiments on both real and synthetic data sets. We show that CARS outperforms prior proposals and that its overall access cost is always within a very short margin from that of an oracle-based optimal strategy. In addition, CARS is shown to be robust w.r.t. the uncertainty that may characterize the estimated parameters.
Index Terms:
Aggregates,Upper bound,Nickel,Context,Optimization,Search engines,Relational databases,random access,Top-k,rank join,sorted access
Citation:
Davide Martinenghi, Marco Tagliasacchi, "Cost-Aware Rank Join with Random and Sorted Access," IEEE Transactions on Knowledge and Data Engineering, vol. 24, no. 12, pp. 2143-2155, Dec. 2012, doi:10.1109/TKDE.2011.161
Usage of this product signifies your acceptance of the Terms of Use.

