Issue No.10 - October (2004 vol.16)
Christian A. Lang, IEEE
Yuan-Chi Chang, IEEE
John R. Smith, IEEE
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TKDE.2004.60
Assume a database storing N objects with d numerical attributes or feature values. All objects in the database can be assigned an overall score that is derived from their single feature values (and the feature values of a user-defined query). The problem considered here is then to efficiently retrieve the k objects with minimum (or maximum) overall score. The well-known threshold algorithm (TA) was proposed as a solution to this problem. TA views the database as a set of d sorted lists storing the feature values. Even though TA is optimal with regard to the number of accesses, its overall access cost can be high since, in practice, some list accesses may be more expensive than others. We therefore propose to make TA access cost aware by choosing the next list to access such that the overall cost is minimized. Our experimental results show that this overall cost is close to the optimal cost and significantly lower than the cost of prior approaches.
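The idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' selection strategy, which the abstract does not specify. It assumes the following: scores are maximized, the overall score is the sum of the d feature values, every object appears in every list, only sorted accesses are charged, and the next list is chosen by a simple illustrative rule (the list whose accumulated cost after one more access would be smallest). The names `cost_aware_ta`, `lists`, and `costs` are hypothetical.

```python
import heapq


def cost_aware_ta(lists, costs, k):
    """Top-k retrieval in the style of the threshold algorithm (TA).

    lists[i]: non-empty list of (obj_id, value) pairs, sorted by value descending.
    costs[i]: assumed cost of one sorted access on list i.
    Returns (top_k, total_cost); top_k holds (score, obj_id), best first.
    """
    d = len(lists)
    lookup = [dict(lst) for lst in lists]   # stands in for random access (not charged)
    pos = [0] * d                           # next unread position per list
    last = [lst[0][1] for lst in lists]     # upper bound on unseen values per list
    spent = [0.0] * d                       # accumulated sorted-access cost per list
    scores = {}                             # obj_id -> overall score (sum of features)

    while True:
        open_lists = [i for i in range(d) if pos[i] < len(lists[i])]
        if not open_lists:
            break  # every list is exhausted, so all objects have been seen

        # Illustrative cost-aware choice (an assumption, not the paper's rule):
        # read from the list that keeps the accumulated cost most balanced,
        # so cheap lists are read more deeply than expensive ones.
        i = min(open_lists, key=lambda j: spent[j] + costs[j])

        obj, value = lists[i][pos[i]]
        pos[i] += 1
        last[i] = value
        spent[i] += costs[i]

        if obj not in scores:
            scores[obj] = sum(lookup[j][obj] for j in range(d))

        # No unseen object can score higher than the sum of the last-seen values.
        threshold = sum(last)
        top = heapq.nlargest(k, ((s, o) for o, s in scores.items()))
        if len(top) == k and top[-1][0] >= threshold:
            break

    top = heapq.nlargest(k, ((s, o) for o, s in scores.items()))
    return top, sum(spent)
```

With two lists, where the second is five times as expensive to read, the sketch answers a top-2 query by reading only the cheap list; the threshold stays valid because the unread list contributes its maximum value as an upper bound.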
Multifeature query, threshold algorithm, adaptive, cost-awareness.
Christian A. Lang, Yuan-Chi Chang, John R. Smith, "Making the Threshold Algorithm Access Cost Aware", IEEE Transactions on Knowledge & Data Engineering, vol. 16, no. 10, pp. 1297-1301, October 2004, doi:10.1109/TKDE.2004.60