Issue No. 04 - July/August (2002 vol. 14)
<p>Foreign functions have been considered in the advanced database systems to support complex applications. In this paper, we consider optimizing queries with foreign functions in a distributed environment. In traditional distributed query processing, selection operations are locally processed before joins as much as possible so that the size of relations being transmitted and joined can be reduced. However, if selection predicates involve foreign functions, the cost of evaluating selections cannot be ignored. As a result, the execution order of selections and joins becomes significant, and the trade-off for reducing the costs of data transmission, join processing, and selection predicate evaluation needs to be carefully considered in query optimization. In this paper, a response time model is developed for estimating the cost of distributed query processing involving foreign functions. We explore the property of the problem and find an optimal algorithm with polynomial complexity for a special case of it. However, finding the optimal execution plan for the general case is NP-hard. We propose an efficient heuristic algorithm for solving the problem and the simulation result shows its good quality. The research result can also be applied to the advanced database systems and the multidatabase systems where the conversion function defined for the need of schema integration can be considered a type of foreign functions.</p>
Distributed environment, foreign function, query optimization, response time model, simulation.
Pauray S.M. Tsai, Arbee L.P. Chen, "Optimizing Queries with Foreign Functions in a Distributed Environment", IEEE Transactions on Knowledge & Data Engineering, vol. 14, no. , pp. 809-824, July/August 2002, doi:10.1109/TKDE.2002.1019215