Issue No. 02 - March/April (1999 vol. 11)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/69.761667
<p><b>Abstract</b>—Traditional sampling-based estimators infer the actual selectivity of a query purely from information gathered at run time and discard previously collected information, so they underutilize the information available. Table-based and parametric estimators extrapolate the actual selectivity of a query from previously collected information only and ignore on-line information, which leads to inaccurate estimates in a frequently updated environment. We propose a novel hybrid estimator that optimally combines on-line and previously collected information. Theoretical analysis demonstrates that the two kinds of information are complementary and that using them together yields a further performance improvement. Our theoretical results are validated by a comprehensive experimental study using a practical database, in the presence of insert, delete, and update operations. The hybrid approach is very promising in that it provides an adaptive mechanism for optimally combining information obtained from different sources in order to achieve higher estimation accuracy and reliability.</p>
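As a rough illustration of combining an on-line estimate with a previously collected one, the sketch below uses inverse-variance weighting, a standard way to merge two independent unbiased estimates with minimum variance. The abstract does not specify the paper's actual combination rule, so this is a stand-in technique, not the authors' estimator; the function names, the add-one smoothing of the variance, and the example figures are all assumptions made for illustration.

```python
import random


def sample_selectivity(rows, predicate, n, rng):
    """On-line estimate: fraction of n sampled rows that satisfy the predicate.

    Hypothetical helper. Samples with replacement and returns
    (estimate, approximate variance of the estimate).
    """
    hits = sum(1 for _ in range(n) if predicate(rng.choice(rows)))
    p = hits / n
    # Smoothed proportion keeps the variance strictly positive
    # even when the sample contains zero or all hits (assumption).
    p_smooth = (hits + 1) / (n + 2)
    return p, p_smooth * (1 - p_smooth) / n


def combine(est_a, var_a, est_b, var_b):
    """Inverse-variance weighted combination of two independent estimates.

    The weight on estimate A grows as the variance of estimate B grows.
    Returns (combined estimate, combined variance).
    """
    w_a = var_b / (var_a + var_b)
    est = w_a * est_a + (1 - w_a) * est_b
    var = (var_a * var_b) / (var_a + var_b)
    return est, var


if __name__ == "__main__":
    rng = random.Random(0)
    rows = list(range(1000))
    online, online_var = sample_selectivity(rows, lambda x: x < 200, 100, rng)
    # Stored estimate and its variance are made-up illustrative values,
    # e.g. from a histogram that has become stale after updates.
    stored, stored_var = 0.25, 0.004
    print(combine(online, online_var, stored, stored_var))
```

With this weighting the combined variance never exceeds the smaller of the two input variances, which is one simple sense in which the two sources can be complementary.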
Keywords: Hybrid estimator, sampling estimator, parametric estimator, table-based estimator, query optimization, estimation accuracy, estimation reliability.
W. Sun, X. Xiang, Y. Ling and N. D. Rishe, "A Hybrid Estimator for Selectivity Estimation," in IEEE Transactions on Knowledge & Data Engineering, vol. 11, no. 2, pp. 338-354, 1999.