This Article 
 Bibliographic References 
 Add to: 
The Query Clustering Problem: A Set Partitioning Approach
December 1995 (vol. 7 no. 6)
pp. 885-899

Abstract—In this research, we address the query clustering problem which involves determining globally optimal execution strategies for a set of queries. The need to process a set of queries together often arises in deductive database systems, scientific database systems, large bibliographic retrieval systems and several other database applications. We address the optimization problem from the perspective of overlaps in data requirements, and model the batched operations using a set-partitioning approach In this model, we first consider the case of m queries each involving a two-way join operation. We develop a recursive methodology to determine all the processing strategies in this case. Next, we establish certain dominance properties among the strategies, and develop exact as well as heuristic algorithms for selecting an appropriate strategy. We extend this analysis to a clustering approach, and outline a framework for optimizing multiway joins. The results show that the proposed approach is viable and efficient, and can easily be incorporated into the query processing component of most database systems.

[1] U.S. Chakravarthy and J. Minker,“Processing multiple queries in database systems,” Database Engineering, vol. 5, no. 3, 1982.
[2] W.R. Dillon and M. Goldstein,Multivariate Analysis—Methods and Applications.New York: John Wiley, 1984.
[3] S. Finkelstein,“Common expression analysis in database applications,” Proc. ACM-SIGMOD Int’l Conf. Management of Data,Orlando, Fla., 1982.
[4] H. Gallaire and J. Minker,Logic and Databases.New York: Plenum Press, 1978.
[5] J. Grant and J. Minker,“Optimization in conventional and relational database systems,” Advances in Database Theory—Vol. 1, H. Gallaire, J. Minker, and J.M. Nichols, eds. New York: Plenum Press, 1981.
[6] M. Jarke,“Common subexpression isolation in multiple query optimization,” Query Processing in Database Systems, W. Kim, D. Reiner, and D. Batory, eds. New York: Springer-Verlag, 1984.
[7] W. Kim,“Global optimization of relational queries—A first step,” Query Processing in database Systems, W. Kim, D. Reiner, and D. Batory, eds. New York: Springer-Verlag, 1984.
[8] D.E. Knuth, The Art of Computer Programming. Addison-Wesley, 1973.
[9] R. Krishnamurthy, H. Boral, and C. Zaniolo,“Optimization of nonrecursive queries,”inProc. 12th Int. Conf. Very Large Databases, Kyoto, Japan, Aug. 1986, pp. 128–137.
[10] P. Larson and H. Yang,“Computing queries from derived relations,” Proc. Int’l Conf. Very Large Databases,Stockholm, Sweden, 1985.
[11] J. Park and A. Segev,“Using common subexpressions to optimize multiple queries,” Proc. Fourth Int’l Conf. Data Engineering,Los Angeles, 1988.
[12] A. Rosenthal and S. Chakravarthy,“Anatomy of a modular multiple query optimizer,” Proc. Int’l Conf. Very Large Data Bases, pp. 230-239,Los Angeles, Sept. 1988.
[13] N. Roussopoulos, "The Incremental Access Method of View Cache: Concept, Algorithms, and Cost Analysis," ACM Trans. Database Systems, vol. 16, no. 3, pp. 535-563, Sept. 1991.
[14] K. Satoh,M. Tsuchida,F. Nakamura,, and K. Oomachi,“Local and global query optimization mechanisms for relational databases,” Proc. Int’l Conf. Very Large Databases,Stockholm, Sweden, 1985.
[15] P. Selinger,D. Astrahan,D. Chamberlin,R. Lorie,, and T. Price,“Access path selection in a relational database management system,” Proc. 1979 ACM-SIGMOD Int’l Conf. Management of Data, pp. 23-34,Boston, May 1979.
[16] T.K. Sellis,“Multiple query optimization,” ACM Trans. Database Systems, vol. 13, pp. 23-52, 1988.

Index Terms:
Batch processing, clustering techniques, combinatorial optimization, join operator, query clustering relational databases, set partitions.
Ram D. Gopal, R. Ramesh, "The Query Clustering Problem: A Set Partitioning Approach," IEEE Transactions on Knowledge and Data Engineering, vol. 7, no. 6, pp. 885-899, Dec. 1995, doi:10.1109/69.476495
Usage of this product signifies your acceptance of the Terms of Use.