
This Article  
 
Share  
Bibliographic References  
Add to:  
Digg Furl Spurl Blink Simpy Del.icio.us Y!MyWeb  
Search  
 
ASCII Text  x  
Myra Spiliopoulou, Michael Hatzopoulos, Yannis Cotronis, "Parallel Optimization of Large Join Queries with Set Operators and Aggregates in a Parallel Environment Supporting Pipeline," IEEE Transactions on Knowledge and Data Engineering, vol. 8, no. 3, pp. 429445, June, 1996.  
BibTex  x  
@article{ 10.1109/69.506710, author = {Myra Spiliopoulou and Michael Hatzopoulos and Yannis Cotronis}, title = {Parallel Optimization of Large Join Queries with Set Operators and Aggregates in a Parallel Environment Supporting Pipeline}, journal ={IEEE Transactions on Knowledge and Data Engineering}, volume = {8}, number = {3}, issn = {10414347}, year = {1996}, pages = {429445}, doi = {http://doi.ieeecomputersociety.org/10.1109/69.506710}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, }  
RefWorks Procite/RefMan/Endnote  x  
TY  JOUR JO  IEEE Transactions on Knowledge and Data Engineering TI  Parallel Optimization of Large Join Queries with Set Operators and Aggregates in a Parallel Environment Supporting Pipeline IS  3 SN  10414347 SP429 EP445 EPD  429445 A1  Myra Spiliopoulou, A1  Michael Hatzopoulos, A1  Yannis Cotronis, PY  1996 KW  Parallel query optimization KW  parallelism in optimization KW  iterative improvement KW  large join queries KW  bushy parallelism KW  pipeline KW  shareddisk architectures KW  query optimization KW  parallelism KW  databases. VL  8 JA  IEEE Transactions on Knowledge and Data Engineering ER   
Abstract—We propose a parallel optimizer for queries containing a large number of joins, as well as set operators and aggregate functions. The platform of execution is a shareddisk multiprocessor machine supporting bushy parallelism and pipeline. Our model partitions the query into almost independent subtrees that can be optimized simultaneously and applies an enhanced variation of the iterative improvement technique on those of the subtrees, which contain a large number of joins. This technique is parallelized, too. In order to estimate the cost of the states constructed during optimization of join subtrees, cost formulae are developed that estimate the cost of relational algebra operators when executed across coalescing pipes.
[1] M.S. Chen,P.S. Yu,, and K.L. Wu,“Scheduling and processor allocation for parallel execution of multijoin queries,” Proc. Eighth Int’l Conf. Data Engineering, pp. 5867, Feb. 1992.
[2] U. Dayal, “Of Nests and Trees: A Unified Approach to Processing Queries that Contain Nested Queries, Aggregates, and Quantifiers,” Proc. Very Large Databases, 1987.
[3] C. GalindoLegaria, A. Pellenkoft, and M. Kersten, "Fast, Randomized JoinOrder Selection—Why Use Transformations?" Proc. Int'l Conf. Very Large Databases, pp. 8595,Santiago, Chile, 1994.
[4] S. Ganguly, W. Hasan, and R. Krishnamurthy,“Query optimization for parallel execution,”inProc. ACM SIGMOD, June 1992, pp. 9–18.
[5] G. Graefe, "Query Evaluation Techniques for Large Databases," ACM Computing Surveys, vol. 25, no. 2, pp. 73170, June 1993.
[6] W. Hasan and R. Motwani, "Optimization Algorithms for Exploiting the ParallelismCommunication Tradeoff in Pipelined Parallelism" Proc. Int'l Conf. Very Large Databases, pp. 3647,Santiago, Chile, 1994.
[7] W. Hong,“Exploiting interoperator parallelism in XPRS,”inProc. ACM SIGMOD, San Diego, CA, June 1992, pp. 19–28.
[8] T. Hu, A.B. Kahng, and C.W.A. Tsao, "Old Bachelor Acceptance: A New Class of NonMonotone Threshold Accepting Methods" technical report, UCLA Dept. of Computer Science, Los Angeles, and UC San Diego Computer Science and Engineering Dept., La Jolla, Calif., 1995.
[9] Y.E. Ioannidis and Y.C. Kang,“Randomized algorithms for optimizing large join queries,” Proc. ACMSIGMOD Conf., vol. 19, pp. 312321, 1990.
[10] Y.E. Ioannidis and Y.C. Kang,“Leftdeep vs. bushy trees: An analysis of strategy spaces and its implication for query optimization,” Proc. ACMSIGMOD Conf., vol. 20, pp. 168177, 1991.
[11] Y. Ioannidis, R.T. Ng, K. Shim, and T.K. Sellis, "Parametric Query Optimisation" Proc. Int'l Conf. Very Large Databases, pp. 103114,Vancouver, Canada, 1992.
[12] Y.E. Ioannidis and E. Wong,“Query optimization by simulated annealing,” Proc. ACMSIGMOD Conf., pp. 922, 1987.
[13] M. Jarke and J. Koch, “Query Optimization in Database Systems,” ACM Computer Surveys, vol. 16, pp. 111–152, 1984.
[14] W. Kim, “On Optimizing an SQLlike Nested Query,” ACM Trans. Data Systems, Sept. 1982.
[15] R. Krishnamurthy, H. Boral, and C. Zaniolo,“Optimization of nonrecursive queries,”inProc. 12th Int. Conf. Very Large Databases, Kyoto, Japan, Aug. 1986, pp. 128–137.
[16] R.S.G. Lanzelotte,P. Valduriez,, and M. Zaït,“On the effectiveness of optimization search strategies for parallel execution spaces,” Proc. 19th Int’l Conf. Very Large Databases, pp. 493504,Dublin, 1993.
[17] E. Lin, E. Omiecinski, and S. Yalamanchili, "Large Join Optimization on a Hypercube Multiprocessor" IEEE Trans. Knowledge and Data Eng., vol. 6, no. 2, pp. 304315, 1994.
[18] H. Lu, M.C. Shan, and K.L. Tan,“Optimization of multiway join queries for parallel execution,”inProc. 17th Int. Conf. Very Large Databases, Barcelona, Spain, Sept. 1991, pp. 549–560.
[19] T. Morzy, M. Matysiak, and S. Salza, "Tabu Search Optimization of Large Join Queries" Proc. EDBT '94 Int'l Conf., pp. 309322,Cambridge, U.K., SpringerVerlag, 1994.
[20] C.H. Papadimitriu and K. Steiglitz, Combinatorial Optimization: Algorithms and Complexity. Prentice Hall, 1987.
[21] G. Patil, M. Boswell, S. Joshi, and M. Ratnaparkhi, "Discrete Models" Dictionary and Classified Bibliography of Statistical Distributions in Scientific Work, vol. 1. Maryland: International Cooperative Publications House, 1984.
[22] D.A. Schneider, "Complex Query Processing in Multiprocessor Database Machines" Technical Report TR965, Univ. of Wisconsin, Madison, 1990.
[23] P. Selinger,D. Astrahan,D. Chamberlin,R. Lorie,, and T. Price,“Access path selection in a relational database management system,” Proc. 1979 ACMSIGMOD Int’l Conf. Management of Data, pp. 2334,Boston, May 1979.
[24] L. Shapiro, "Join Processing in Database Systems with Large Main Memories," ACM Trans. Database Systems, vol. 11, no. 3, Sept. 1986.
[25] E. Shekita, H. C. Young, and K. Tan,“Multijoin optimization for symmetric multiprocessors,”inProc. 19th Int. Conf. Very Large Databases, Aug. 1993, pp. 479–492.
[26] M. Spiliopoulou, "Parallel Optimization and Execution of Queries towards an RDBMS in a Parallel Environment Supporting Pipeline" (in Greek), PhD thesis, Dept. of Informatics, Univ. of Athens, Athens, Greece, 1992.
[27] M. Spiliopoulou, Y. Cotronis, and M. Hatzopoulos, "Parallel Optimisation of Join Queries Using an Enhanced Iterative Improvement Technique" Proc. 1993 PARLE Conf., Poster Session, pp. 716719,Munich, Germany, 1993.
[28] M. Spiliopoulou and J.C. Freytag, "Modelling Resource Utilization in Pipelined Query Execution" Proc. EuroPar Conf.,Lyon, France, to appear in 1996.
[29] M. Spiliopoulou and M. Hatzopoulos, "Translation of SQL Queries into a Graph Structure: Query Transformations and Preoptimisation Issues in a Pipeline Multiprocessor Environment" Information Systems, vol. 17, no. 2, pp. 161170, 1992.
[30] M. Spiliopoulou, M. Hatzopoulos, and C. Vassilakis, "Using Parallelism and Pipeline for the Optimisation of Join Queries" Proc. 1992 PARLE Conf., pp. 279294,Paris, 1992.
[31] M. Spiliopoulou, M. Hatzopoulos, and C. Vassilakis, "Parallel Optimization of Join Queries Using a Technique of Exhaustive Nature" Computers and Artificial Intelligence, vol. 12, no. 2, pp. 145166, 1993.
[32] M. Spiliopoulou, M. Hatzopoulos, and C. Vassilakis, "A Cost Model for the Estimation of Query Execution Time in a Parallel Environment Supporting Pipeline" Computers and Artificial Intelligence, to appear in 1996.
[33] M. Steinbrunn, G. Moerkotte, and A. Kemper, "Optimizing Join Orders" Technical Report MIP9307, Faculty of Mathematic, Univ. of Passau, Passau, Germany, 1993.
[34] A. Swami,“Optimization of large join queries: Combining heuristics with combinatorial techniques,”inProc. ACM SIGMOD, Chicago, IL, June 1989, pp. 367–376.
[35] A. Swami and A. Gupta,“Optimization of large join queries,” Proc. ACMSIGMOD Conf., pp. 817, 1988.
[36] M. Ziane, M. Zaït, and P. BorlaSalamet, "Parallel Query Processing with Zigzag Trees" The VLDB J., vol. 2, no. 3, pp. 277301, 1993.