This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Control Versus Data Flow in Parallel Database Machines
November 1993 (vol. 4 no. 11)
pp. 1265-1279

The execution of a query in a parallel database machine can be controlled in either acontrol flow way, or in a data flow way. In the former case a single system node controlsthe entire query execution. In the latter case the processes that execute the query,although possibly running on different nodes of the system, trigger each other. Lately,many database research projects focus on data flow control since it should enhanceresponse times and throughput. The authors study control versus data flow with regardto controlling the execution of database queries. An analytical model is used to comparecontrol and data flow in order to gain insights into the question which mechanism isbetter under which circumstances. Also, some systems using data flow techniques aredescribed, and the authors investigate to which degree they are really data flow. Theresults show that for particular types of queries data flow is very attractive, since it reduces the number of control messages and balances these messages over the nodes.

[1] D. J. Dewitt and J. Gray, "Parallel database systems: The future of database processing or a passing fad?"SIGMOD Rec., vol. 19, pp. 104-112, Dec. 1990.
[2] M. Stonebraker, "The case for shared nothing,"IEEE Data Eng., vol. 9, no. 1, pp. 4-9, 1986.
[3] K. Salem and H. Garcia-Molina, "Disk striping," inProc. IEEE Data Eng., pp. 336-342, Feb. 1986.
[4] D. Dewitt, R. H. Gerber, G. Graefe, M. L. Heytens, K. B. Kumar, and M. Muralikrishna, "GAMMA--A high performance dataflow database machine," inProc. 12th Int. Conf. VLDB, Kyoto, Japan, Aug. 1986, pp. 228-237.
[5] H. Boral, W. Alexander, L. Clay, G. Copeland, S. Danforth, M. Franklin, B. Hart, M. Smith, and P. Valduriez, "Prototyping Bubba, a highly parallel database system,"IEEE Trans. Knowledge Data Eng., vol. 2, no. 1, pp. 4-24, 1990.
[6] D. J. DeWitt, S. Ghandeharizadeh, D. A. Schneider, A. Bricker, H. I. Hsiao, and R. Rasmussen, "The Gamma database machine project,"IEEE Trans. Knowledge Data Eng., vol. 2, no. 1, pp. 44-62, 1990.
[7] P. M. G. Apers, C. A. van den Berg, J. Flokstra, P. W. P. J. Grefen, M. L. Kersten, and A. N. Wilschut, "PRISMA/DB: A parallel, mainmemory relational DBMS,"IEEE Trans. Knowledge Data Eng., vol. 4, no. 6, Dec. 1992.
[8] D. Schneider and D. Dewitt, "A performance evaluation of four parallel join algorithms in a shared-nothing multiprocessor environment," inProc. ACM SIGMOD Conf.(Portland, OR), May-June 1989, p. 110.
[9] P. C. Treleaven, D. R. Brownbridge, and R. P. Hopkins, "Data-driven and demand-driven computer architecture,"ACM Comput. Surveys, vol. 14, pp. 93-143, Mar. 1982.
[10] A. N. Wilschut and P. M. G. Apers, "Dataflow query execution in a parallel main memory environment," inProc. First Int. Conf. on Parallel and Distributed Information Syst., Miami Beach, FL, Dec. 1991.
[11] D. Bitton, D. J. DeWitt, and C. Turbyfill, "Benchmarking database systems--A systematic approach," inProc. 1983 Very Large Data-base conf., Oct. 1983.
[12] A. S. Tanenbaum, R. van Renesse, H. van Staveren, G. J. Sharp, S. J. Mullender, J. Jansen, and G. van Rossum, "Experiences with the Amoeba distributed operating system," Vrije Universiteit, Amsterdam, The Netherlands, IR-194, July 1989.
[13] G. Copeland, W. Alexander, E. Boughter, and T. Keller, "Data placement in bubba," inProc. ACM SIGMOD, Chicago, IL, June 1-3, 1988, pp. 99-109.
[14] W. Alexander and G. Copeland, "Process and dataflow control in distributed data-intensive systems," inProc. ACM SIGMOD Conf., Chicago, IL, June 1988.
[15] P. America, Ed.,Parallel Database Systems. Proceedings PRISMA Workshop, Noordwijk, The Netherlands, Sept. 24-26, 1990, Lecture Notes in Computer Science, vol 503. Berlin, Germany: Springer-Verlag, 1990.

Index Terms:
Index Termsparallel database machines; data flow; control flow; query execution; database queries;control messages; database system performance; distributed databases; local areanetworks; message management; parallel query execution; distributed databases; parallelmachines; query processing
Citation:
W.B. Teeuw, H.M. Blanken, "Control Versus Data Flow in Parallel Database Machines," IEEE Transactions on Parallel and Distributed Systems, vol. 4, no. 11, pp. 1265-1279, Nov. 1993, doi:10.1109/71.250104
Usage of this product signifies your acceptance of the Terms of Use.