This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
On the Benefit of Processor Coallocation in Multicluster Grid Systems
June 2010 (vol. 21 no. 6)
pp. 778-789
Omer Ozan Sonmez, Technical University of Delft, Delft
Hashim Mohamed, Technical University of Delft, Delft
Dick H.J. Epema, Technical University of Delft, Delft
In multicluster grid systems, parallel applications may benefit from processor coallocation, that is, the simultaneous allocation of processors in multiple clusters. Although coallocation allows the allocation of more processors than available in a single cluster, it may severely increase the execution time of applications due to the relatively slow wide-area communication. The aim of this paper is to investigate the benefit of coallocation in multicluster grid systems, despite this drawback. To this end, we have conducted experiments in a real multicluster grid environment, as well as in a simulated environment, and we evaluate the performance of coallocation for various applications that range from computation-intensive to communication-intensive and for various system load settings. In addition, we compare the performance of scheduling policies that are specifically designed for coallocation. We demonstrate that considering latency in the resource selection phase improves the performance of coallocation, especially for communication-intensive parallel applications.

[1] H. Mohamed and D. Epema, "Koala: A Co-Allocating Grid Scheduler," Concurrency and Computation: Practice and Experience, vol. 20, no. 16, pp. 1851-1876, 2008.
[2] H.H. Mohamed and D.H.J. Epema, "An Evaluation of the Close-to-Files Processor and Data Co-Allocation Policy in Multiclusters," Proc. IEEE Int'l Conf. Cluster Computing (CLUSTER '04), pp. 287-298, 2004.
[3] O.O. Sonmez, H.H. Mohamed, and D.H.J. Epema, "Communication-Aware Job Placement Policies for the KOALA Grid Scheduler," Proc. Second IEEE Int'l Conf. e-Science and Grid Computing (E-SCIENCE '06), p. 79, 2006.
[4] "The Distributed ASCI Supercomputer," http://www.cs.vu.nldas3/, 2009.
[5] K. Czajkowski, I. Foster, and C. Kesselman, "Resource Co-Allocation in Computational Grids," Proc. Eighth IEEE Int'l Symp. High Performance Distributed Computing (HPDC '99), p. 37, 1999.
[6] R.V. van Nieuwpoort, J. Maassen, R. Hofman, T. Kielmann, and H.E. Bal, "Ibis: An Efficient Java-Based Grid Programming Environment," Proc. Joint ACM ISCOPE Conf. Java Grande (JGI '02), pp. 18-27, 2002.
[7] E. Gabriel, G.E. Fagg, G. Bosilca, T. Angskun, J.J. Dongarra, J.M. Squyres, V. Sahay, P. Kambadur, B. Barrett, A. Lumsdaine, R.H. Castain, D.J. Daniel, R.L. Graham, and T.S. Woodall, "Open MPI: Goals, Concept, and Design of a Next Generation MPI Implementation," Proc. 11th European PVM/MPI Users' Group Meeting, pp. 97-104, Sept. 2004.
[8] Sun Grid Computing, http://wwws.sun.com/softwaregrid/, 2009.
[9] The Prime Number Application, http://www.mhpcc.edu/ training/workshop/ mpi/samples/Cmpi_prime.c, 2009.
[10] H.H. Mohamed and D.H.J. Epema, "The Design and Implementation of the KOALA Co-Allocating Grid Scheduler," Proc. European Grid Conf., pp. 640-650, 2005.
[11] G.C. Fox, M.A. Johnson, G.A. Lyzenga, S.W. Otto, J.K. Salmon, and D.W. Walker, Solving Problems on Concurrent Processors. Vol. 1: General Techniques and Regular Problems. Prentice-Hall, Inc., 1988.
[12] A. Iosup and D.H.J. Epema, "Grenchmark: A Framework for Analyzing, Testing, and Comparing Grids," Proc. Sixth IEEE Int'l Symp. Cluster Computing and the Grid (CCGRID '06), pp. 313-320, 2006.
[13] "The Dynamically-Updated Request Online Coallocator (DUROC)," http://www.globus.org/toolkit/docs/2.4duroc /, 2009.
[14] "MPICH-G2," http://www3.niu.edumpi/, 2009.
[15] "Distributed Resource Management Application Api," http://www.drmaa.netw/, 2008.
[16] A. Iosup, O. Sonmez, and D. Epema, "DGSim: Comparing Grid Resource Management Architectures through Trace-Based Simulation," Proc. 14th Int'l Euro-Par Conf. Parallel Processing (Euro-Par '08), pp. 13-25, 2008.
[17] "Grid Ready MPI Library: MC-MPI," http://www.logos.ic.i. u-tokyo.ac.jp/ h_saitomcmpi/, 2008.
[18] "GridMPI," http:/www.gridmpi.org/, 2009.
[19] "Portable Batch System-PRO," http://www.pbspro.com platforms.html, 2009.
[20] "Maui Cluster Scheduler," http://www.clusterresources.com/pages/products maui-cluster-scheduler.php, 2009.
[21] F. Azzedin, M. Maheswaran, and N. Arnason, "A Synchronous Co-Allocation Mechanism for Grid Computing Systems," Cluster Computing, vol. 7, no. 1, pp. 39-49, 2004.
[22] J. Sauer, "Modeling and Solving Multi-Site Scheduling Problems," Planning in Intelligent Systems: Aspects, Motivations and Methods, A.M. Meystel, ed., pp. 281-299, Wiley, 2006.
[23] A.C. Sodan, C. Doshi, L. Barsanti, and D. Taylor, "Gang Scheduling and Adaptive Resource Allocation to Mitigate Advance Reservation Impact," Proc. Int'l Symp. Cluster Computing and the Grid (CCGRID), pp. 649-653, 2006.
[24] J. Li and R. Yahyapour, "Negotiation Model Supporting Co-Allocation for Grid Scheduling," Proc. IEEE/ACM Int'l Conf. Grid Computing, pp. 254-261, 2006.
[25] C. Qu, "A Grid Advance Reservation Framework for Co-Allocation and Co-Reservation across Heterogeneous Local Resource Management Systems," Proc. Int'l Conf. Parallel Processing and Applied Math. (PPAM), pp. 770-779, 2007.
[26] C. Castillo, G.N. Rouskas, and K. Harfoush, "Efficient Resource Management Using Advance Reservations for Heterogeneous Grids," Proc. IEEE Int'l Parallel and Distributed Processing Symp. (IPDPS '08), pp. 1-12, 2008.
[27] A.I.D. Bucur and D.H.J. Epema, "The Maximal Utilization of Processor Co-Allocation in Multicluster Systems," Proc. 17th Int'l Symp. Parallel and Distributed Processing (IPDPS '03), p. 60.1, 2003.
[28] A.I.D. Bucur and D.H.J. Epema, "The Performance of Processor Co-Allocation in Multicluster Systems," Proc. Third Int'l Symp. Cluster Computing and the Grid (CCGRID '03), p. 302, 2003.
[29] A.I.D. Bucur and D.H.J. Epema, "Scheduling Policies for Processor Coallocation in Multicluster Systems," IEEE Trans. Parallel Distributed Systems, vol. 18, no. 7, pp. 958-972, July 2007.
[30] C. Ernemann, V. Hamscher, U. Schwiegelshohn, A. Streit, and R. Yahyapour, "On Advantages of Grid Computing for Parallel Job Scheduling," Proc. Second IEEE/ACM Int'l Symp. Cluster Computing and the Grid (CCGRID '02), pp. 39-46, May 2002.
[31] T. Roblitz and A. Reinefeld, "Co-Reservation with the Concept of Virtual Resources," Proc. Fifth IEEE Int'l Symp. Cluster Computing and the Grid (CCGrid '05), pp. 398-406, 2005.
[32] T. Röblitz, F. Schintke, and A. Reinefeld, "Resource Reservations with Fuzzy Requests: Research Articles," Concurrency and Computation: Practice and Experience, vol. 18, no. 13, pp. 1681-1703, 2006.
[33] W. Jones, L. Pang, W. Ligon, and D. Stanzione, "Bandwidth-Aware Co-Allocating Meta-Schedulers for Mini-Grid Architectures," Proc. IEEE Int'l Conf. Cluster Computing, pp. 45-54, 2004.
[34] H. Dail, F. Berman, and H. Casanova, "A Decoupled Scheduling Approach for Grid Application Development Environments," J. Parallel and Distributed Computing, vol. 63, no. 5, pp. 505-524, 2003.
[35] A. Plaat, H.E. Bal, and R.F.H. Hofman, "Sensitivity of Parallel Applications to Large Differences in Bandwidth and Latency in Two-Layer Interconnects," Future Generation Computer Systems, vol. 17, no. 6, pp. 769-782, 2001.
[36] T. Kielmann, H.E. Bal, S. Gorlatch, K. Verstoep, and R.F. Hofman, "Network Performance-Aware Collective Communication for Clustered Wide-Area Systems," Parallel Computing, vol. 27, no. 11, pp. 1431-1456, 2001.
[37] R.V. van Nieuwpoort, T. Kielmann, and H.E. Bal, "Efficient Load Balancing for Wide-Area Divide-and-Conquer Applications," Proc. Eighth ACM SIGPLAN Symp. Principles and Practices of Parallel Programming (PPoPP '01), pp. 34-43, 2001.
[38] F.J. Seinstra and J.M. Geusebroek, "Color-Based Object Recognition by a Grid-Connected Robot Dog," Proc. Conf. Computer Vision and Pattern Recognition (CVPR), 2006.
[39] F.J. Seinstra, J.-M. Geusebroek, D. Koelma, C.G. Snoek, M. Worring, and A.W. Smeulders, "High-Performance Distributed Video Content Analysis with Parallel-Horus," IEEE MultiMedia, vol. 14, no. 4, pp. 64-75, Oct.-Dec. 2007.

Index Terms:
Coallocation, grid, multicluster, parallel job scheduling.
Citation:
Omer Ozan Sonmez, Hashim Mohamed, Dick H.J. Epema, "On the Benefit of Processor Coallocation in Multicluster Grid Systems," IEEE Transactions on Parallel and Distributed Systems, vol. 21, no. 6, pp. 778-789, June 2010, doi:10.1109/TPDS.2009.121
Usage of this product signifies your acceptance of the Terms of Use.