This Article 
 Bibliographic References 
 Add to: 
Resource-Aware Distributed Scheduling Strategies for Large-Scale Computational Cluster/Grid Systems
October 2007 (vol. 18 no. 10)
pp. 1450-1461
In this paper, we propose distributed algorithms, referred to as Resource Aware Dynamic Incremental Scheduling (RADIS) strategies. Our strategies are specifically designed to handle large volumes of computationally intensive arbitrarily divisible loads submitted for processing at Cluster / Grid systems involving multiple sources and sinks (processing nodes). We consider a reallife scenario wherein buffer space (memory) available at the sinks (required for holding and processing the loads) vary over time and the loads have deadlines, and propose efficient "pull-based" scheduling strategies with admission control policy that ensures that the admitted loads are processed satisfying their deadline requirements. The design of our proposed strategies adopts the divisible load paradigm, referred to as divisible load theory (DLT), which is shown to be efficient in handling large volume loads. We demonstrate detailed workings of the proposed algorithms via a simulation study using real-life parameters obtained from a major physics experiment.

[1] H.M. Wong, V. Bharadwaj, and B. Gerassimos, “Design and Performance Evaluation of Load Distribution Strategies for Multiple Divisible Loads on Heterogeneous Linear Daisy Chain Networks,” J. Parallel and Distributed Computing, vol. 65, no. 12, pp.1558-1577, Dec. 2005.
[2] D. Ghose, H.J. Kim, and T.H. Kim, “Adaptive Divisible Load Scheduling Strategies for Workstation Clusters with Unknown Network Resources,” IEEE Trans. Parallel and Distributed Systems, vol. 16, no. 10, pp. 897-907, Oct. 2005.
[3] L. Ping, B. Veeravalli, and A.A. Kassim, “Design and Implementation of Parallel Video Encoding Strategies Using Divisible Load Analysis,” IEEE Trans. Circuits and Systems for Video Technology, vol. 15, no. 9, pp. 1098-1112, Sept. 2005.
[4] L. Marchal, Y. Yang, H. Casanova, and Y. Robert, “A Realistic Network/Application Model for Scheduling Divisible Loads on Large-Scale Platforms,” Proc. 19th Int'l Parallel and Distributed Processing Symp. (IPDPS '05), p. 48b, Apr. 2005.
[5] O. Beaumont, H. Casanova, A. Legrand, Y. Robert, and Y. Yang, “Scheduling Divisible Loads on Star and Tree Networks: Results and Open Problems,” IEEE Trans. Parallel and Distributed Systems, vol. 16, no. 3, pp. 207-218, Mar. 2005.
[6] B. Veeravalli, “Design and Performance Analysis of Heuristic Load Balancing Strategies for Processing Divisible Loads on Ethernet Clusters,” Int'l J. Computers and Applications, vol. 27, no. 2, pp. 97-107, 2005.
[7] M.A. Moges, D. Yu, and T.G. Robertazzi, “Grid Scheduling Divisible Loads from Multiple Sources via Linear Programming,” Proc. 17th Int'l Conf. Parallel and Distributed Computing and Systems (PDCS '04), pp. 423-428, Nov. 2004.
[8] M. Wu and X.H. Sun, “Memory-Conscious Task Partition and Scheduling in Grid Environments,” Proc. Fifth IEEE/ACM Int'l Workshop Grid Computing (Grid '04), pp. 138-145, Nov. 2004.
[9] S. Kim and J.B. Weissman, “A Genetic Algorithm-Based Approach for Scheduling Decomposable Data Grid Applications,” Proc. 33rd Int'l Conf. Parallel Processing (ICPP '04), vol. 1, pp. 406-413, Aug. 2004.
[10] H.M. Wong, D. Yu, B. Veeravalli, and T.G. Robertazzi, “Data-Intensive Grid Scheduling: Multiple Sources with Capacity Constraints,” Proc. 16th Int'l Conf. Parallel and Distributed Computing and Systems (PDCS '03), pp. 7-11, Nov. 2003.
[11] A.E. Darling, L. Carey, and W. Feng, “The Design, Implementation and Evaluation of mpiBLAST,” Proc. Fourth LCI Int'l Conf. Linux Clusters: The HPC Revolution 2003, June 2003.
[12] V. Bharadwaj, D. Ghose, and T.G. Robertazzi, “Divisible Load Theory: A New Paradigm for Load Scheduling in Distributed Systems,” Cluster Computing on Divisible Load Scheduling, vol. 6, no. 1, pp. 7-18, Jan. 2003.
[13] H.J. Kim, “A Novel Optimal Load Distribution Algorithm for Divisible Loads,” Cluster Computing on Divisible Load Scheduling, vol. 6, no. 1, pp. 41-46, Jan. 2003.
[14] B. Veeravalli and S. Ranganath, “Theoretical and Experimental Study on Large-Size Image Processing Applications Using Divisible Load Paradigm on Distributed Bus Networks,” Image and Vision Computing, vol. 20, nos. 13-14, pp. 917-936, Dec. 2002.
[15] S.K. Chan, V. Bharadwaj, and D. Ghose, “Large Matrix-Vector Products on Distributed Bus Networks with Communication Delays Using the Divisible Load Paradigm: Performance Analysis and Simulation,” Math. and Computers in Simulation, vol. 58, pp. 71-79, 2001.
[16] X. Li, V. Bharadwaj, and C.C. Ko, “Divisible Load Scheduling on Single-Level Tree Networks with Buffer Constraints,” IEEE Trans. Aerospace and Electronic Systems, vol. 36, no. 4, pp. 1298-1308, Oct. 2000.
[17] M. Drozdowski and P. Wolniewicz, “Experiments with Scheduling Divisible Tasks in Clusters of Workstations,” Proc. Sixth European Conf. Parallel Computing (Euro-Par '00), pp. 311-319, 2000.
[18] J. Blazewicz, K. Ecker, B. Plateau, and D. Trystram, Handbook on Parallel and Distributed Processing. Springer, 2000.
[19] I. Foster and C. Kesselman, The Grid: Blueprint for a New Computing Infrastructure. Morgan Kaufmann, 1999.
[20] H. Attiya and J. Welch, Distributed Computing: Fundamentals, Simulations and Advanced Topics. McGraw Hill, 1998.
[21] D.A.L. Piriyakumar and C.S.R. Murthy, “Distributed Computation for a Hypercube Network of Sensor-Driven Processors with Communication Delays Including Setup Time,” IEEE Trans. Systems, Man, and Cybernetics, Part A, vol. 28, no. 2, pp. 245-251, Mar. 1998.
[22] V. Bharadwaj, D. Ghose, V. Mani, and T.G. Robertazzi, Scheduling Divisible Loads in Parallel and Distributed Systems. CS Press, Sept. 1996.
[23] R. Agrawal and H.V. Jagadish, “Partitioning Technologies for Large-Grained Parallelism,” IEEE Trans. Computers, vol. 37, no. 12, pp. 1627-1634, Dec. 1988.
[24] Y.C. Cheng and T.G. Robertazzi, “Distributed Computation with Communication Delays,” IEEE Trans. Aerospace and Electronic Systems, vol. 24, no. 6, pp. 700-712, Nov. 1988.

Index Terms:
Divisible loads, Grid computing, Cluster computing, buffer constraints, processing time, deadlines
Sivakumar Viswanathan, Bharadwaj Veeravalli, Thomas G. Robertazzi, "Resource-Aware Distributed Scheduling Strategies for Large-Scale Computational Cluster/Grid Systems," IEEE Transactions on Parallel and Distributed Systems, vol. 18, no. 10, pp. 1450-1461, Oct. 2007, doi:10.1109/TPDS.2007.1073
Usage of this product signifies your acceptance of the Terms of Use.