This Article 
 Bibliographic References 
 Add to: 
On the Design of Adaptive and Decentralized Load Balancing Algorithms with Load Estimation for Computational Grid Environments
December 2007 (vol. 18 no. 12)
pp. 1675-1686
In this paper, we address several issues that are imperative to Grid environments such as, handling resource heterogeneity and sharing, communication latency, job migration from one site to other, and load balancing.We address these issues by proposing two job migration algorithms, which are MELISA (Modified ELISA) and LBA (Load Balancing on Arrival). The algorithms differ in the way load balancing is carried out and is shown to be efficient in minimizing the response time on large and small scale heterogeneous Grid environments, respectively. MELISA, applicable to large scale systems (i.e., interGrid [1]), is a modified version of ELISA [2] in which we consider job migration cost, resource heterogeneity and network heterogeneity when load balancing is considered. LBA algorithm, applicable for small scale systems (i.e., intraGrid [1]), performs load balancing by estimating expected finish time of job on buddy processors on each job arrival. Both algorithms estimate system parameters such as job arrival rate, CPU processing rate, load at processor and balance the load by migrating jobs to buddy processors by taking into account job transfer cost, resource heterogeneity and network heterogeneity. We quantify the performance of our algorithms using several influencing parameters such as, job size, data transfer rate, status exchange period, migration limit, and we discuss the implications of the performance and choice of our approaches.

[1] I. Foster and C. Kesselman, The Grid: Blueprint for a Future Computing Infrastructure. Morgan Kaufmann, 1999.
[2] L. Anand, D. Ghose, and V. Mani, “ELISA: An Estimated Load Information Scheduling Algorithm for Distributed Computing Systems,” Int'l J. Computers and Math. with Applications, vol. 37, no. 8, pp. 57-85, Apr. 1999.
[3] I. Foster, C. Kesselman, and S. Tuecke, “The Anatomy of the Grid: Enabling Scalable Virtual Organizations,” Int'l J. High Performance Computing Applications, vol. 15, no. 3, pp. 200-222, 2001.
[4] L. Smarr and C.E. Catlett, “Metacomputing,” Comm. ACM, vol. 35, no. 6, pp. 44-52, June 1992.
[5] Y. Feng, D. Li, H. Wu, and Y. Zhang, “A Dynamic Load Balancing Algorithm Based on Distributed Database System,” Proc. Fourth Int'l Conf. High-Performance Computing in the Asia-Pacific Region, pp. 949-952, May 2000.
[6] M. Willebeek-LeMair and A. Reeves, “Strategies for Dynamic Load Balancing on Highly Parallel Computers,” IEEE Trans. Parallel and Distributed Systems, vol. 9, no. 4, pp. 979-993, Sept. 1993.
[7] N. Shivaratri, P. Krueger, and M. Singhal, “Load Distributing for Locally Distributed Systems,” Computer, vol. 25, no. 12, pp. 33-44, Dec. 1992.
[8] H. Lin and C. Raghavendra, “A Dynamic Load-Balancing Policy with a Central Job Dispatcher (LBC),” IEEE Trans. Software Eng., vol. 18, no. 2, pp. 148-158, Feb. 1992.
[9] J. Watts and S. Taylor, “A Practical Approach to Dynamic Load Balancing,” IEEE Trans. Parallel and Distributed Systems, vol. 9, no. 3, pp. 235-248, Mar. 1998.
[10] G. Manimaran and C. Siva Ram Murthy, “An Efficient Dynamic Scheduling Algorithm for Multiprocessor Real-Time Systems,” IEEE Trans. Parallel and Distributed Systems, vol. 9, no. 3, pp. 312-319, Mar. 1998.
[11] M.J. Zaki and W.L.S. Parthasarathy, “Customized Dynamic Load Balancing for a Network of Workstations,” J. Parallel and Distributed Computing, vol. 43, no. 2, pp. 156-162, June 1997.
[12] J. Krallmann, U. Schwiegelshohn, and R. Yahyapour, “On the Design and Evaluation of Job Scheduling Algorithms,” Proc. Fifth Workshop Job Scheduling Strategies for Parallel Processing, pp. 17-42, 1999.
[13] D.G. Feitelson, L. Rudolph, U. Schwiegelshohn, K.C. Sevcik, and P. Wong, “Theory and Practice in Parallel Job Scheduling,” Proc. Third Workshop Job Scheduling Strategies for Parallel Processing, pp. 1-34, 1997.
[14] R. Shah, B. Veeravalli, and M. Misra, “Estimation Based Load Balancing Algorithm for Data-Intensive Heterogeneous Grid Environments,” Proc. 13th Int'l Conf. High Performance Computing (HiPC '06), pp. 72-83, 2006.
[15] Y. Murata, H. Takizawa, T. Inaba, and H. Kobayashi, “A Distributed and Cooperative Load Balancing Mechanism for Large-Scale P2P Systems,” Proc. Int'l Symp. Applications and Internet (SAINT '06) Workshops, pp. 126-129, Jan. 2006.
[16] L. Oliker, R. Biswas, H. Shan, and W. Smith, “Job Scheduling in Heterogeneous Grid Environment,” Technical Report LBNL-54906, Lawrence Berkeley Nat'l Laboratory, 2004.
[17] Z. Zeng and B. Veeravalli, “Design and Analysis of a Non-Preemptive Decentralized Load Balancing Algorithm for Multi-Class Jobs in Distributed Networks,” Computer Comm., vol. 27, pp.679-693, 2004.
[18] H. Shan, L. Oliker, and R. Biswas, “Job Superscheduler Architecture and Performance in Computational Grid Environments,” Proc. ACM/IEEE Conf. Supercomputing, Nov. 2003.
[19] Y. Wong, K. Leung, and K. Lee, “A Stochastic Load Balancing Algorithm for I-Computing,” Concurrency and Computation: Practice and Experience, vol. 15, no. 1, pp. 55-787, Jan. 2003.
[20] V. Subramani, R. Kettimuthu, S. Srinivasan, and P. Sadayappan, “Distributed Job Scheduling on Computational Grid Using Multiple Simultaneous Requests,” Proc. 11th IEEE Symp. High Performance Distributed Computing (HPDC '02), July 2002.
[21] M. Arora, S.K. Das, and R. Biswas, “A De-Centralized Scheduling and Load Balancing Algorithm for Heterogeneous Grid Environments,” Proc. Int'l Conf. Parallel Processing Workshops (ICPPW '02), pp. 499-505, 2002.
[22] H.A. James and K.A. Hawick, “Scheduling Independent Tasks on Metacomputing Systems,” Proc. 12th Int'l Conf. Parallel and Distributed Computing Systems (PDCS '99), Mar. 1999.
[23] R. Martin, A. Vahdat, D. Culler, and T. Anderson, “Effects of Communication Latency, Overhead, and Bandwidth in a Cluster Architecture,” Proc. 24th Ann. Int'l Symp. Computer Architecture (ISCA '97), pp. 85-97, June 1997.
[24] M. Mitzenmacher, “How Useful Is Old Information,” IEEE Trans. Parallel and Distributed Systems, vol. 11, no. 1, pp. 6-20, Jan. 2000.

Index Terms:
Grid systems, load balancing, average response time, communication delays, migration
Ruchir Shah, Bhardwaj Veeravalli, Manoj Misra, "On the Design of Adaptive and Decentralized Load Balancing Algorithms with Load Estimation for Computational Grid Environments," IEEE Transactions on Parallel and Distributed Systems, vol. 18, no. 12, pp. 1675-1686, Dec. 2007, doi:10.1109/TPDS.2007.1115
Usage of this product signifies your acceptance of the Terms of Use.