loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
12th International Conference on Parallel and Distributed Systems - Volume 1 (ICPADS'06)
The Impact of Information Availability and Workload Characteristics on the Performance of Job Co-allocation in Multi-clusters
Minneapolis, Minnesota
July 12-July 15
ISBN: 0-7695-2612-8
William M. Jones, Clemson University, USA
Walter B. Ligon III, Clemson University, USA
Nishant Shrivastava, Clemson University, USA
In this paper, we utilize a bandwidth-centric job communication model that captures the interaction and impact of simultaneously co-allocating jobs across multiple clusters. We make use of a parallel job model that seeks to capture both local and global communication access patterns. By doing so, we are able to explore scheduling strategies that attempt to improve average job turnaround time by selectively mapping jobs across cluster boundaries in a process known as job co-allocation.

In this research, we focus on scheduling strategies that make use of available information such as network link utilization, per-processor bandwidths, and job communication topology in order to make intelligent decisions regarding application partition sizes and job placement. We provide results that help to establish the relationship between the quantity of information available a priori to the scheduler and its ability to improve overall system performance. Additionally, we demonstrate the dramatic impact that salient workload characteristics can have on the effectiveness of co-allocation.

Index Terms:
parallel job scheduling, multiple computational clusters, workload effects, network contention, multisite scheduling
Citation:
William M. Jones, Walter B. Ligon III, Nishant Shrivastava, "The Impact of Information Availability and Workload Characteristics on the Performance of Job Co-allocation in Multi-clusters," icpads, vol. 1, pp.123-134, 12th International Conference on Parallel and Distributed Systems - Volume 1 (ICPADS'06), 2006
Usage of this product signifies your acceptance of the Terms of Use.