Seventh International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT'06)
The Failure-rate Aware Scheduling Policies for Large-scale Cluster Systems
Taipei, Taiwan
December 4-7, 2006
ISBN: 0-7695-2736-1
Linping Wu, Chinese Academy of Sciences, China; Graduate School of the Chinese Academy of Sciences, China
Chao Ren, Chinese Academy of Sciences, China; Graduate School of the Chinese Academy of Sciences, China
Dan Meng, Chinese Academy of Sciences, China
Zhan Jianfeng, Chinese Academy of Sciences, China
Bibo Tu, Chinese Academy of Sciences, China
As cluster systems grow in scale, node failures become a major obstacle to using them. Traditional cluster scheduling policies consider factors such as job priority and node load but omit the node failure rate. Job scheduling in a cluster system can be divided into two sub-processes: job selection and node allocation. In this paper, we introduce several scheduling policies that take the node failure rate into account, so that more dependable nodes are selected during node allocation. Finally, we evaluate the policies using discrete event-driven simulation; the results show that the failure-rate aware scheduling policies yield better system performance than a random node allocation policy.
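The contrast the abstract draws between failure-rate aware and random node allocation might be sketched as follows. This is only an illustration, not the authors' policies: the `Node` class, its `failure_rate` field, and both function names are hypothetical, and the abstract does not say how failure rates are estimated or how the several policies differ from one another.

```python
import random
from dataclasses import dataclass


@dataclass
class Node:
    name: str
    failure_rate: float  # assumed: an estimated failure rate, lower is more dependable
    busy: bool = False


def allocate_failure_aware(nodes, count):
    """Pick the `count` idle nodes with the lowest failure rate (illustrative policy)."""
    idle = [n for n in nodes if not n.busy]
    if len(idle) < count:
        return None  # not enough idle nodes; the job would have to wait
    chosen = sorted(idle, key=lambda n: n.failure_rate)[:count]
    for n in chosen:
        n.busy = True
    return chosen


def allocate_random(nodes, count, rng=random):
    """Baseline: pick `count` idle nodes uniformly at random."""
    idle = [n for n in nodes if not n.busy]
    if len(idle) < count:
        return None
    chosen = rng.sample(idle, count)
    for n in chosen:
        n.busy = True
    return chosen
```

Under these assumptions, the job selection step would decide which queued job runs next, and one of the functions above would then choose its nodes.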
Citation:
Linping Wu, Chao Ren, Dan Meng, Zhan Jianfeng, Bibo Tu, "The Failure-rate Aware Scheduling Policies for Large-scale Cluster Systems," pdcat, pp.364-367, Seventh International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT'06), 2006