SC16: International Conference for High Performance Computing, Networking, Storage and Analysis (SC) (2016)
Salt Lake City, Utah, USA
Nov. 13, 2016 to Nov. 18, 2016
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/SC.2016.55
Modern schedulers running on HPC systems traditionally consider the number of resources and the time requested for each job that is to be executed when making scheduling decisions. Until recently this has been sufficient, however as systems get larger, other metrics like power consumption become necessary to ensure system stability. In this paper, we propose a data driven scheduling approach for controlling the power consumption of the entire system under any user defined budget. Here, “data driven” means that our approach actively observes, analyzes, and assesses power behaviors of the system and user jobs to guide scheduling decisions for power management. This design is based on the key observation that HPC jobs have distinct power profiles. Our work contains an empirical analysis of workload power characteristics on a production system, dynamic learner to estimate the job power profile for scheduling, and an online power-aware scheduler for managing the overall system power. Using real workload traces, we demonstrate that our design effectively controls system power consumption while minimizing the impact on system utilization.
Power demand, Monitoring, Dynamic scheduling, Resource management, Job shop scheduling, Control systems
S. Wallace et al., "A Data Driven Scheduling Approach for Power Management on HPC Systems," SC16: International Conference for High Performance Computing, Networking, Storage and Analysis(SC), Salt Lake City, UT, USA, 2016, pp. 656-666.