Issue No.07 - July (2009 vol.20)
Edi Shmueli , Haifa University Campus, Haifa, and The Hebrew University of Jerusalem, Jerusalem
Dror G. Feitelson , The Hebrew University of Jerusalem, Jerusalem
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TPDS.2008.152
It is customary to use open-system trace-driven simulations to evaluate the performance of parallel-system schedulers. As a consequence, all schedulers have evolved to optimize the packing of jobs in the schedule, as a means to improve a number of performance metrics that are conjectured to be correlated with user satisfaction, with the premise that this will result in a higher productivity in reality. We argue that these simulations suffer from severe limitations that lead to suboptimal scheduler designs and to even dismissing potentially good design alternatives. We propose an alternative simulation methodology called site-level simulation, in which the workload for the evaluation is generated dynamically by user models that interact with the system. We present a novel scheduler called CREASY that exploits knowledge on user behavior to directly improve user satisfaction and compare its performance to the original packing-based EASY scheduler. We show that user productivity improves by up to 50 percent under the user-aware design, while according to the conventional metrics, performance may actually degrade.
Parallel job scheduling, trace-driven simulations, open-system model, user behavior, feedback.
Edi Shmueli, Dror G. Feitelson, "On Simulation and Design of Parallel-Systems Schedulers: Are We Doing the Right Thing?", IEEE Transactions on Parallel & Distributed Systems, vol.20, no. 7, pp. 983-996, July 2009, doi:10.1109/TPDS.2008.152