2017 IEEE International Conference on Cloud Computing Technology and Science (CloudCom) (2017)
Hong Kong, Hong Kong
Dec. 11, 2017 to Dec. 14, 2017
Data-intensive applications usually need to deal with huge volumes of data within their deadlines. These applications can be modelled as DAGs and require parallel computation frameworks such as MapReduce and Spark to enhance the performance. The network communication has a crucial impact on the performance of an application. Coflow is intended to address the application-specific network level Quality-of-Service (QoS) requirements in cloud-based data centres. However, existing works mainly focus on scheduling coflows in a single stage. How to schedule coflows in multi-stage applications (represented as DAGs) remains to be an open problem. In this paper we study the problem of scheduling coflows in a DAG to meet its deadline requirement. Single stage coflow scheduling has been proven to be NP-hard. Multiple stages in a DAG make our problem even more complex. Owing to the complexity of the problem, we propose a genetic algorithm-based method for solving the problem. The effectiveness of our solution is verified through numerical evaluation. Experimental results show that our solution can effectively guarantee the deadline of the DAGs compared with existing single stage coflow scheduling algorithms.
Processor scheduling, Computational modeling, Genetic algorithms, Job shop scheduling, Sociology, Statistics, Ports (Computers)
J. Wang, H. Zhou, Y. Hu, C. d. Laat and Z. Zhao, "Deadline-Aware Coflow Scheduling in a DAG," 2017 IEEE International Conference on Cloud Computing Technology and Science (CloudCom), Hong Kong, Hong Kong, 2017, pp. 341-346.