2006 15th IEEE International Conference on High Performance Distributed Computing
Task Scheduling and File Replication for Data-Intensive Jobs with Batch-shared I/O
Paris
June 19-June 23
ISBN: 1-4244-0307-3
This paper addresses the problem of efficient execution of a batch of data-intensive tasks with batch-shared I/O behavior, on coupled storage and compute clusters. Two scheduling schemes are proposed: 1) a 0-1 integer programming (IP) based approach, which couples task scheduling and data replication, and 2) a bi-level hypergraph partitioning based heuristic approach (BiPartition), which decouples task scheduling and data replication. The experimental results show that: 1) the IP scheme achieves the best batch execution time, but has significant scheduling overhead, thereby restricting its application to small scale workloads, and 2) the BiPartition scheme is a better fit for larger workloads and systems - it has very low scheduling overhead and no more than 5-10% degradation in solution quality, when compared with the IP based approach
Index Terms:
BiPartition scheme, task scheduling, file replication, data-intensive job, batch-shared I/O behavior, integer programming, bi-level hypergraph partitioning, heuristic approach
Citation:
G. Khanna, N. Vydyanathan, U. Catalyurek, T. Kurc, S. Krishnamoorthy, P. Sadayappan, J. Saltz, "Task Scheduling and File Replication for Data-Intensive Jobs with Batch-shared I/O," hpdc, pp.241-252, 2006 15th IEEE International Conference on High Performance Distributed Computing, 2006