12th IEEE International Symposium on High Performance Distributed Computing (HPDC-12 '03)
Pipeline and Batch Sharing in Grid Workloads
Seattle, Washington
June 22-June 24
ISBN: 0-7695-1965-2
We present a study of six batch-pipelined scientific workloads that are candidates for execution on computational grids. Whereas other studies focus on the behavior of single applications, this study characterizes workloads composed of pipelines of sequential processes that use file storage for communication and also share significant data across a batch. This study includes measurements of the memory, CPU, and I/O requirements of individual components as well as analyses of I/O sharing within complete batches. We conclude with a discussion of the ramifications of these workloads for end-to-end scalability and overall system design.
Citation:
Douglas Thain, John Bent, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau, Miron Livny, "Pipeline and Batch Sharing in Grid Workloads," hpdc, pp.152, 12th IEEE International Symposium on High Performance Distributed Computing (HPDC-12 '03), 2003