Quantitative Characterization and Analysis of the I/O Behavior of a Commercial Distributed-Shared-Memory Machine
Issue No. 05 - May (2000 vol. 11)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/71.852403
<p><b>Abstract</b>—This paper presents a unified evaluation of the I/O behavior of a commercial clustered DSM machine, the HP Exemplar. Our study has the following objectives: 1) To evaluate the impact of different interacting system components, namely, architecture, operating system, and programming model, on the overall I/O behavior and identify possible performance bottlenecks, and 2) To provide hints to the users for achieving high out-of-box I/O throughput. We find that for the DSM machines that are built as a cluster of SMP nodes, <b>integrated clustering</b> of computing and I/O resources, both hardware and software, is not advantageous for two reasons. First, within an SMP node, the I/O bandwidth is often restricted by the performance of the peripheral components and cannot match the memory bandwidth. Second, since the I/O resources are shared as a global resource, the file-access costs become nonuniform and the I/O behavior of the entire system, in terms of both <b>scalability</b> and <b>balance</b>, degrades.</p><p>We observe that the buffered I/O performance is determined not only by the I/O subsystem, but also by the programming model, global-shared memory subsystem, and data-communication mechanism. Moreover, programming-model support can be used effectively to overcome the performance constraints created by the architecture and operating system. For example, on the HP Exemplar, users can achieve high I/O throughput by using features of the programming model that balance the <b>sharing</b> and <b>locality</b> of the user buffers and file systems. Finally, we believe that at present, the I/O subsystems are being designed in isolation, and there is a need for mending the traditional <it>memory-oriented</it> design approach to address this problem.</p>
Input-output, operating/file systems, distributed-shared-memory architecture, clustered computing, performance evaluation, shared-memory programming.
Rajesh R. Bordawekar, "Quantitative Characterization and Analysis of the I/O Behavior of a Commercial Distributed-Shared-Memory Machine", IEEE Transactions on Parallel & Distributed Systems, vol. 11, no. , pp. 509-526, May 2000, doi:10.1109/71.852403