The Community for Technology Leaders
Green Image
<p><b>Abstract</b>—Evaluating and analyzing the performance of a parallel application on an architecture to explain the disparity between projected and delivered performance is an important aspect of parallel systems research. However, conducting such a study is hard due to the vast design space of these systems. In this paper, we study two important aspects related to the performance of parallel applications on shared memory parallel architectures. Fist, we quantify <it>overheads</it> observed during the execution of these applications on three different simulated architectures. We next use these results to synthesize the bandwidth requirements for the applications with respect to different network topologies. This study is performed using an execution-driven simulation tool called SPASM, which provides a way of isolating and quantifying the different parallel system overheads in a nonintrusive manner. The first exercise shows that in shared memory machines with private caches, as long as the applications are well-structured to exploit locality, the key determinant that impacts performance is network connection. The second exercise quantifies the network bandwidth needed to minimize the effect of network connection. Specifically, it is shown that for the applications considered, as long as the problem sizes are increased commensurate with the system size, current network technologies supporting 200-300 MBytes/sec link bandwidth are sufficient to keep the network overheads (such as the latency and contention) within acceptable bounds.</p>
Parallel systems overheads, execution-driven simulation, interconnection network, application-driven study.

A. Sivasubramaniam, U. Ramachandran, H. Venkateswaran and A. Singla, "An Application-Driven Study of Parallel System Overheads and Network Bandwidth Requirements," in IEEE Transactions on Parallel & Distributed Systems, vol. 10, no. , pp. 193-210, 1999.
94 ms
(Ver 3.3 (11022016))