This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Comparing Latency-Tolerance Techniques for Software DSM Systems
November 2003 (vol. 14 no. 11)
pp. 1180-1190
Ricardo Bianchini, IEEE Computer Society

Abstract—This paper studies the isolated and combined effects of several latency-tolerance techniques for software-based distributed shared-memory systems (software DSMs). More specifically, we focus on data prefetching, update-based coherence, and single-writer optimizations for page-based software DSMs. Our experimental results with six parallel applications show that, when these techniques are carefully combined, they can provide running time and speedup improvements of up to 54 percent and 110 percent, respectively, on a cluster of eight PCs.

[1] C. Amza, A. Cox, S. Dwarkadas, and W. Zwaenepoel, Software DSM Protocols that Adapt between Single Writer and Multiple Writer Proc. Third Int'l Symp. High-Performance Computer Architecture, Feb. 1997.
[2] C. Amza, A.L. Cox, S. Dwarkadas, L.-J. Jin, K. Rajamani, and W. Zwaenepoel, “Adaptive Protocols for Software Distributed Shared Memory,” Proc. IEEE, Special Issue on Distributed Shared Memory, Mar. 1999.
[3] C. Amza, A.L. Cox, S. Dwarkadas, P. Keleher, H. Lu, R. Rajamony, W. Yu, and W. Zwaenepoel, “TreadMarks: Shared Memory Computing on Networks of Workstations,” Computer, vol. 29, no. 2, Feb. 1996.
[4] R. Bianchini, L.I. Kontothanassis, R. Pinto, M. De Maria, M. Abud, and C.L. Amorim, Hiding Communication Latency and Coherence Overhead in Software DSMs Proc. Seventh Int'l Conf. Architectural Support for Programming Languages and Operating Systems, Oct. 1996.
[5] R. Bianchini, R. Pinto, and C.L. Amorim, Data Prefetching for Software DSMs Proc. Int'l Conf. Supercomputing, July 1998.
[6] R. Bianchini, R. Pinto, and C.L. Amorim, Page Fault Behavior and Prefetching in Software DSMs Technical Report ES-401/96, COPPE Systems Eng., Federal Univ. of Rio de Janeiro, July 1996.
[7] A. Bilas, D. Jiang, and J.P. Singh, Accelerating Shared Virtual Memory via General-Purpose Network Interface Support ACM Trans. Computer Systems, vol. 19, pp. 1-35, Feb. 2001.
[8] M. Castro and C. de Amorim, Efficient Categorization of Memory Sharing Patterns in Software DSM Systems Proc. 15th Int'l Parallel and Distributed Processing Symp., Apr. 2001.
[9] A.L. Cox, S. Dwarkadas, P. Keleher, H. Lu, R. Rajamony, and W. Zwaenepoel, Software versus Hardware Shared-Memory Implementation: A Case Study Proc. 21st Ann. Int'l Symp. Computer Architecture, Apr. 1994.
[10] L. Iftode, C. Dubnicki, E.W. Felten, and K. Li, Improving Release-Consistent Shared Virtual Memory Using Automatic Update Proc. Second IEEE Symp. High-Performance Computer Architecture, Feb. 1996.
[11] P. Keleher, Update Protocols and Cluster-Based Shared Memory Computer Comm., vol. 22, pp. 1045-1055, July 1999.
[12] L. Kontothanassis, G. Hunt, R. Stets, N. Hardavellas, M. Cierniak, S. Parthasarathy, W. Meira, S. Dwarkadas, and M. Scott, VM-Based Shared Memory on Low-Latency Remote-Memory-Access Networks Proc. Int'l Symp. Computer Architecture, June 1997.
[13] L. Monnerat and R. Bianchini, Efficiently Adapting to Sharing Patterns in Software DSMs Proc. Fourth IEEE Symp. High-Performance Computer Architecture, Feb. 1998.
[14] T. Mowry, C. Chan, and A. Lo, Comparative Evaluation of Latency Tolerance Techniques for Software Distributed Shared Memory Proc. Fourth IEEE Symp. High-Performance Computer Architecture, Feb. 1998.
[15] M. Rangarajan and L. Iftode, Software Distributed Shared Memory over Virtual Interface Architecture: Implementation and Performance Proc. Third Extreme Linux Workshop, Oct. 2000.
[16] C.B. Seidel, R. Bianchini, and C.L. Amorim, The Affinity Entry Consistency Protocol Proc. 1997 Int'l Conf. Parallel Processing, Aug. 1997.
[17] R. Stets, S. Dwarkadas, N. Hardavellas, G. Hunt, L. Kontothanassis, S. Parthasarathy, and M. Scott, Cashmere-2L: Software Coherent Shared Memory on a Clustered Remote-Write Network Proc. 16th ACM Symp. Operating Systems Principles, Oct. 1997.
[18] R. Stets, S. Dwarkadas, L. Kontothanassis, U. Rencuzogullari, and M.L. Scott, The Effect of Network Total Order and Remote-Write Capability on Network-based Shared Memory Computing Proc. Sixth IEEE Symp. High-Performance Computer Architecture, Jan. 2000.
[19] K. Thitikamol and P. Keleher, Per-Node Multi-Threading and Remote Latency IEEE Trans. Computers, vol. 47, no. 4, pp. 414-426, Apr. 1998.
[20] L. Whately, R. Pinto, M. Rangarajan, L. Iftode, R. Bianchini, and C.L. Amorim, Adaptive Techniques for Home-Based Software DSMs Proc. 13th Symp. Computer Architecture and High Performance Computing, pp. 164-171, Sept. 2001.
[21] W. Hu, W. Shi, and Z. Tang, Home Migration in Home-Based Software DSMs Proc. First Workshop Software Distributed Shared Memory, June 1999.
[22] S. Wiel and D. Lilja, When Caches Aren't Enough: Data Prefetching Techniques Computer, vol. 30, no. 7, pp. 23-30, July 1997.
[23] S. Woo, M. Ohara, E. Torrie, J.P. Singh, and A. Gupta, “The SPLASH-2 Programs: Characterization and Methodological Considerations,” Proc. Int'l Symp. Computer Architecture, pp. 24-36, June 1995.

Index Terms:
Distributed systems, performance.
Citation:
Raquel Pinto, Ricardo Bianchini, Claudio L. Amorim, "Comparing Latency-Tolerance Techniques for Software DSM Systems," IEEE Transactions on Parallel and Distributed Systems, vol. 14, no. 11, pp. 1180-1190, Nov. 2003, doi:10.1109/TPDS.2003.1247677
Usage of this product signifies your acceptance of the Terms of Use.