The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.08 - Aug. (2013 vol.24)
pp: 1556-1566
Samer Al-Kiswany , The University of British Columbia, Vancouver
Abdullah Gharaibeh , The University of British Columbia, Vancouver
Matei Ripeanu , The University of British Columbia, Vancouver
ABSTRACT
Massively multicore processors, such as graphics processing units (GPUs), provide, at a comparable price, a one order of magnitude higher peak performance than traditional CPUs. This drop in the cost of computation, as any order-of-magnitude drop in the cost per unit of performance for a class of system components, triggers the opportunity to redesign systems and to explore new ways to engineer them to recalibrate the cost-to-performance relation. This project explores the feasibility of harnessing GPUs' computational power to improve the performance, reliability, or security of distributed storage systems. In this context, we present the design of a storage system prototype that uses GPU offloading to accelerate a number of computationally intensive primitives based on hashing, and introduce techniques to efficiently leverage the processing power of GPUs. We evaluate the performance of this prototype under two configurations: as a content addressable storage system that facilitates online similarity detection between successive versions of the same file and as a traditional system that uses hashing to preserve data integrity. Further, we evaluate the impact of offloading to the GPU on competing applications' performance. Our results show that this technique can bring tangible performance gains without negatively impacting the performance of concurrently running applications.
INDEX TERMS
Graphics processing unit, Memory management, Instruction sets, Acceleration, Parallel processing, Resource management, Prototypes, graphics processing units (GPUs), Graphics processing unit, Memory management, Instruction sets, Acceleration, Parallel processing, Resource management, Prototypes, content addressable storage, Storage system design, massively parallel processors
CITATION
Samer Al-Kiswany, Abdullah Gharaibeh, Matei Ripeanu, "GPUs as Storage System Accelerators", IEEE Transactions on Parallel & Distributed Systems, vol.24, no. 8, pp. 1556-1566, Aug. 2013, doi:10.1109/TPDS.2012.239
REFERENCES
[1] S. Quinlan and S. Dorward, "VENTI: A New Approach to Archival Data Storage," Proc. USENIX Conf. File and Storage Technologies (FAST), 2002.
[2] S.C. Rhea, R. Cox, and A. Pesterev, "Fast, Inexpensive Content-Addressed Storage in Foundation," Proc. USENIX Ann. Technical Conf., 2008.
[3] A. Muthitacharoen, B. Chen, and D. Mazieres, "A Low-Bandwidth Network File System," Proc. Symp. Operating Systems Principles (SOSP), 2001.
[4] C. Rossbach et al., "PTask: Operating System Abstractions to Manage GPUs as Compute Devices," Proc. 23rd ACM Symp. Operating Systems Principles (SOSP), 2011.
[5] P. Bhatotia, R. Rodrigues, and A. Verma, "Shredder: GPU-Accelerated Incremental Storage and Computation," Proc. USENIX Conf. File and Storage Technologies (FAST), 2012.
[6] NVIDIA CUDA Compute Unified Device Architecture: Programming Guide v0.8, 2008.
[7] S. Al-Kiswany et al., "StoreGPU: Exploiting Graphics Processing Units to Accelerate Distributed Storage Systems," Proc. Int'l Symp. High Performance Distributed Computing (HPDC), 2008.
[8] S. Al-Kiswany, A. Gharaibeh, S. Gopalakrishnan, and M. Ripeanu, "A GPU Accelerated Storage System," Proc. Int'l Symp. High Performance Distributed Computing (HPDC), 2010.
[9] B.-G. Chun et al., "Efficient Replica Maintenance for Distributed Storage Systems," Proc. Third USENIX Symp. Networked Systems Design and Implementation (NSDI), 2006.
[10] L. Costa, S. Al-Kiswany, and M. Ripeanu, "GPU Support for Batch Oriented Workloads," Proc. IEEE Int'l Performance Computing and Comm. Conf. (IPCCC), 2009.
[11] "GZip Compression Tool," http:/www.gzip.org/, 2011.
[12] B. Zhu, K. Li, and H. Patterson, "Avoiding the Disk Bottleneck in the Data Domain Deduplication File System," Proc. USENIX Conf. File and Storage Technologies (FAST), 2008.
[13] D. Dabiri and I.F. Blake, "Fast Parallel Algorithms for Decoding Reed-Solomon Codes Based on Remainder Polynomials," IEEE Trans. Information Theory, vol. 41, no. 4, pp. 873-885, 1995.
[14] E.B. Nightingale, D. Peek, P.M. Chen, and J. Flinn, "Parallelizing Security Checks on Commodity Hardware," Proc. Int'l Conf. Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2008.
[15] R. Datta, D. Joshi, J. Li, and J.Z. Wang, "Image Retrieval: Ideas, Influences, and Trends of the New Age," ACM Computing Surveys, vol. 40, no. 2, pp. 1-60, 2008.
[16] L. Seiler et al., "Larrabee: A Many-Core x86 Architecture for Visual Computing," IEEE Micro, vol. 29, no. 1, pp. 10-21, Jan./Feb. 2009.
[17] M.D. Hill and M.R. Marty, "Amdahl's Law in the Multicore Era," IEEE Computer, vol. 41, no. 7, pp. 33-38, July 2008.
[18] M. Polte, J. Simsa, and G. Gibson, "Comparing Performance of Solid State Devices and Mechanical Disks," Proc. Petascale Data Storage Workshop, 2008.
[19] M.L. Curry, A. Skjellum, H.L. Ward, and R. Brightwell, "Accelerating Reed-Solomon Coding in RAID Systems with GPUs," Proc. IEEE Int'l Parallel and Distributed Processing Symp. (IPDPS), 2008.
[20] S. Ghemawat, H. Gobioff, and S.-T. Leung, "The Google File System," Proc. 19th ACM Symp. Operating Systems Principles, 2003.
[21] K. Eshghi et al., "JumboStore: Providing Efficient Incremental Upload and Versioning for a Utility Rendering Service," Proc. USENIX Conf. File and Storage Technologies (FAST), 2007.
[22] S. Ryoo et al., "Optimization Principles and Application Performance Evaluation of a Multithreaded GPU Using CUDA," Proc. ACM SIGPLAN Symp. Principles and Practice of Parallel Programming, 2008.
[23] S. Al-Kiswany, A. Gharaibeh, and M. Ripeanu, "The Case for Versatile Storage System," Proc. Workshop Hot Topics in Storage and File Systems (HotStorage), 2009.
[24] I. Stoica et al., "Chord: A Scalable Peer-to-Peer Lookup Service for Internet Applications," Proc. ACM SIGCOMM, 2001.
[25] R. Merkle, "A Certified Digital Signature," Proc. Advances in Cryptology Conf. (CRYPTO), 1989.
[26] I. Damgard, "A Design Principle for Hash Functions," Proc. Advances in Cryptology Conf. (CRYPTO), 1989.
[27] A.R. Yumerefendi and J.S. Chase, "Strong Accountability for Network Storage," Proc. USENIX Conf. File and Storage Technologies (FAST '07), 2007.
[28] S.F. Altschul et al., "Basic Local Alignment Search Tool," Molecular Biology, vol. 215, pp. 403-410, 1990.
[29] P.H. Hargrove and J.C. Duell, "Berkeley Lab Checkpoint/Restart (BLCR) for Linux Clusters," Proc. SciDAC, 2006.
[30] L.P. Cox and B.D. Noble, "Samsara: Honor among Thieves in Peer-to-Peer Storage," Proc. ACM 19th Symp. Operating Systems Principles (SOSP), 2003.
[31] R. Kotla, L. Alvisi, and M. Dahlin, "SafeStore: A Durable and Practical Storage System," Proc. USENIX Ann. Technical Conf., 2007.
[32] G. DeCandia et al., "Dynamo: Amazon's Highly Available Key-Value Store," Proc. ACM Symp. Operating Systems Principles (SOSP), 2007.
[33] F. Dabek et al., "Wide-Area Cooperative Storage with CFS," Proc. 18th ACM Symp. Operating Systems Principles (SOSP '01), 2001.
[34] rsync Files Synchronization Tool, http://www.samba.orgrsync/, 2009.
[35] S. Brin, J. Davis, and H. Garcia-Molina, "Copy Detection Mechanisms for Digital Documents," Proc. ACM SIGMOD Int'l Conf. Management of Data, 1995.
[36] S. Schleimer, D.S. Wilkerson, and A. Aiken, "Winnowing: Local Algorithms for Document Fingerprinting," Proc. ACM SIGMOD Int'l Conf. Management of Data, 2003.
[37] B. Bloom, "Space/Time Trade-Offs in Hash Coding with Allowable Errors," Comm. ACM, vol. 13, no. 7, pp. 422-426, 1970.
[38] J.D. Owens et al., "A Survey of General-Purpose Computation on Graphics Hardware," Computer Graphics Forum, vol. 26, no. 1, pp. 80-113, 2007.
[39] I.S. Reed and G. Solomon, "Polynomial Codes over Certain Finite Fields," J. Soc. for Industrial and Applied Math., vol. 8, no. 2, pp. 300-304, 1960.
[40] G. Falcao, L. Sousa, and V. Silva, "Massive Parallel LDPC Decoding on GPU," Proc. ACM SIGPLAN Symp. Principles and Practice of Parallel Programming (PPoPP), 2008.
[41] O. Harrison and J. Waldron, "Practical Symmetric Key Cryptography on Modern Graphics Hardware," Proc. USENIX Security Symp., 2008.
[42] A. Moss, D. Page, and N. Smart, "Toward Acceleration of RSA Using 3D Graphics Hardware," Proc. 11th IMA Int'l Conf. Cryptography and Coding, 2007.
[43] Kaspersky Antivirus, http:/www.kaspersky.com/, 2008.
[44] Elcomsoft Password Recovery Software, http:/www. elcomsoft.com, 2008.
32 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool