This Article 
 Bibliographic References 
 Add to: 
SEA: A Striping-Based Energy-Aware Strategy for Data Placement in RAID-Structured Storage Systems
June 2008 (vol. 57 no. 6)
pp. 748-761
Many real-world applications need to frequently access data stored on large-scale parallel disk storage systems. On one hand, prompt responses to access requests are essential for these applications. On the other hand, however, with an explosive increase of data volume and the emerging of faster disks with higher power requirements, energy consumption of disk-based storage systems has become a salient issue. To achieve energy-conservation and prompt responses simultaneously, in this paper we propose a novel energy-aware strategy, called striping-based energy-aware (SEA), which can be integrated into data placement in RAID-structured storage systems to noticeably save energy while providing quick responses. Next, to illustrate the effectiveness of SEA, we implement two SEA-powered striping-based data placement algorithms, SEA0 and SEA5, by incorporating the SEA strategy into RAID-0 and RAID-5, respectively. Extensive experimental results demonstrate that compared with traditional non-stripping data placement algorithms, our algorithms significantly improve performance and save energy. Further, compared with an existing stripping-based data placement scheme, the two SEA-powered strategies noticeably reduce energy consumption with only a little performance degradation.

[1] S. Akyürek and K. Salem, “Adaptive Block Rearrangement,” ACM Trans. Computer Systems, vol. 13, no. 2, pp. 89-121, 1995.
[2] L. Breslau, P. Cao, L. Fan, G. Phillips, and S. Shenker, “Web Caching and Zipf-Like Distributions: Evidence and Implications,” Proc. IEEE INFOCOM '99, pp. 126-134, 1999.
[3] A. Brinkmann, K. Salzwedel, and C. Scheideler, “Efficient, Distributed Data Placement Strategies for Storage Area Networks,” Proc. 12th Ann. ACM Symp. Parallel Algorithms and Architectures, pp. 119-128, 2000.
[4] E.V. Carrera, E. Pinheiro, and R. Bianchini, “Conserving Disk Energy in Network Servers,” Proc. 17th Ann. Int'l Conf. Supercomputing, pp. 86-97, 2003.
[5] P.M. Chen, E.K. Lee, G.A. Gibson, R.H. Katz, and D.A. Patterson, “RAID: High-Performance Reliable Secondary Storage,” ACM Computing Surveys, vol. 26, no. 2, pp. 145-185, 1994.
[6] P.M. Chen and D.A. Patterson, “Maximizing Performance in a Striped Disk Array,” Proc. 17th Int'l Symp. Computer Architecture, pp. 322-331, 1990.
[7] P.M. Chen and E.K. Lee, “Striping in a RAID Level 5 Disk Array,” ACM Sigmetrics Performance Evaluation Rev., vol. 23, no. 1, pp. 136-145, 1995.
[8] Y. Cho, M. Winslett, Y. Chen, and S.W. Kuo, “Parallel I/O Performance of Fine Grained Data Distributions,” Proc. Seventh Int'l Symp. High Performance Distributed Computing, pp. 163-170, 1998.
[9] A.L. Couch, N. Wu, and H. Susanto, “Toward a Cost Model for System Administration,” Proc. Usenix 19th Conf. Large Installation System Administration, pp. 125-141, 2005.
[10] C. Cunha, A. Bestavros, and M. Crovella, “Characteristics of WWW Client-Based Traces,” Technical Report 1995-010, Boston Univ., 1995.
[11] C.H.Q. Ding and Y. He, “Data Organization and I/O in a Parallel Ocean Circulation Model,” Proc. 13th Ann. Int'l Conf. Supercomputing, 1999.
[12] W. Dowdy and D. Foster, “Comparative Models of the File Assignment Problem,” ACM Computing Surveys, vol. 14, no. 2, pp.287-313, 1982.
[13] S. Ghandeharizadeh, S.H. Kim, and C. Shababi, “On Disk Scheduling and Data Placement for Video Servers,” Sigmetrics Performance Evaluation, vol. 23, no. 1, pp. 37-46, 1995.
[14] S. Glassman, “A Caching Relay for the World Wide Web,” Proc. First Conf. World Wide Web, pp. 165-173, 1994.
[15] R.L. Graham, “Bounds on Multiprocessing Timing Anomalies,” SIAM J. Applied Math., vol. 7, no. 2, pp. 416-429, 1969.
[16] P. Greenawalt, “Modeling Power Management for Hard Disks,” Proc. Second Int'l Workshop Modeling, Analysis, and Simulation of Computer and Telecomm. Systems, pp. 62-66, Jan. 1994.
[17] S. Gurumurthi, A. Sivasubramaniam, M. Kandemir, and H. Franke, “DRPM: Dynamic Speed Control for Power Management in Server Class Disks,” Proc. 30th Int'l Symp. Computer Architecture, pp. 169-179, June 2003.
[18] “Hitachi Power & Acoustic Management: Quietly Cool,” white paper, Hitachi Corp., Mar. 2004.
[19] H. Huang, W. Hung, and K.G. Shin, “FS2: Dynamic Data Replication in Free Disk Space for Improving Disk Performance and Energy Consumption,” Proc. 12th ACM Symp. Operating Systems Principles, pp. 263-276, 2005.
[20] B. Inmon, “Information Management: Real-Time Decision Support Systems,” DM Rev. Magazine, Aug. 2006.
[21] M. Kandemir, S.W. Son, and G. Chen, “An Evaluation of Code and Data Optimizations in the Context of Disk Power Reduction,” Proc. Int'l Symp. Low-Power Electronics and Design, pp. 209-214, 2005.
[22] T. Kwan, R. Mcgrath, and D. Reed, “Ncsas World Wide Web Server Design and Performance,” Computer, vol. 28, no. 11, pp. 67-74, Nov. 1995.
[23] R. Latham, N. miller, R. Ross, and P. Carns, “A Next-Generation Parallel File System for Linux Clusters: An Introduction to the Second Parallel Virtual File System,” Linux World Magazine, pp.56-59, Jan. 2004.
[24] L.W. Lee, P. Scheuermann, and R. Vingralek, “File Assignment in Parallel I/O Systems with Minimal Variance of Service Time,” IEEE Trans. Computers, vol. 49, no. 2, pp. 127-140, Feb. 2000.
[25] P. Merialdo, P. Atzeni, and G. Mecca, “Design and Development of Data-Intensive Web Sites: The Araneus Approach,” ACM Trans. Internet Technology, vol. 3, no. 1, pp. 49-92, 2003.
[26] M. Narris and J. Obal, “Performance Analysis of the Linux Buffer Cache while Running an Oracle OLTP Workload,” Worcester Polytechnic Inst., Jan. 2002.
[27] N. Nishikawa, T. Hosokawa, Y. Mori, K. Yoshida, and H. Tsuji, “Memory-Based Architecture for Distributed WWW Caching Proxy,” Proc. Seventh Int'l Conf. World Wide Web, pp. 205-214, 1998.
[28] A.E. Papathanasiou and M.L. Scott, “Power-Efficient Server-Class Performance from Arrays of Laptop Disks,” Proc. Usenix Ann. Technical Conf. Work-in-Progress Presentation, 2004.
[29] Z. Peterson, D.E. Long, and S.A. Brandt, “Data Placement Based on Seek Time Analysis of a MEMS-Based Storage Device,” Proc. Conf. File and Storage Technology Work-in-Progress Session, Jan. 2002.
[30] E. Pinheiro and R. Bianchini, “Energy Conservation Techniques for Disk Array-Based Servers,” Proc. 18th Ann. Int'l Conf. Supercomputing, pp. 68-78, June 2004.
[31] “Power, Heat, and Sledgehammer,” white paper, Maximum Inst., PowerHeat20411.pdf , 2002.
[32] X. Ruan, X. Qin, M. Nijim, Z. Zong, and K. Bellam, “An Energy-Efficient Scheduling Algorithm Using Dynamic Voltage Scaling for Parallel Applications on Clusters,” Proc. 16th IEEE Int'l Conf. Computer Comm. and Networks, pp. 735-740, Aug. 2007.
[33] N.J. Sarhan and C.R. Das, “Adaptive Block Rearrangement Algorithms for Video-on-Demand Servers,” Proc. 30th Int'l Conf. Parallel Processing, pp. 452-459, 2001.
[34] S.W. Son, G. Chen, and M. Kandemir, “Disk Layout Optimization for Reducing Energy Consumption,” Proc. 19th Ann. Int'l Conf. Supercomputing, pp. 274-283, 2005.
[35] S.W. Son, G. Chen, M. Kandemir, and A. Choudhary, “Exposing Disk Layout to Compiler for Reducing Energy Consumption of Parallel Disk-Based Systems,” Proc. 10th ACM Symp. Principles and Practice of Parallel Programming, pp. 174-185, 2005.
[36] P. Triantafillou, S. Christodoulakis, and C. Georgiadis, “Optimal Data Placement on Disks: A Comprehensive Solution for Different Technologies,” IEEE Trans. Knowledge and Data Eng., vol. 12, no. 2, pp. 324-330, Feb./Mar. 2000.
[37] T. Xie and Y. Sun, “No More Energy-Performance Trade-Off: A New Data Placement Strategy for RAID-Structured Storage Systems,” Proc. 14th Ann. IEEE Int'l Conf. High-Performance Computing, pp. 35-46, Dec. 2007.
[38] Q. Zhu, Z. Chen, L. Tan, Y. Zhou, K. Keeton, and J. Wilkes, “Hibernator: Helping Disk Arrays Sleep through the Winter,” Proc. 12th ACM Symp. Operating Systems Principles, pp. 177-190, 2005.
[39] Z. Zong, X. Qin, M. Nijim, X. Ruan, K. Bellam, and M. Alghamdi, “Energy-Efficient Scheduling for Parallel Applications Running on Heterogeneous Clusters,” Proc. 36th Int'l Conf. Parallel Processing, 2007.

Index Terms:
Energy-aware systems, Load balancing and task assignment, Real-time distributed, Scheduling and task partitioning, Distributed applications, Real-time and embedded systems, Reliability, availability, and serviceability
Tao Xie, "SEA: A Striping-Based Energy-Aware Strategy for Data Placement in RAID-Structured Storage Systems," IEEE Transactions on Computers, vol. 57, no. 6, pp. 748-761, June 2008, doi:10.1109/TC.2008.27
Usage of this product signifies your acceptance of the Terms of Use.