Issues and Challenges in the Performance Analysis of Real Disk Arrays
June 2004 (vol. 15 no. 6)
pp. 559-574

Abstract: The performance modeling and analysis of disk arrays is challenging due to the presence of multiple disks, large array caches, and sophisticated array controllers. Moreover, storage manufacturers may not reveal the internal algorithms implemented in their devices, so real disk arrays are effectively black boxes. We use standard performance techniques to develop an integrated performance model that incorporates some of the complexities of real disk arrays. We show how measurement data and baseline performance models can be used to extract information about the various features implemented in a disk array. In this process, we identify areas for future research in the performance analysis of real disk arrays.


Index Terms:
RAID, analytical performance model, array cache, parallel I/O, enterprise storage systems, I/O performance evaluation, disk array.
Citation:
Elizabeth Varki, Arif Merchant, Jianzhang Xu, Xiaozhou Qiu, "Issues and Challenges in the Performance Analysis of Real Disk Arrays," IEEE Transactions on Parallel and Distributed Systems, vol. 15, no. 6, pp. 559-574, June 2004, doi:10.1109/TPDS.2004.9