This Article 
 Bibliographic References 
 Add to: 
Stochastic Models for Performance Analysis of Database Recovery Control
April 1995 (vol. 44 no. 4)
pp. 561-576

Abstract—In this paper we develop three analytical models for a comprehensive analysis of database recovery. These models, based on semi-Markov stochastic analysis and queueing networks, not only capture the details of modern recovery mechanisms, but take the complex stochastic behavior of the system into account. Furthermore, we use multiple performance measures to analyze different recovery mechanisms, the impact of environment characteristics and the effect of tunable system parameters, thus offering database designers and administrators a better understanding of the recovery system to be designed or managed. A special case of database recovery that has been studied by previous researchers is analyzed in detail; numerical experiments offer evidence of the effectiveness of our approach. The models developed in this paper, however, are applicable to much more general systems and environments.

[1] R. Agrawal and D.J. DeWitt, “Integrated Concurrency Control and Recovery Mechanisms: Design and Performance Evaluation,” ACM Trans. Database Systems, vol. 10, no. 4, pp. 529-564, 1985.
[2] P. Bernstein, V. Hadzilacos, and N. Goodman, Concurrency Control and Recovery in Database Systems. Addison-Wesley, 1987.
[3] R.C. Brooks,“An approach to high availability in high transaction rate systems,” IBM Systems J., vol. 24, no. 3/4, pp. 279-293, 1985.
[4] M. Burman,“Aspects of a high-volume production on-line banking system,” IEEE Compcon, Spring 1985.
[5] I.R. Casas and K.C. Sevcik,“A buffer management model for use in predicting overall database system performance,” Proc. IEEE Conf. Data Eng., pp. 463-469, 1989.
[6] K.M. Chandy,J.C. Browne,C.W. Dissly,, and W.R. Uhrig,“Analytic models for rollback and recovery strategies,” IEEE Trans. Software Eng., vol. 1, March, pp. 100-110, 1975.
[7] E.G. Coffman and M. Hofri,“Queueing models of secondary storage devices,” Stochastic Analysis of Computer and Comm. Systems, H. Takagi, ed. New York: Elsevier, 1990.
[8] E. de Souza e Silva and H.R. Gail, “Calculating Availability and Performability Measures of Repairable Computer Systems Using Randomization,” J. ACM, vol. 36, pp. 171-193, Jan. 1989.
[9] E. de Souza e Silva and R.R. Muntz,“Queueing networks: solutions and applications,” Stochastic Analysis of Computer and Comm. Systems, H. Takagi, ed. New York: Elsevier, 1990.
[10] K. Elhardt and R. Bayer,“A database cache for high performance and fast restart in database systems,” ACM Trans. Database Systems, vol. 9, no. 4, pp.503-525, Dec. 1984.
[11] B. Gavish and U. Sumita,“Analysis of channel and disk subsystems in computer systems,” Queueing Systems: Theory and Applications, vol. 3, no. 1, pp. 1-23, 1988.
[12] D. Gawlick,“Processing’hot spots’in high performance systems,” IEEE Compcon, Spring 1985.
[13] E. Gelenbe, "On the Optimum Checkpoint Interval," J. ACM, vol. 26, pp. 259-270, Apr. 1979.
[14] E. Gelenbe and D. Derochette, "Performance of Rollback Recovery Systems Under Intermittent Failures," Comm. ACM, vol. 21, pp. 493-499, June 1978.
[15] D. Gifford and A. Spector,“The TWA reservation system,” Comm. ACM, vol. 27, no. 7, July 1984.
[16] P.B. Goes,“Performance of recovery control for database systems: A hierarchical analysis,” PhD Thesis, William E. Simon Graduate School of BusinessAdministration, Univ. of Rochester, 1991.
[17] P.B. Goes,“A stochastic model for performance evaluation of main memory resident database systems,” ORSA J. Computing, to appear.
[18] P.B. Goes and U. Sumita,“A closed queueing network model to evaluate the impact of recovery operations on the performance of database servers,” Working Paper, Univ. of Connecticut, 1994.
[19] V. Grassi,L. Donatiello,, and S. Tucci,“On the optimal checkpointing of critical tasks and transaction-oriented systems,” IEEE Trans. Software Eng., vol. 18, no. 1, pp. 72-77, 1992.
[20] J. Gray,“Notes on database operating systems,” Technical Report RJ 3120. San Jose, CA: IBM Research Center, 1978.
[21] T. Haerder and A. Reuter,“Principles of transaction-oriented database recovery,” ACM Computing Surveys, vol. 15, no. 4, pp. 287-317, Dec. 1983.
[22] IBM, IMS/VS,Version 1, Fast Path Feature, General Information Manual, 1984.
[23] IBM, Database 2 Version 2 General Information, 1988.
[24] IBM, IBM 3390 Direct Access Storage Introduction, 1989.
[25] R. Jain,The Art of Computer Systems Performance Analysis.New York: John Wiley and Sons, 1991.
[26] J. Kent,H. Garcia-Molina,, and J. Chung,“An experimental evaluation of crash recovery mechanisms,” 4th ACM Sigact-Sigmod Symp. Principles Database Systems, 1985.
[27] S.S. Lavenberg, Computer Performance Modeling Handbook. Academic Press, 1983.
[28] E.D. Lazowska, J. Zahorjan, G.S. Graham, and K.C. Sevcik, Quantitative System Performance, Prentice Hall, pp 64-66, 1984.
[29] P. L’Ecuyer and J. Malenfant,“Computing optimal checkpointing strategies for rollback and recovery systems,” IEEE Trans. Computers, vol. 37, no. 4, pp. 491-496, 1988.
[30] A. Reuter,“Performance analysis of recovery techniques,” ACM Trans. on Database Systems, vol. 9, no. 4, pp. 526-559, Dec. 1984.
[31] Schweitzer, P.J.,“Approximate analysis of multiclass closed network of queues,” Proc. Int’l Conf. on Stochastic Control and Optimization, Free Univ., Amsterdam, pp. 5-16, 1979.
[32] A. Silberschatz,M. Stonebraker, and J. D. Ullman (1991), “, Database Systems: Achievements and Opportunities,” Communications of the ACM, vol. 34, no. 10, pp. 110-120.
[33] M. Stonebraker, “Operating System Support for Database Management,” Comm. ACM, vol. 24, no. 7, pp. 412–418, July 1981.
[34] M. Stonebraker, "The Design of the Postgres Storage System," Proc. VLDB Conf., pp. 289-300, 1987.
[35] U. Sumita,N. Kaio,, and P.B. Goes,“Analysis of effective service Time with age dependent interruptions and its application to optimal rollback Policyfor database management,” Queueing Systems: Theory and Applications, vol. 4, pp. 193-212, 1989.
[36] A.N. Tantawi and M. Ruschitzka, "Performance Analysis of Checkpointing Strategies," ACM Trans. Computer Systems, vol. 2, pp. 123-144, May 1984.
[37] J.W. Young, "A First Order Approximation to the Optimum Checkpoint Interval," Comm. ACM, vol. 17, pp. 530-531, Sept. 1974.

Index Terms:
Database recovery, performance evaluation, semi-Markov models, database management, availability, checkpointing, stochastic analysis, queueing networks.
Ushio Sumita, Paulo B. Goes, "Stochastic Models for Performance Analysis of Database Recovery Control," IEEE Transactions on Computers, vol. 44, no. 4, pp. 561-576, April 1995, doi:10.1109/12.376170
Usage of this product signifies your acceptance of the Terms of Use.