The Community for Technology Leaders
RSS Icon
Issue No.06 - June (2013 vol.62)
pp: 1072-1085
Dario Bruneo , Università di Messina, Messina
Salvatore Distefano , Politecnico di Milano, Milano
Francesco Longo , Università di Messina, Messina
Antonio Puliafito , Universitàdi Messina, Messina
Marco Scarpa , Università di Messina, Messina
Cloud computing is a promising paradigm able to rationalize the use of hardware resources by means of virtualization. Virtualization allows to instantiate one or more virtual machines (VMs) on top of a single physical machine managed by a virtual machine monitor (VMM). Similarly to any other software, a VMM experiences aging and failures. Software rejuvenation is a proactive fault management technique that involves terminating an application, cleaning up the system internal state, and restarting it to prevent the occurrence of future failures. In this work, we propose a technique to model and evaluate the VMM aging process and to investigate the optimal rejuvenation policy that maximizes the VMM availability under variable workload conditions. Starting from dynamic reliability theory and adopting symbolic algebraic techniques, we investigate and compare existing time-based VMM rejuvenation policies. We also propose a time-based policy that adapts the rejuvenation timer to the VMM workload condition improving the system availability. The effectiveness of the proposed modeling technique is demonstrated through a numerical example based on a case study taken from the literature.
Software, Aging, Availability, Clocks, Computational modeling, Stochastic processes, Degradation, Kronecker algebra, Time-based rejuvenation, cloud computing, dynamic availability, phase type distributions
Dario Bruneo, Salvatore Distefano, Francesco Longo, Antonio Puliafito, Marco Scarpa, "Workload-Based Software Rejuvenation in Cloud Systems", IEEE Transactions on Computers, vol.62, no. 6, pp. 1072-1085, June 2013, doi:10.1109/TC.2013.30
[1] L. Bittencourt, C. Senna, and E. Madeira, "Scheduling Service Workflows for Cost Optimization in Hybrid Clouds," Proc. Int'l Conf. Network and Service Management, pp. 394-397, 2010.
[2] S. Pearson and A. Benameur, "Privacy, Security and Trust Issues Arising from Cloud Computing," Proc. IEEE second Int'l Conf. Cloud Computing Technology and Science, pp. 693-702, 2010.
[3] R. Ghosh, K. Trivedi, V. Naik, and D.S. Kim, "End-to-End Performability Analysis for Infrastructure-as-a-Service Cloud: An Interacting Stochastic Models Approach," Proc. IEEE 16th Pacific Rim Int'l Symp. Dependable Computing, pp. 125-132, 2010.
[4] S. Distefano, F. Longo, and M. Scarpa, "Availability Assessment of HA Standby Redundant Clusters," Proc. IEEE 29th Symp. Reliable Distributed Systems, pp. 265-274, 2010.
[5] M. Grottke and K.S. Trivedi, "Fighting Bugs: Remove, Retry, Replicate, and Rejuvenate," Computer, vol. 40, no. 2, pp. 107-109, Feb. 2007.
[6] Y. Huang, C. Kintala, N. Kolettis, and N. Fulton, "Software Rejuvenation: Analysis, Module and Applications," Proc. 25th Int'l Symp. Fault-Tolerant Computing (FTCS), pp. 381-390, 1995.
[7] K. Vaidyanathan and K.S. Trivedi, "A Comprehensive Model for Software Rejuvenation," IEEE Trans. Dependable and Secure Computing, vol. 2, no. 2, pp. 124-137, Apr.-June 2005.
[8] S. Garg, A. Puliafito, M. Telek, and K. Trivedi, "Analysis of Software Rejuvenation Using Markov Regenerative Stochastic Petri Net," Proc. Sixth Int'l Symp. Software Reliability Eng., pp. 180-187, 1995.
[9] K. Vaidyanathan and K. Trivedi, "A Measurement-Based Model for Estimation of Resource Exhaustion in Operational Software Systems," Proc. 10th Int'l Symp. Software Reliability Eng., pp. 84-93, 1999.
[10] S. Garg, A. van Moorsel, K. Vaidyanathan, and K. Trivedi, "A Methodology for Detection and Estimation of Software Aging," Proc. Ninth Int'l Symp. Software Reliability Eng., pp. 283-292, 1998.
[11] P. Mell and T. Grance, "The NIST Definition of Cloud Computing," Nat'l Inst. of Standards and Technology, vol. 53, no. 6, p. 50, 2011.
[12] A. Rezaei and M. Sharifi, "Rejuvenating High Available Virtualized Systems," Proc. Int'l Conf. Availability, Reliability, and Security, pp. 289-294, 2010.
[13] P.H. Kvam and E.A. Peña, "Estimating Load-Sharing Properties in a Dynamic Reliability System," J. Am. Statistical Assoc., vol. 100, no. 469, pp. 262-272, 2005.
[14] L. Huang and Q. Xu, "Lifetime Reliability for Load-Sharing Redundant Systems with Arbitrary Failure Distributions," IEEE Trans. Reliability, vol. 59, no. 2, pp. 319-330, June 2010.
[15] D. Kececioglu, Reliability Engineering Handbook (Vol. 1 and 2). Prentice-Hall, Inc., 1991.
[16] F. Machida, D. Kim, and K. Trivedi, "Modeling and Analysis of Software Rejuvenation in a Server Virtualized System," Proc. IEEE Second Int'l Workshop Software Aging and Rejuvenation, pp. 1-6, 2010.
[17] M.S. Finkelstein, "Wearing-Out of Components in a Variable Environment," Reliability Eng. and System Safety, vol. 66, no. 3, pp. 235-242, 1999.
[18] D.R. Cox, Renewal Theory. Methuen Ltd, 1967.
[19] F. Longo and M. Scarpa, "Applying Symbolic Techniques to the Representation of Non-Markovian Models with Continuous ph Distributions," Proc. Sixth European Performance Eng. Workshop Computer Performance Eng. (EPEW), pp. 44-58, 2009.
[20] S. Distefano, F. Longo, and M. Scarpa, "Symbolic Representation Techniques in Dynamic Reliability Evaluation," Proc. IEEE Int'l Symp. High-Assurance Systems Eng., pp. 45-53, 2010.
[21] M. Neuts, "Probability Distributions of Phase Type," Liber Amicorum Professor Emeritus H. Florin, pp. 173-206, Louvain Univ., 1975.
[22] M.F. Neuts, Matrix-Geometric Solutions in Stochastic Models: An Algorithmic Approach. Johns Hopkins Univ. Press, 1981.
[23] M. Scarpa, "Non Markovian Stochastic Petri Nets with Concurrent Generally Distributed Transtions," PhD dissertation, Univ. of Turin, 1999.
[24] R. Bellman, Introduction to Matrix Analysis, second ed. SIAM, 1997.
[25] A. Bobbio, A. Puliafito, M. Scarpa, and M. Telek, "Webspn: A Web-Accessible Petri Net Tool," Proc. Conf. Web-Based Modeling and Simulation, 1998.
[26] D.R. Cox, "Regression Models and Life-Tables," J. Royal Statistical Soc. Series B (Methodological), vol. 34, no. 2, pp. 187-220, 1972.
[27] A. Bobbio, A. Horváth, and M. Telek, "PhFit: A General Phase-Type Fitting Tool," Proc. Int'l Conf. Dependable Systems and Networks (DSN), p. 543, 2002.
[28] S. Garg, A. Puliafito, M. Telek, and K. Trivedi, "Analysis of Preventive Maintenance in Transactions Based Software Systems," IEEE Trans. Computers, vol. 47, no. 1, pp. 96-107, Jan. 1998.
[29] K. Vaidyanathan, R.E. Harper, S.W. Hunter, and K.S. Trivedi, "Analysis and Implementation of Software Rejuvenation in Cluster Systems," ACM SIGMETRICS Performance Evaluation Rev., vol. 29, pp. 62-71, June 2001.
[30] F. Salfner and K. Wolter, "Analysis of Service Availability for Time-Triggered Rejuvenation Policies," J. Systems and Software, vol. 83, no. 9, pp. 1579-1590, 2010.
[31] V. Castelli, R.E. Harper, P. Heidelberger, S.W. Hunter, K.S. Trivedi, K. Vaidyanathan, and W.P. Zeggert, "Proactive Management of Software Aging," IBM J. R&D, vol. 45, no. 2, pp. 311-332, 2001.
[32] L. Silva, H. Madeira, and J. Silva, "Software Aging and Rejuvenation in a Soap-Based Server," Proc. IEEE Fifth Int'l Symp. Network Computing and Applications, pp. 56-65, 2006.
[33] M. Grottke, L. Li, K. Vaidyanathan, and K. Trivedi, "Analysis of Software Aging in a Web Server," IEEE Trans. Reliability, vol. 55, no. 3, pp. 411-420, Sept 2006.
[34] D. Wang, W. Xie, and K.S. Trivedi, "Performability Analysis of Clustered Systems with Rejuvenation Under Varying Workload," Performance Evaluation, vol. 64, pp. 247-265, 2007.
[35] L. Silva, J. Alonso, and J. Torres, "Using Virtualization to Improve Software Rejuvenation," IEEE Trans. Computers, vol. 58, no. 11, pp. 1525-1538, Nov. 2009.
[36] D. Simeonov and D.R. Avresky, "Proactive Software Rejuvenation Based on Machine Learning Techniques," Proc. First Int'l Conf. Cloud Computing, pp. 186-200, 2010.
232 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool