The Community for Technology Leaders
RSS Icon
Issue No.04 - Fourth Quarter (2012 vol.5)
pp: 540-550
Zibin Zheng , The Chinese University of Hong Kong, Hunan
Tom Chao Zhou , The Chinese University of Hong Kong, Hunan
Michael R. Lyu , The Chinese University of Hong Kong, Hunan
Irwin King , The Chinese University of Hong Kong, Hunan
Cloud computing is becoming a mainstream aspect of information technology. More and more enterprises deploy their software systems in the cloud environment. The cloud applications are usually large scale and include a lot of distributed cloud components. Building highly reliable cloud applications is a challenging and critical research problem. To attack this challenge, we propose a component ranking framework, named FTCloud, for building fault-tolerant cloud applications. FTCloud includes two ranking algorithms. The first algorithm employs component invocation structures and invocation frequencies for making significant component ranking. The second ranking algorithm systematically fuses the system structure information as well as the application designers' wisdom to identify the significant components in a cloud application. After the component ranking phase, an algorithm is proposed to automatically determine an optimal fault-tolerance strategy for the significant cloud components. The experimental results show that by tolerating faults of a small part of the most significant components, the reliability of cloud applications can be greatly improved.
Cloud computing, Service oriented architecture, Information technology, Fault tolerance, Software reliability, software reliability, Cloud application, component ranking, fault tolerance
Zibin Zheng, Tom Chao Zhou, Michael R. Lyu, Irwin King, "Component Ranking for Fault-Tolerant Cloud Applications", IEEE Transactions on Services Computing, vol.5, no. 4, pp. 540-550, Fourth Quarter 2012, doi:10.1109/TSC.2011.42
[1] "Cloud Computing in Wikipedia," , 2012.
[2] D. Ardagna and B. Pernici, "Adaptive Service Composition in Flexible Processes," IEEE Trans. Software Eng., vol. 33, no. 6, pp. 369-384, June 2007.
[3] M. Armbrust et al., "A View of Cloud Computing," Comm. ACM, vol. 53, no. 4, pp. 50-58, 2010.
[4] M. Armbrust et al., "Above the Clouds: A Berkeley View of Cloud Computing," Technical Report EECS-2009-28, Electrical Eng. and Computer Science Dept., Univ. of California, Berkeley, 2009.
[5] A. Avizienis, "The Methodology of N-Version Programming," Software Fault Tolerance, M.R. Lyu, ed., pp. 23-46, Wiley, 1995.
[6] V. Batagelj and A. Mrvar, "Pajek - Program for Large Network Analysis," Connections, vol. 21, pp. 47-57, 1998.
[7] S. Brin and L. Page, "The Anatomy of a Large-Scale Hypertextual Web Search Engine," Proc. Int'l Conf. World Wide Web, 1998.
[8] M. Creeger, "Cloud Computing: An Overview," ACM Queue, vol. 7, no. 5, p. 2, June 2009.
[9] A. Danak and S. Mannor, "Resource Allocation with Supply Adjustment in Distributed Computing Systems," Proc. Int'l Conf. Distributed Computing Systems (ICDCS '10), pp. 498-506, 2010.
[10] A.P.S. de Moura, Y.-C. Lai, and A.E. Motter, "Signatures of Small-World and Scale-Free Properties in Large Computer Programs," Physical Rev. E, vol. 68, pp. 017102.1-017102.4, 2003.
[11] C.-L. Fang, D. Liang, F. Lin, and C.-C. Lin, "Fault Tolerant Web Services," J. System Architecture, vol. 53, no. 1, pp. 21-38, 2007.
[12] S.S. Gokhale and K.S. Trivedi, "Reliability Prediction and Sensitivity Analysis Based on Software Architecture," Proc. Int'l Symp. Software Reliability Eng. (ISSRE '02), pp. 64-78, 2002.
[13] S. Gorender, R.J. de Araujo Macedo, and M. Raynal, "An Adaptive Programming Model for Fault-Tolerant Distributed Computing," IEEE Trans. Dependable and Secure Computing, vol. 4, no. 1, pp. 18-31, Jan.-Mar. 2007.
[14] A. Goscinski and M. Brock, "Toward Dynamic and Attribute-Based Publication, Discovery and Selection for Cloud Computing," Future Generation Computer Systems, vol. 26, no. 7, pp. 947-970, 2010.
[15] D. Hyland-Wood, D. Carrington, and Y. Kaplan, "Scale-Free Nature of Java Software Package, Class and Method Collaboration Graphs," Proc. Int'l Symp. Empirical Software Eng., pp. 439-446, 2005.
[16] K. Inoue, R. Yokomori, T. Yamamoto, M. Matsushita, and S. Kusumoto, "Ranking Significance of Software Components Based on Use Relations," IEEE Trans. Software Eng., vol. 31, no. 3, pp. 213-225, Mar. 2005.
[17] K. Kim and H. Welch, "Distributed Execution of Recovery Blocks: An Approach for Uniform Treatment of Hardware and Software Faults in Real-Time Applications," IEEE Trans. Computers, vol. 38, no. 5, pp. 626-636, May 1989.
[18] J. Laprie, J. Arlat, C. Beounes, and K. Kanoun, "Definition and Analysis of Hardware- and Software-Fault-Tolerant Architectures," Computer, vol. 23, no. 7, pp. 39-51, July 1990.
[19] W. Li, J. He, Q. Ma, I.-L. Yen, F. Bastani, and R. Paul, "A Framework to Support Survivable Web Services," Proc. IEEE 19th Int'l Symp. Parallel and Distributed Processing, p. 93b, 2005.
[20] M.R. Lyu, Software Fault Tolerance, Wiley, 1995.
[21] M.R. Lyu, Handbook of Software Reliability Engineering. McGraw-Hill, 1996.
[22] E. Maximilien and M. Singh, "Conceptual Model of Web Service Reputation," ACM SIGMOD Record, vol. 31, no. 4, pp. 36-41, 2002.
[23] M.G. Merideth, A. Iyengar, T. Mikalsen, S. Tai, I. Rouvellou, and P. Narasimhan, "Thema: Byzantine-Fault-Tolerant Middleware Forweb-Service Applications," Proc. IEEE 24th Symp. Reliable Distributed Systems (SRDS '05), pp. 131-142, 2005.
[24] S.L. Pallemulle, H.D. Thorvaldsson, and K.J. Goldman, "Byzantine Fault-Tolerant Web Services for N-Tier and Service Oriented Architectures," Proc. 28th Int'l Conf. Distributed Computing Systems (ICDCS '08), pp. 260-268, 2008.
[25] B. Randell and J. Xu, "The Evolution of the Recovery Block Concept," Software Fault Tolerance, M.R. Lyu, ed., pp. 1-21, Wiley, 1995.
[26] P. Rooney, "Microsoft's CEO: 80-20 Rule Applies to Bugs, Not Just Features," ChannelWeb, Oct. 2002.
[27] S. Rosario, A. Benveniste, S. Haar, and C. Jard, "Probabilistic QoS and Soft Contracts for Transaction-Based Web Services Orchestrations," IEEE Trans. Services Computing, vol. 1, no. 4, pp. 187-200, Oct. 2008.
[28] J. Salas, F. Perez-Sorrosal, M. Patiño-Martínez, and R. Jiménez-Peris, "WS-Replication: A Framework for Highly Available Web Services," Proc. 15th Int'l Conf. World Wide Web, pp. 357-366, 2006.
[29] G.T. Santos, L.C. Lung, and C. Montez, "FTWeb: A Fault Tolerant Infrastructure for Web Services," Proc. IEEE Ninth Int'l Conf. Enterprise Computing, pp. 95-105, 2005.
[30] Q.Z. Sheng, B. Benatallah, Z. Maamar, and A.H. Ngu, "Configurable Composition and Adaptive Provisioning of Web Services," IEEE Trans. Services Computing, vol. 2, no. 1, pp. 34-49, Jan.-Mar. 2009.
[31] G.-W. Sheu, Y.-S. Chang, D. Liang, S.-M. Yuan, and W. Lo, "A Fault-Tolerant Object Service on Corba," Proc. 17th Int'l Conf. Distributed Computing Systems (ICDCS '97), pp. 393-400, 1997.
[32] S. Sivathanu, L. Liu, M. Yiduo, and X. Pu, "Storage Management in Virtualized Cloud Environment," Proc. IEEE Third Int'l Conf. Cloud Computing (Cloud '10), 2010.
[33] W.-T. Tsai, X. Zhou, Y. Chen, and X. Bai, "On Testing and Evaluating Service-Oriented Software," Computer, vol. 41, no. 8, pp. 40-46, Aug. 2008.
[34] K. Tsakalozos, M. Roussopoulos, V. Floros, and A. Delis, "Nefeli: Hint-Based Execution of Workloads in Clouds," Proc. Int'l Conf. Distributed Computing Systems (ICDCS '10), pp. 74-85, 2010.
[35] G. Wu, J. Wei, X. Qiao, and L. Li, "A Bayesian Network Based QoS Assessment Model for Web Services," Proc. Int'l Conf. Services Computing (SCC '07), pp. 498-505, 2007.
[36] S.M. Yacoub, B. Cukic, and H.H. Ammar, "Scenario-Based Reliability Analysis of Component-Based Software," Proc. Int'l Symp. Software Reliability Eng. (ISSRE '99), pp. 22-31, 1999.
[37] T. Yu, Y. Zhang, and K.-J. Lin, "Efficient Algorithms for Web Services Selection with End-to-End QoS Constraints," ACM Trans. Web, vol. 1, no. 1, pp. 1-26, 2007.
[38] L. Zeng, B. Benatallah, A.H. Ngu, M. Dumas, J. Kalagnanam, and H. Chang, "QoS-Aware Middleware for Web Services Composition," IEEE Trans. Software Eng., vol. 30, no. 5, pp. 311-327, May 2004.
[39] Z. Zheng and M.R. Lyu, "A Distributed Replication Strategy Evaluation and Selection Framework for Fault Tolerant Web Services," Proc. Sixth Int'l Conf. Web Services, pp. 145-152, 2008.
[40] Z. Zheng and M.R. Lyu, "A QoS-Aware Fault Tolerant Middleware for Dependable Service Composition," Proc. 39th Int'l Conf. Dependable Systems and Networks (DSN '09), pp. 239-248, 2009.
[41] Z. Zheng, Y. Zhang, and M.R. Lyu, "CloudRank: A QoS-Driven Component Ranking Framework for Cloud Computing," Proc. Int'l Symp. Reliable Distributed Systems (SRDS '10), 2010.
[42] Z. Zheng, T.C. Zhou, M.R. Lyu, and I. King, "FTCloud: A Ranking-Based Framework for Fault Tolerant Cloud Applications," Proc. Int'l Symp. Software Reliability Eng. (ISSRE '10), 2010.
9 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool