Dependability and Rollback Recovery for Composite Web Services
October-December 2011 (vol. 4 no. 4)
pp. 328-339
H. Elfawal Mansour, Curtin University of Technology, Australia
T. Dillon, Curtin University of Technology, Australia
In this paper, we propose a service-oriented reliability model that dynamically calculates the reliability of composite web services with rollback recovery, based on the real-time reliabilities of the atomic web services in the composition. Our model is a hybrid reliability model that combines path-based and state-based models. Many reliability models assume that failure or error arrival times are exponentially distributed. This is inappropriate for web services, as error arrival times depend on the operating state, including the workload of the servers where the web service resides. In this manuscript, we modify our previous model (for software, based on the Doubly Stochastic Model and Renewal Processes) to evaluate the reliability of atomic web services. To fix ideas, we develop the case of one simple web service with two states, idle and active. In real-world applications, where composite web services may contain a large number of atomic services, the calculus and the computing complexity increase greatly. To limit this effort, we use bounded set techniques, applied with the previously developed stochastic model. As a first type of system combination, we study a scheme that combines web services into parallel and serial configurations with centralized coordination. In this case, the broker has an acceptance testing mechanism that examines the results returned from a particular web service. If the result is acceptable, the computation continues to the next web service. Otherwise, the broker performs a rollback and invokes another web service already specified by a checkpoint algorithm. The acceptance test is conducted by the broker, so the broker can be considered a single point of failure.
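As a rough illustration of the centralized scheme described above, the following Python sketch computes the reliability of serial and parallel compositions from atomic reliabilities, and shows a broker step that applies an acceptance test and falls back to an alternate service from the same checkpoint. The series and parallel formulas are the standard ones and assume independent failures; they are a simplification, not the doubly stochastic model of the paper. All names (`series_reliability`, `parallel_reliability`, `run_with_rollback`, `primary`, `backup`) are hypothetical.

```python
from math import prod
from typing import Callable, Iterable, Optional, Sequence


def series_reliability(reliabilities: Iterable[float]) -> float:
    """All services must succeed: R = product of R_i (independence assumed)."""
    return prod(reliabilities)


def parallel_reliability(reliabilities: Iterable[float]) -> float:
    """At least one service must succeed: R = 1 - product of (1 - R_i)."""
    return 1.0 - prod(1.0 - r for r in reliabilities)


def run_with_rollback(
    checkpoint: int,
    alternates: Sequence[Callable[[int], int]],
    accept: Callable[[int], bool],
) -> Optional[int]:
    """Broker step: try each alternate service from the same checkpointed input.

    Returns the first result that passes the acceptance test, or None
    if every alternate fails or is rejected.
    """
    for service in alternates:
        try:
            result = service(checkpoint)  # always restart from the checkpoint
        except Exception:
            continue  # a crash is treated like a rejected result
        if accept(result):
            return result
    return None


# Hypothetical services: the primary returns an unacceptable value,
# the backup returns an acceptable one.
def primary(x: int) -> int:
    return -1


def backup(x: int) -> int:
    return x * 2


if __name__ == "__main__":
    print(series_reliability([0.9, 0.8]))
    print(parallel_reliability([0.9, 0.8]))
    print(run_with_rollback(21, [primary, backup], lambda r: r >= 0))
```

In this sketch the rollback is trivial because the checkpoint is an immutable input value; a real composition would need to restore service state as well.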
To increase the reliability of the broker introduced in our systems and mask out errors at the broker level, we suggest a modified general scheme based on triple modular redundancy and N-version programming. To imitate a real scenario where errors could happen at any stage of our application, and to improve the quality of service (QoS) of the proposed model, we introduce fault-tolerance techniques using an adaptation of the recovery block technique.
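The idea of masking broker errors through triple modular redundancy can be sketched as a majority vote over the outputs of three broker replicas. This is a minimal illustration under stated assumptions, not the scheme of the paper: the names `tmr_vote` and `tmr_reliability` are hypothetical, the voter is assumed to be perfect, and replica failures are assumed independent, which gives the standard 2-out-of-3 result R_TMR = 3R^2 - 2R^3.

```python
from collections import Counter
from typing import Hashable, Optional, Sequence


def tmr_vote(outputs: Sequence[Hashable]) -> Optional[Hashable]:
    """Return the value produced by a strict majority of replicas, else None."""
    if not outputs:
        return None
    value, count = Counter(outputs).most_common(1)[0]
    return value if count > len(outputs) // 2 else None


def tmr_reliability(r: float) -> float:
    """Reliability of 2-out-of-3 replicas with a perfect voter (independence assumed)."""
    return 3 * r**2 - 2 * r**3


if __name__ == "__main__":
    # One replica disagrees; the majority decision masks the error.
    print(tmr_vote(["accept", "accept", "reject"]))
    print(tmr_reliability(0.9))
```

With these assumptions, redundancy only helps when each replica is more reliable than 0.5; below that, the voted result is less reliable than a single broker.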

Index Terms:
Bounded set, broker, checkpoint, dependability, doubly stochastic model, rollback, semi-Markov process, series-parallel systems.
H. Elfawal Mansour, T. Dillon, "Dependability and Rollback Recovery for Composite Web Services," IEEE Transactions on Services Computing, vol. 4, no. 4, pp. 328-339, Oct.-Dec. 2011, doi:10.1109/TSC.2010.16