This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Evaluating Web Software Reliability Based on Workload and Failure Data Extracted from Server Logs
November 2004 (vol. 30 no. 11)
pp. 754-769
Jeff Tian, IEEE
Zhao Li, IEEE Computer Society
In this paper, we characterize usage and problems for Web applications, evaluate their reliability, and examine the potential for reliability improvement. Based on the characteristics of Web applications and the overall Web environment, we classify Web problems and focus on the subset of source content problems. Using information about Web accesses, we derive various measurements that can characterize Web site workload at different levels of granularity and from different perspectives. These workload measurements, together with failure information extracted from recorded errors, are used to evaluate the operational reliability for source contents at a given Web site and the potential for reliability improvement. We applied this approach to the Web sites www.seas.smu.edu and www.kde.org. The results demonstrated the viability and effectiveness of our approach.

[1] M.F. Arlitt and C.L. Williamson, “Internet Web Servers: Workload Characterization and Performance Implications,” IEEE/ACM Trans. Networking, vol. 5, no. 5, pp. 631-645, Oct. 1997.
[2] B. Behlandorf, Running a Perfect Web Site with Apache, second ed. MacMillan Computer Publishing, 1996.
[3] M.E. Crovella and A. Bestavros, “Self-Similarity in World Wide Web Traffic: Evidence and Possible Causes,” IEEE/ACM Trans. Networking, vol. 5, no. 6, pp. 835-846, Dec. 1997.
[4] A.L. Goel and K. Okumoto, “A Time Dependent Error Detection Rate Model For Software Reliability and Other Performance Measures,” IEEE Trans. Reliability, vol. 28, no. 3, pp. 206-211, 1979.
[5] IEEE Standard Glossary of Software Engineering Terminology, Number STD 610.12-1990, IEEE, 1990.
[6] C. Kallepalli and J. Tian, “Measuring and Modeling Usage and Reliability for Statistical Web Testing,” IEEE Trans. Software Eng., vol. 27, no. 11, pp. 1023-1036, Nov. 2001.
[7] A.G. Koru and J. Tian, “Defect Handling in Medium and Large Open Source Software Projects,” IEEE Software, vol. 21, no. 4, pp. 54-61, July 2004.
[8] Z. Li and J. Tian, “Analyzing Web Logs to Identify Common Errors and Improve Web Reliability,” Proc. IADIS Int'l Conf. E-Society, pp. 235-242, June 2003.
[9] Handbook of Software Reliability Engineering. M.R. Lyu, ed. McGraw-Hill, 1995.
[10] L. Ma and J. Tian, “Analyzing Errors and Referral Pairs to Characterize Common Problems and Improve Web Reliability,” Proc. Third Int'l Conf. Web Eng., pp. 314-323, July 2003.
[11] A.L. Montgomery and C. Faloutsos, “Identifying Web Browsing Trends and Patterns,” IEEE Computer, vol. 34, no. 7, pp. 94-95, July 2001.
[12] J.D. Musa, A. Iannino, and K. Okumoto, Software Reliability: Measurement, Prediction, Application. McGraw-Hill, 1987.
[13] E. Nelson, “Estimating Software Reliability from Test Data,” Microelectronics and Reliability, vol. 17, no. 1, pp. 67-73, 1978.
[14] J. Offutt, “Quality Attributes of Web Applications,” Software, vol. 19, no. 2, pp. 25-32, Mar. 2002.
[15] J.E. Pitkow, “Summary of WWW Characterizations,” World Wide Web, vol. 2, nos. 1-2, pp. 3-13, 1999.
[16] N.F. Schneidewind, “Software Reliability Model with Optimal Selection of Failure Data,” Trans. Software Eng., vol. 19, no. 11, pp. 1095-1104, Nov. 1993.
[17] N. Singpurwalla, “Software Reliability Modeling by Concatenating Failure Rates,” Proc. Ninth Int'l Symp. Software Reliability Eng., pp. 106-110, Nov. 1998.
[18] R. Thayer, M. Lipow, and E. Nelson, Software Reliability. North-Holland, 1978.
[19] J. Tian, “Integrating Time Domain and Input Domain Analyses of Software Reliability Using Tree-Based Models,” IEEE Trans. Software Eng., vol. 21, no. 12, pp. 945-958, Dec. 1995.
[20] J. Tian, “Better Reliability Assessment and Prediction through Data Clustering,” IEEE Trans. Software Eng., vol. 28, no. 10, pp. 997-1007, Oct. 2002.
[21] K.S. Trivedi, Probability and Statistics with Reliability, Queuing, and Computer Science Applications, second ed. John Wiley & Sons, 2001.

Index Terms:
World Wide Web (WWW) and Internet, Web applications and Web server logs, quality and reliability, reliability modeling, workload measurement
Citation:
Jeff Tian, Sunita Rudraraju, Zhao Li, "Evaluating Web Software Reliability Based on Workload and Failure Data Extracted from Server Logs," IEEE Transactions on Software Engineering, vol. 30, no. 11, pp. 754-769, Nov. 2004, doi:10.1109/TSE.2004.87
Usage of this product signifies your acceptance of the Terms of Use.