Resource Allocation for Session-Based Two-Dimensional Service Differentiation on e-Commerce Servers
August 2006 (vol. 17 no. 8)
pp. 838-850
Xiaobo Zhou, IEEE Computer Society

Abstract—A scalable e-commerce server should be able to provide different levels of quality of service (QoS) to different types of requests based on clients' navigation patterns and the server capacity. E-Commerce workloads are composed of sessions. In this paper, we propose a session-based two-dimensional (2D) service differentiation model for online transactions: intersession and intrasession. The intersession model aims to provide different levels of QoS to sessions from different customer classes, and the intrasession model aims to provide different levels of QoS to requests in different states of a session. A primary performance metric of online transactions is slowdown. It measures the waiting time of a request relative to its service time. We present a processing rate allocation scheme for 2D proportional slowdown differentiation. We then introduce service slowdown as a systemwide QoS metric of an e-commerce server. It is defined as the weighted sum of request slowdown in different sessions and in different session states. We formulate the problem of 2D service differentiation as an optimization of processing rate allocation with the objective of minimizing the service slowdown of the server. We prove that the derived rate allocation scheme based on the optimization guarantees client requests' slowdown to be square-root proportional to their prespecified differentiation weights in both intersession and intrasession dimensions. We evaluate this square-root proportional rate allocation scheme and a proportional rate allocation scheme via extensive simulations. Results validate that both schemes can achieve predictable, controllable, and fair 2D service differentiation on e-commerce servers. The square-root proportional rate allocation scheme provides 2D service differentiation at a minimum cost of service slowdown.
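
To make the idea of proportional slowdown differentiation concrete, the following is a minimal sketch rather than the allocation scheme derived in the paper. It assumes a simple queueing proxy that the abstract does not state: a class with arrival rate `lam` and allocated processing rate `rate` (both in units of mean request work per second) sees slowdown `lam / (rate - lam)`. The function names, the convention that a larger weight means a proportionally larger slowdown, and the sample numbers are all hypothetical.

```python
from typing import List, Sequence


def assumed_slowdown(lam: float, rate: float) -> float:
    """Slowdown proxy (waiting time relative to service time).

    ASSUMPTION: an M/M/1-style ratio of mean wait to mean service,
    lam / (rate - lam). This is an illustrative model only.
    """
    if rate <= lam:
        raise ValueError("allocated rate must exceed the arrival rate")
    return lam / (rate - lam)


def proportional_rates(
    arrivals: Sequence[float],
    weights: Sequence[float],
    capacity: float,
) -> List[float]:
    """Split `capacity` so that slowdown_i / slowdown_j == w_i / w_j.

    Under the assumed model, setting slowdown_i = w_i * t and requiring
    the rates to sum to `capacity` gives the closed form
        rate_i = lam_i + (lam_i / w_i) * spare / sum_j(lam_j / w_j),
    where spare = capacity - sum_j(lam_j).
    """
    if len(arrivals) != len(weights):
        raise ValueError("arrivals and weights must have the same length")
    if any(w <= 0 for w in weights):
        raise ValueError("weights must be positive")
    spare = capacity - sum(arrivals)
    if spare <= 0:
        raise ValueError("server is overloaded: no spare capacity")
    norm = sum(lam / w for lam, w in zip(arrivals, weights))
    return [lam + (lam / w) * spare / norm for lam, w in zip(arrivals, weights)]


if __name__ == "__main__":
    # Hypothetical example: two classes, the second tolerating twice the slowdown.
    lams, ws = [0.2, 0.3], [1.0, 2.0]
    rates = proportional_rates(lams, ws, capacity=1.0)
    for lam, w, r in zip(lams, ws, rates):
        print(f"weight={w} rate={r:.4f} slowdown={assumed_slowdown(lam, r):.4f}")
```

For the two-dimensional case, one possible reading is to treat each (customer class, session state) pair as a separate entry with its own arrival rate and weight; that flattening is an assumption of this sketch, not something the abstract specifies.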

Index Terms:
Two-dimensional service differentiation, session, slowdown, rate allocation, resource management.
Citation:
Xiaobo Zhou, Jianbin Wei, Cheng-Zhong Xu, "Resource Allocation for Session-Based Two-Dimensional Service Differentiation on e-Commerce Servers," IEEE Transactions on Parallel and Distributed Systems, vol. 17, no. 8, pp. 838-850, Aug. 2006, doi:10.1109/TPDS.2006.111