This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Efficient Server Provisioning with Control for End-to-End Response Time Guarantee on Multitier Clusters
January 2012 (vol. 23 no. 1)
pp. 78-86
Xiaobo Zhou, Dept. of Comput. Sci., Univ. of Colorado at Colorado Springs, Colorado Springs, CO, USA
P. Lama, Dept. of Comput. Sci., Univ. of Colorado at Colorado Springs, Colorado Springs, CO, USA
Dynamic virtual server provisioning is critical to quality-of-service assurance for multitier Internet applications. In this paper, we address three important challenging problems. First, we propose an efficient server provisioning approach on multitier clusters based on an end-to-end resource allocation optimization model. It is to minimize the number of virtual servers allocated to the system while the average end-to-end response time guarantee is satisfied. Second, we design a model-independent fuzzy controller for bounding an important performance metric, the 90th-percentile response time of requests flowing through the multitier architecture. Third, to compensate for the latency due to the dynamic addition of virtual servers, we design a self-tuning component that adaptively adjusts the output scaling factor of the fuzzy controller according to the transient behavior of the end-to-end response time. Extensive simulation results, using two representative customer behavior models in a typical three-tier web cluster, demonstrate that the provisioning approach is able to significantly reduce the number of virtual servers allocated for the performance guarantee compared to an existing representative approach. The approach integrated with the model-independent self-tuning fuzzy controller can efficiently assure the average and the 90th-percentile end-to-end response time guarantees on multitier clusters.

[1] T.F. Abdelzaher, K.G. Shin, and N. Bhatti, “Performance Guarantees for Web Server End-Systems: A Control-Theoretical Approach,” IEEE Trans. Parallel and Distributed Systems, vol. 13, no. 1, pp. 80-96, Jan. 2002.
[2] J. Chen, G. Soundararajan, and C. Amza, “Autonomic Provisioning of Backend Databases in Dynamic Content Web Servers,” Proc. IEEE Int'l Conf. Autonomic Computing (ICAC), 2006.
[3] Y. Diao, J.L. Hellerstein, S. Parekh, H. Shaihk, and M. Surendra, “Controlling Quality of Service in Multi-Tier Web Applications,” Proc. IEEE 26th Int'l Conf. Distributed Computing Systems (ICDCS), 2006.
[4] S. Elnikety, S.G. Dropsho, and W. Zwaenepoel, “Tashkent+: Memory-Aware Load Balancing and Update Filtering in Replicated Databases,” Proc. European Conf. Computer Systems (EuroSys), 2007.
[5] M. Harchol-Balter, “Task Assignment with Unknown Duration,” J. ACM, vol. 29, no. 2, pp. 260-288, 2002.
[6] A. Kamra, V. Misra, and E.M. Nahum, “Yaksha: A Self-Tuning Controller for Managing the Performance of 3-Tiered Web Sites,” Proc. Int'l Workshop Quality of Service (IWQoS), 2004.
[7] A. Karve, T. Kimbrel, G. Pacifici, M. Spreitzer, M. Steinder, M. Sviridenko, and A. Tantawi, “Dynamic Placement for Clustered Web Applications,” Proc. ACM 15th Int'l Conf. World Wide Web, 2006.
[8] P. Lama and X. Zhou, “Efficient Server Provisioning for End-to-End Delay Guarantee on Multi-Tier Clusters,” Proc. IEEE Int'l Workshop Quality of Service (IWQoS), 2009.
[9] X. Liu, L. Sha, and Y. Diao, “Online Response Time Optimization of Apache Web Server,” Proc. Int'l Workshop Quality of Service (IWQoS), 2003.
[10] X. Liu, J. Heo, L. Sha, and X. Zhu, “Queuing-Model-Based Adaptive Control of Multi-Tiered Web Applications,” IEEE Trans. Network and Service Management, vol. 5, no. 3, pp. 157-167, Sept. 2008.
[11] C. Lu, Y. Lu, T.F. Abdelzaher, J.A. Stankovic, and S.H. Son, “Feed Back Control Architecture and Design Methodology for Service Delay Guarantees in Web Servers,” IEEE Trans. Parallel and Distributed Systems, vol. 17, no. 9, pp. 1014-1027, Sept. 2006.
[12] C. Lu, X. Wang, and X. Koutsoukos, “Feedback Utilization Control in Distributed Real-Time Systems with End-To-End Tasks,” IEEE Trans. Parallel and Distributed Systems, vol. 16, no. 6, pp. 550-561, June 2005.
[13] D.A. Menascé, R. Fonseca, V.A.F. Almeida, and M.A. Mendes, “Resource Management Policies for E-Commerce Servers,” ACM SIGMETRICS Performance Evaluation Rev., vol. 27, no. 4, pp. 27-35, 2000.
[14] P. Padala, K.G. Shin, X. Zhu, M. Uysal, Z. Wang, S. Singhal, A. Merchant, and K. Salem, “Adaptive Control of Virtualized Resources in Utility Computing Environments,” Proc. European Conf. Computer Systems (EuroSys), 2007.
[15] P. Padala, K.-Y. Hou, K.G. Shin, X. Zhu, M. Uysal, Z. Wang, S. Singhal, and A. Merchant, “Automated Control of Multiple Virtualized Resources,” Proc. European Conf. Computer Systems (EuroSys), 2009.
[16] L. Sha, X. Liu, Y. Lu, and T. Abdelzaher, “Queuing Model Based Network Server Performance Control,” Proc. IEEE Real-Time Systems Symp. (RTSS), 2002.
[17] C. Stewart and K. Shen, “Performance Modeling and System Management for Multi-Component Online Services,” Proc. USENIX Second Symp. Networked Systems Design and Implementation (NSDI), 2005.
[18] C. Stewart, T. Kelly, and A. Zhang, “Exploiting Nonstationarity for Performance Prediction,” Proc. Second Conf. Computer Systems (EuroSys), 2007.
[19] B. Urgaonkar, G. Pacific, P. Shenoy, M. Spreitzer, and A. Tantawi, “An Analytical Model for Multi-Tier Internet Services and Its Applications,” Proc. ACM SIGMETRICS Int'l Conf. Measurement and Modeling of Computer Systems, 2005.
[20] B. Urgaonkar, P. Shenoy, A. Chandra, P. Goyal, and T. Wood, “Agile Dynamic Provisioning of Multi-Tier Internet Applications,” ACM Trans. Autonomous and Adaptive Systems, vol. 3, no. 1, pp. 1-39, 2008.
[21] D. Villela, P. Pradhan, and D. Rubenstein, “Provisioning Servers in the Application Tier for E-Commerce Systems,” ACM Trans. Internet Technology, vol. 7, no. 1, pp. 1-23, 2007.
[22] X. Wang, C. Lu, and X. Koutsoukos, “DEUCON: Decentralized End-To-End Utilization Control for Distributed Real-Time Systems,” IEEE Trans. Parallel and Distributed Systems, vol. 18, no. 7, pp. 996-1009, July 2007.
[23] J. Wei and C.-Z. Xu, “eQoS: Provisioning of Client-Perceived End-to-End QoS Guarantee in Web Servers,” IEEE Trans. Computers, vol. 55, no. 12, pp. 1543-1556, Dec. 2006.
[24] J. Wei, X. Zhou, and C.-Z. Xu., “Robust Processing Rate Allocation for Proportional Slowdown Differentiation on Internet Servers,” IEEE Trans. Computers, vol. 54, no. 8, pp. 964-977, Aug. 2005.
[25] M. Welsh and D. Culler, “Adaptive Overload Control for Busy Internet Servers,” Proc. USENIX Symp. Internet Technologies and Systems (USITS), 2003.
[26] Q. Zhang, L. Cherkasova, and E. Smirni, “A Regression-Based Analytic Model for Dynamic Resource Provisioning of Multi-Tier Internet Applications,” Proc. IEEE Int'l Conf. Autonomic Computing (ICAC), 2007.
[27] Q. Zhang and Y.A. Phillis, “Fuzzy Control of Arrivals to Tandem Queues with Two Stations,” IEEE Trans. Fuzzy Systems, vol. 7, no. 3, pp. 361-367, June 1999.
[28] X. Zhou and D. Ippolit, “Resource Allocation Optimization for Quantitative Service Differentiation on Server Clusters,” J. Parallel and Distributed Computing, vol. 68, no. 9, pp. 1250-1262, 2008.
[29] X. Zhou, J. Wei, and C.-Z. Xu, “Resource Allocation For Session-Based Two-Dimensional Service Differentiation on E-Commerce Servers,” IEEE Trans. Parallel and Distributed Systems, vol. 17, no. 8, pp. 838-850, Aug. 2006.

Index Terms:
workstation clusters,Internet,quality of service,model-independent self-tuning fuzzy controller,end-to-end response time,multitier cluster,dynamic virtual server provisioning,quality-of-service assurance,multitier Internet application,end-to-end resource allocation optimization model,multitier architecture,output scaling factor,customer behavior,three-tier Web cluster,Servers,Time factors,Resource management,Optimization,Fuzzy control,Internet,Pragmatics,control.,Autonomic resource provisioning,performance assurance,multitier Internet services
Citation:
Xiaobo Zhou, P. Lama, "Efficient Server Provisioning with Control for End-to-End Response Time Guarantee on Multitier Clusters," IEEE Transactions on Parallel and Distributed Systems, vol. 23, no. 1, pp. 78-86, Jan. 2012, doi:10.1109/TPDS.2011.88
Usage of this product signifies your acceptance of the Terms of Use.