High-Performance Resource Allocation and Request Redirection Algorithms for Web Clusters
September 2008 (vol. 19 no. 9)
pp. 1186-1200
With increasing richness in features such as personalization of content, web applications are becoming more complex and hence more compute intensive. Traditional approaches for improving the performance of static content web sites have been based on the assumption that static content such as images is network intensive. However, these methods do not apply to dynamic content applications, which are more compute intensive than static content. This paper proposes a suite of algorithms that jointly optimize the performance of dynamic content applications by reducing client access times while also minimizing resource utilization. A server migration algorithm allocates servers on demand within a cluster so that client access times are not affected even under sudden overload conditions. Further, a server selection mechanism enables statistical multiplexing of resources across clusters by redirecting requests away from overloaded clusters. We also propose a cluster decision algorithm that decides, under different workload conditions, whether to migrate additional servers into the local cluster or to redirect requests to a remote cluster. Through a combination of analytical modeling, trace-driven simulation over traces from large e-commerce sites, and a testbed implementation, we explore the performance savings achieved by the proposed algorithms.
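The choice the abstract describes, between migrating servers into the local cluster and redirecting requests to a remote one, can be illustrated with a toy sketch. This is not the paper's algorithm: it assumes every server behaves as an independent M/M/1 queue with evenly split load, and all names and parameters (`response_time`, `decide`, `wan_delay`, the response-time `target`) are hypothetical, chosen only for illustration.

```python
import math


def response_time(arrival_rate, servers, service_rate):
    """Mean response time with load split evenly over identical M/M/1 servers.

    Assumption for illustration only; the paper's own model may differ.
    """
    per_server = arrival_rate / servers
    if per_server >= service_rate:
        return math.inf  # the servers are saturated
    return 1.0 / (service_rate - per_server)


def decide(local_rate, local_servers, spare_servers,
           remote_rate, remote_servers, service_rate, target, wan_delay):
    """Return a hypothetical (action, amount) pair for the local cluster.

    action is "stay", "migrate" (amount = servers to add),
    "redirect" (amount = request rate to send away) or "overloaded".
    """
    if response_time(local_rate, local_servers, service_rate) <= target:
        return ("stay", 0)

    # Largest per-server arrival rate that still meets the target:
    # 1 / (mu - x) <= target  implies  x <= mu - 1 / target.
    headroom = service_rate - 1.0 / target
    if headroom <= 0:
        return ("overloaded", 0)  # the target is below the bare service time

    # Option 1: migrate in enough spare servers to meet the target locally.
    extra = math.ceil(local_rate / headroom) - local_servers
    if extra <= spare_servers:
        return ("migrate", extra)

    # Option 2: redirect the excess load, paying a wide-area delay penalty.
    excess = local_rate - local_servers * headroom
    remote_time = response_time(remote_rate + excess, remote_servers,
                                service_rate) + wan_delay
    if remote_time <= target:
        return ("redirect", excess)
    return ("overloaded", 0)


if __name__ == "__main__":
    # 90 req/s on 10 servers, each serving 10 req/s, with a 0.5 s target.
    print(decide(90, 10, 2, 20, 5, 10.0, 0.5, 0.1))
    print(decide(90, 10, 1, 20, 5, 10.0, 0.5, 0.1))
```

The sketch prefers local migration whenever enough spare servers exist and falls back to redirection only when the remote cluster, including the wide-area delay, can still meet the target. A real policy would also have to weigh migration time and workload variability, which this toy model ignores.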

Index Terms:
Distributed Systems, Client/server, Client/server and multitier systems, Electronic commerce
Supranamaya Ranjan, Edward Knightly, "High-Performance Resource Allocation and Request Redirection Algorithms for Web Clusters," IEEE Transactions on Parallel and Distributed Systems, vol. 19, no. 9, pp. 1186-1200, Sept. 2008, doi:10.1109/TPDS.2007.70810