Issue No.05 - Sept.-Oct. (2013 vol.10)
pp: 253-272
Bernardetta Addis, Università degli Studi di Torino, Torino
Danilo Ardagna, Politecnico di Milano, Milan
Barbara Panicucci, Politecnico di Milano, Milan and Università di Modena e Reggio Emilia, Reggio Emilia
Mark S. Squillante, IBM Thomas J. Watson Research Center, Yorktown Heights
Li Zhang, IBM Thomas J. Watson Research Center, Yorktown Heights
ABSTRACT
Worldwide interest in the delivery of computing and storage capacity as a service continues to grow at a rapid pace. The complexities of such cloud computing centers require advanced resource management solutions that are capable of dynamically adapting the cloud platform while providing continuous service and performance guarantees. The goal of this paper is to devise resource allocation policies for virtualized cloud environments that satisfy performance and availability guarantees and minimize energy costs in very large cloud service centers. We present a scalable distributed hierarchical framework based on a mixed-integer nonlinear optimization of resource management acting at multiple timescales. Extensive experiments across a wide variety of configurations demonstrate the efficiency and effectiveness of our approach.
INDEX TERMS
Servers, Resource management, Optimization, Availability, Quality of service, Load management, Radio spectrum management, virtualized system QoS-based migration policies, Performance attributes, performance of systems, quality concepts, optimization of cloud configurations, QoS-based scheduling and load balancing
CITATION
Bernardetta Addis, Danilo Ardagna, Barbara Panicucci, Mark S. Squillante, Li Zhang, "A Hierarchical Approach for the Resource Management of Very Large Cloud Platforms," IEEE Transactions on Dependable and Secure Computing, vol. 10, no. 5, pp. 253-272, Sept.-Oct. 2013, doi:10.1109/TDSC.2013.4