Ostro: Scalable Placement Optimization of Complex Application Topologies in Large-Scale Data Centers
2015 IEEE 35th International Conference on Distributed Computing Systems (ICDCS) (2015)
Columbus, OH, USA
June 29, 2015 to July 2, 2015
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICDCS.2015.23
A complex cloud application consists of virtual machines (VMs) running software such as web servers and load balancers, storage in the form of disk volumes, and network connections that enable communication between VMs and between VMs and disk volumes. The application is also associated with various requirements, including not only quantities such as the sizes of the VMs and disk volumes, but also quality of service (QoS) attributes such as throughput, latency, and reliability. This paper presents Ostro, an Open Stack-based scheduler that optimizes the utilization of data center resources, while satisfying the requirements of the cloud applications. The novelty of the approach realized by Ostro is that it makes holistic placement decisions, in which all the requirements of an application -- described using an application topology abstraction -- are considered jointly. Specific placement algorithms for application topologies are described including an estimate-based greedy algorithm and a time-bounded A algorithm. These algorithms can deal with complex topologies that have heterogeneous resource requirements, while still being scalable enough to handle the placement of hundreds of VMs and volumes across several thousands of host servers. The approach is evaluated using both extensive simulations and realistic experiments. These results show that Ostro significantly improves resource utilization when compared with naive approaches.
Topology, Network topology, Bandwidth, Switches, Heating, Quality of service, Reliability
G. Jung, M. Hiltunen, K. Joshi, R. Panta and R. Schlichting, "Ostro: Scalable Placement Optimization of Complex Application Topologies in Large-Scale Data Centers," 2015 IEEE 35th International Conference on Distributed Computing Systems (ICDCS), Columbus, OH, USA, 2015, pp. 143-152.