Issue No. 05 - May (2014 vol. 25)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TPDS.2013.82
Ron C. Chiang , Department of Electrical and Computer Engineering at the George Washington University, Room 624B, 801 22nd Street NW, Washington,
H. Howie Huang , Department of Electrical and Computer Engineering at the George Washington University, Room 624B, 801 22nd Street NW, Washington,
Large-scale data centers leverage virtualization technology to achieve excellent resource utilization, scalability, and high availability. Ideally, the performance of an application running inside a virtual machine (VM) shall be independent of co-located applications and VMs that share the physical machine. However, adverse interference effects exist and are especially severe for data-intensive applications in such virtualized environments. In this work, we present TRACON, a novel Task and Resource Allocation CONtrol framework that mitigates the interference effects from concurrent data-intensive applications and greatly improves the overall system performance. TRACON utilizes modeling and control techniques from statistical machine learning and consists of three major components: the interference prediction model that infers application performance from resource consumption observed from different VMs, the interference-aware scheduler that is designed to utilize the model for effective resource management, and the task and resource monitor that collects application characteristics at the runtime for model adaption. We implement and validate TRACON with a variety of cloud applications. The evaluation results show that TRACON can achieve up to 25 percent improvement on application throughput on virtualized servers.
Interference, Predictive models, Servers, Clustering algorithms, Monitoring, Resource management, Virtual machining,scheduling, Cloud computing, virtualization
Ron C. Chiang, H. Howie Huang, "TRACON: Interference-Aware Schedulingfor Data-Intensive Applicationsin Virtualized Environments", IEEE Transactions on Parallel & Distributed Systems, vol. 25, no. , pp. 1349-1358, May 2014, doi:10.1109/TPDS.2013.82