2018 IEEE International Conference on Cluster Computing (CLUSTER) (2018)
Belfast, United Kingdom
Sep 10, 2018 to Sep 13, 2018
ISSN: 2168-9253
ISBN: 978-1-5386-8319-4
pp: 162-163
Reducing tail latency is becoming increasingly important for improving the user-perceived service experience. User-facing, latency-sensitive cloud applications typically contain multiple interactive tiers running in different virtual machines (VMs) with complex interaction patterns, which makes consolidating those applications a challenge. In this paper we study the consolidation of multi-tier interactive workloads from a new perspective: user-perceived tail latency. We propose a novel profiling-based consolidation methodology whose objective is to satisfy tail latency requirements while reducing the number of physical machines. We consider two key factors that affect the tail latency of multi-tier workloads: interference with neighboring VMs and interaction between different tiers. We model the consolidation of multi-tier workloads as an optimization problem with different objectives and constraints. We implement and evaluate the proposed models and compare them with other methods (i.e., without profiling or without considering interaction influence). Experimental results show that the proposed method greatly reduces tail latency compared with the traditional consolidation method.
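
To make the consolidation objective concrete, the following is a minimal illustrative sketch, not the authors' formulation. It assumes a hypothetical linear interference model in which each VM has a profiled stand-alone 99th-percentile latency (`base_p99_ms`) and an interference `pressure` that it imposes on co-located VMs. A first-fit heuristic stands in for the paper's optimization models, and interaction between tiers is not modeled. All names and numbers are invented for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class VM:
    name: str
    base_p99_ms: float  # hypothetical profiled tail latency when running alone
    pressure: float     # hypothetical interference imposed on co-located VMs


@dataclass
class Host:
    vms: list = field(default_factory=list)


def predicted_p99(vm, neighbors):
    # Assumed model: tail latency grows linearly with the neighbors' total pressure.
    return vm.base_p99_ms * (1.0 + sum(n.pressure for n in neighbors))


def fits(host, vm, slo_ms):
    # A placement is accepted only if every VM on the host stays within the target.
    candidate = host.vms + [vm]
    return all(
        predicted_p99(v, [n for n in candidate if n is not v]) <= slo_ms
        for v in candidate
    )


def consolidate(vms, slo_ms):
    # First-fit decreasing by pressure: open a new host only when no existing one fits.
    hosts = []
    for vm in sorted(vms, key=lambda v: v.pressure, reverse=True):
        for host in hosts:
            if fits(host, vm, slo_ms):
                host.vms.append(vm)
                break
        else:
            hosts.append(Host([vm]))
    return hosts


tiers = [VM("web", 10.0, 0.5), VM("db", 20.0, 0.5), VM("cache", 5.0, 0.1)]
hosts = consolidate(tiers, slo_ms=25.0)
```

In this toy input, placing `db` next to `web` would push its predicted tail latency past the 25 ms target, so the heuristic opens a second host for it, while `cache` can share a host with `web`.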
cloud computing, resource allocation, virtual machines

K. Ye and C. Xu, "Reducing Tail Latency of Interactive Multi-tier Workloads in the Cloud," 2018 IEEE International Conference on Cluster Computing (CLUSTER), Belfast, United Kingdom, 2018, pp. 162-163.