Dependability and performance issues are strategic in Cloud computing, especially in business context where they often become mandatory. To investigate such issues and present the most recent findings to the scientific community we arranged this special issue on Cloud computing assessment: metrics, algorithms, policies, models, and evaluation techniques. Due to the complexity of the subject and the high number of qualified papers, the special issue was organized into two parts. In the first issue ( IEEE Transactions on Dependable and Secure Computing, Vol. 10, no 4, 2013) we highlighted the importance of Cloud security, dealing with aspects such as trustworthiness, privacy, security and vulnerabilities. Having in mind a serviceoriented, heterogeneous Cloud scenario, we also presented new techniques, countermeasures and threats to deal with such problems, both at the physical (hardware) and logical (software, applications, data) layers.
This second part of the special issue mainly deals with performance and dependability issues in Cloud computing, ranging from infrastructure to application, from resource management to programming models, from consolidation strategies to content management. With this special issue we intend to provide an (noncomprehensive) overview of Cloud computing security and dependability, with the aim to rise the attention of new researchers on this topic. Before entering into technical details, we would like to thank all the Authors, the Reviewers, the IEEE Transactions on Dependable and Secure Computing Editorial Office, with specific regards to Pam Gimzo and Erica Hardison, as well as the Editor in Chief Ravi Sandhu for their valuable support.
Part II is composed of the following five papers. The first paper “A Hierarchical Approach for the Resource Management of Very Large Cloud Platforms,” by Bernardetta Addis, Danilo Ardagna, Barbara Panicucci, Mark S. Squillante, and Li Zhang, focuses on performance/energy-driven resource management in Infrastructure-as-a-Service (IaaS) Cloud. More specifically, it implements resource allocation policies for the management of multitier virtualized Cloud systems, aiming at maximizing the provider incomes taking into account specific service level agreements while minimizing the infrastructure energy costs. A multiple time-scale hierarchical framework has been developed and tested also considering realistic workloads and management system interrelationships. The results obtained demonstrated the effectiveness and the scalability of the proposed technique.
The second paper, “BtrPlace: A Flexible Consolidation Manager for Highly Available Applications,”by Fabien Hermenier, Julia Lawall, and Gilles Muller, deals with consolidation of virtual machines into datacenter considering placement constraints on functional and nonfunctional properties. The placement constraints are expressed by specific configuration scripts, interpreted on the y to extend a composable reconguration algorithm that is used to x nonviable placements. In depth experiments have been performed to demonstrate the flexibility of scripts and the effectiveness of the consolidation solution proposed. The results obtained shown adequate performance, fault tolerance and scalability of the approach adopted.
The third paper “A Cloud-Oriented Content Delivery Network Paradigm: Modeling and Assessment,” by Chrysa Papagianni, Aris Leivadeas, and Symeon Papavassiliou, characterizes the Cloud in the content delivery scenario, aiming at provisioning content delivery networks (CDNs) deployed on Cloud infrastructure. The problem is approachedby decomposition into graph partitioning and replica placement subproblems, specifying a framework to identify potential customers of a Cloud CDN mappedinto abstract content distribution graph and then investigated through graph partitioning heuristics to evaluate performance and costs. The paper also proposes a solution for replica management inspired by social network. The proposed approaches have been evaluated in the paper through specific models and simulation.
The fourth paper “On the Performance of Byzantine Fault-Tolerant MapReduce,” by Pedro Costa, Marcelo Pasin, Alysson Bessani, and Miguel P. Correia, proposes a fault tolerant algorithm to address Byzantine fault in MapReduce. The algorithm executes each task more than once and then compares the results obtained by the different executions disregarding nonmatching outputs. The effectiveness of this algorithm has been proven through in depth evaluation based on both analytical model and real experiments.
To optimize MapReduce application performance is also the objective of the last paper of the special issue Part II “Orchestrating an Ensemble of MapReduce Jobs for Minimizing Their Makespan,” by Abhishek Verma, Ludmila Cherkasova, and Roy H. Campbell. In particular a set of production MapReduce jobs periodically executed on new data is considered with the aim of implementing a scheduler minimizing the job completion time and maximizing the resource utilization. An extensive set of simulations on realistic workloads has been performed to evaluate the performance of a scheduler based on the Johnsons algorithm. To overcome the limitations of this algorithm a novel heuristic is proposed, demonstrating by simulation it overperforms the Johnsons one.
We believe that the papers included in Part II of this special issue provide a good, even if partial, overview of the state-of-the-art research on Cloud computing dependability and performance, addressing several open issues on the topic. All the papers included in this issue provide both significant advances to the state of the art and practical guidelines on how to deal with these problems on real applications.
Kishor S. Trivedi
S. Distefano is with the Politecnico di Milano, Piazza L. da Vinci 32, 20133 Milano, Italy. E-mail: firstname.lastname@example.org.
A. Puliafito is with the University of Messina, Contrada di Dio, 98166 Messina, Italy. E-mail: email@example.com.
K.S. Trivedi is with the Duke University, 27708 Durham, NC, USA. E-mail: firstname.lastname@example.org.
For information on obtaining reprints of this paper, please send e-mail to: email@example.com.
received, in October 2001, the master's degree in computer engineering from the University of Catania. In 2006 he achieved the PhD degree on advanced technologies for the information engineering from the University of Messina. His research interests include stochastic modeling, performance evaluation, reliability techniques, parallel and distributed computing and software engineering. During his research activity he participated to the development of the WebSPN, ArgoPerformance and GS3 tools. He has been involved in several national and international research projects. He is member of international conference committees and he is on the editorial boards of several international journals on dependability and distributed computing topics. At this time, he is an assistant professor at Politecnico di Milano. He is a member of the IEEE.
is a full professor of computer engineering at the University of Messina, Italy. His interests include parallel and distributed systems, networking, wireless and Cloud computing. He has contributed to the development of the software tools WebSPN and ArgoPerformance. He is coauthor (with R. Sahner and K.S. Trivedi) of the text entitled Performance and Reliability Analysis of Computer Systems: An Example-Based Approach Using the SHARPE Software Package, edited by Kluwer Academic Publishers. He is also the responsible of two big Grid Projects (TriGrid VL and PI2S2) funded by the Sicilian Regional Government and by the Italian MIUR, respectively. He is currently a member of the general assembly and of the technical committee of the Reservoir and Vision, IP projects funded from the EU to explore the deployment and management of IT services and data across different administrative domains. He is also the main investigator of the Italian PRIN2008 project Cloud@Home, trying to combine cloud and volunteer computing. He is a member of the IEEE.
Kishor S. Trivedi
holds the Hudson Chair in the Department of Electrical and Computer Engineering at Duke University, Durham, North Carolina. He was the Duke-Site Director of an National Science Foundation Industry-University Cooperative Research Center between NC State University and Duke University for carrying out applied research in computing and communications. He has served as a principal investigator on various AFOSR, ARO, Burroughs, DARPA, Draper Lab, NEC, IBM, DEC, Alcatel, Telcordia, Motorola, NASA, NIH, ONR, NSWC, Boeing, Union Switch and Signals, NSF, Cisco, Huawei, NATO, JPL and SPC funded projects and as a consultant to industry and research laboratories. He was an editor of the IEEE Transactions on Computers
from 1983-1987. He was on the editorial board of the IEEE Transactions on Dependable and Secure Systems
. He is a codesigner of NASA's HARP, IBM's SAVE, SHARPE, SPNP, Boeing's IRAP and SREPT modeling packages. He is the author of a well known text entitled, Probability and Statistics with Reliability, Queuing and Computer Science Applications. He is a fellow of the IEEE.