This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
2011 International Green Computing Conference and Workshops
DASCA: Data Aware Scaling Down to provide power proportionality for distributed data processing frameworks
Orlando, FL
July 25-July 28
ISBN: 978-1-4577-1222-7
Hyeong S. Kim, Sch. of Comput. Sci. & Eng., Seoul Nat. Univ., Seoul, South Korea
Dong In Shin, Sch. of Comput. Sci. & Eng., Seoul Nat. Univ., Seoul, South Korea
Young Jin Yu, Sch. of Comput. Sci. & Eng., Seoul Nat. Univ., Seoul, South Korea
Hyeonsang Eom, Sch. of Comput. Sci. & Eng., Seoul Nat. Univ., Seoul, South Korea
Heon Y. Yeom, Sch. of Comput. Sci. & Eng., Seoul Nat. Univ., Seoul, South Korea
Distributed systems have led to the adoption of cloud computing concepts among countless enterprises. A large number of companies have already benefited from delegating IT services to cloud service providers. At the same time, the interest on energy efficiency has dramatically increased. Energy efficiency in large distributed systems is a big concern for system engineers. In addition, the proliferation of distributed data processing frameworks such as MapReduce have led to a vast amount of research and practices. In this paper, we are particularly interested in providing energy proportionality for MapReduce. To provide energy proportionality, we propose Data Aware Scaling Down (DASCA), a scaling down framework for MapReduce. There are two problems we must address in order to support scaling down for MapReduce. The first is to choose a proper set of nodes to suspend, which we call candidate set. The second is to minimize the replica redistribution which occurs during the initiation of power save mode. To address these problems, we use the data awareness of the MapReduce framework. To address the first problem, we provide two greedy algorithms which exploit the data awareness of MapReduce. To address the second problem, we propose locality aware replica redistribution to efficiently redistribute the lost replicas while preserving the availability of replicas and performance of distributed processing.
Index Terms:
distributed processing, DASCA, data aware scaling down, power proportionality, distributed data processing frameworks, cloud computing, IT services, cloud service providers, energy efficiency, distributed systems, energy proportionality, MapReduce framework, replica redistribution, power save mode
Citation:
Hyeong S. Kim, Dong In Shin, Young Jin Yu, Hyeonsang Eom, Heon Y. Yeom, "DASCA: Data Aware Scaling Down to provide power proportionality for distributed data processing frameworks," igcc, pp.1-8, 2011 International Green Computing Conference and Workshops, 2011
Usage of this product signifies your acceptance of the Terms of Use.