The Community for Technology Leaders
2015 31st Symposium on Mass Storage Systems and Technologies (MSST) (2015)
Santa Clara, CA, USA
May 30, 2015 to June 5, 2015
ISBN: 978-1-4673-7619-8
pp: 1-12
Wei Zhang , University of California at Santa Barbara
Daniel Agun , University of California at Santa Barbara
Tao Yang , University of California at Santa Barbara
Rich Wolski , University of California at Santa Barbara
Hong Tang , Alibaba Inc.
ABSTRACT
Data deduplication is important for snapshot backup of virtual machines (VMs) because of excessive redundant content. Fingerprint search for source-side duplicate detection is resource intensive when the backup service for VMs is co-located with other cloud services. This paper presents the design and analysis of a fast VM-centric backup service with a tradeoff for a competitive deduplication efficiency while using small computing resources, suitable for running on a converged cloud architecture that cohosts many other services. The design consideration includes VM-centric file system block management for the increased VM snapshot availability. This paper describes an evaluation of this VM-centric scheme to assess its deduplication efficiency, resource usage, and fault tolerance.
INDEX TERMS
Virtual machining, Cloud computing, Indexes, Servers, Computer architecture, Approximation methods
CITATION

W. Zhang, D. Agun, T. Yang, R. Wolski and H. Tang, "VM-centric snapshot deduplication for cloud data backup," 2015 31st Symposium on Mass Storage Systems and Technologies (MSST), Santa Clara, CA, USA, 2015, pp. 1-12.
doi:10.1109/MSST.2015.7208294
87 ms
(Ver 3.3 (11022016))