The Community for Technology Leaders
2016 IEEE International Symposium on Workload Characterization (IISWC) (2016)
Providence, RI, USA
Sept. 25, 2016 to Sept. 27, 2016
ISBN: 978-1-5090-3897-8
pp: 1-10
Min Fu , Wuhan National Lab for Optoelectronics School of Computer, Huazhong University of Science and Technology, Wuhan, China
Patrick P. C. Lee , Dept. of Computer Science and Engineering, The Chinese University of Hong Kong, HK
Dan Feng , Wuhan National Lab for Optoelectronics School of Computer, Huazhong University of Science and Technology, Wuhan, China
Zuoning Chen , National Engineering Research Center for Parallel Computer, Beijing, China
Yu Xiao , Wuhan National Lab for Optoelectronics School of Computer, Huazhong University of Science and Technology, Wuhan, China
ABSTRACT
Deduplication has been widely used to improve storage efficiency in modern primary and secondary storage systems, yet how deduplication fundamentally affects storage system reliability remains debatable. This paper aims to analyze and compare storage system reliability with and without deduplication in primary workloads using real-world file system snapshots. Specifically, we propose a trace-driven, deduplicationaware simulation framework that analyzes data loss in both chunk and file levels due to sector errors and whole-disk failures. Compared to without deduplication, our analysis shows that deduplication consistently reduces the damage of sector errors due to intra-file redundancy elimination, but potentially increases the damages of whole-disk failures if the highly referenced chunks are not carefully placed on disk. To improve reliability, we examine a deliberate copy technique that stores and repairs first the most referenced chunks in a small dedicated physical area (e.g., 1% of the physical capacity), and demonstrate its effectiveness through our simulation framework.
INDEX TERMS
Maintenance engineering, Metadata, Redundancy, Measurement, Analytical models, Layout
CITATION

M. Fu, P. P. C. Lee, D. Feng, Z. Chen and Y. Xiao, "A simulation analysis of reliability in primary storage deduplication," 2016 IEEE International Symposium on Workload Characterization (IISWC), Providence, RI, USA, 2016, pp. 1-10.
doi:10.1109/IISWC.2016.7581280
93 ms
(Ver 3.3 (11022016))