Issue No. 01 - Jan. (2014 vol. 63)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TC.2013.167
Henry C.H. Chen, The Chinese University of Hong Kong, Hong Kong
Yuchong Hu, The Chinese University of Hong Kong, Hong Kong
Patrick P.C. Lee, The Chinese University of Hong Kong, Hong Kong
Yang Tang, Columbia University, New York and The Chinese University of Hong Kong, Hong Kong
To provide fault tolerance for cloud storage, recent studies propose striping data across multiple cloud vendors. However, if a cloud suffers a permanent failure and loses all its data, the lost data must be repaired with the help of the surviving clouds to preserve data redundancy. We present NCCloud, a proxy-based storage system for fault-tolerant multiple-cloud storage that achieves cost-effective repair for a permanent single-cloud failure. NCCloud is built on top of a network-coding-based storage scheme called functional minimum-storage regenerating (FMSR) codes, which maintain the same fault tolerance and data redundancy as traditional erasure codes (e.g., RAID-6) but use less repair traffic and hence incur less monetary cost for data transfer. One key design feature of our FMSR codes is that we relax the encoding requirement of storage nodes during repair, while preserving the benefits of network coding in repair. We implement a proof-of-concept prototype of NCCloud and deploy it atop both local and commercial clouds. We validate that FMSR codes provide significant monetary cost savings in repair over RAID-6 codes, while having comparable response time performance in normal cloud storage operations such as upload/download.
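The repair-traffic claim can be illustrated with a small calculation. The sketch below is not taken from the abstract: it assumes a double-fault-tolerant layout with n clouds and k = n - 2, in which a conventional RAID-6 repair downloads k chunks of size M/k (the whole file of size M), while an FMSR repair reads one chunk of size M/(2k) from each of the n - 1 surviving clouds. The function names are hypothetical, and sizes are expressed as fractions of the file size.

```python
from fractions import Fraction


def _check(n: int) -> int:
    """Return k = n - 2, assuming a double-fault-tolerant layout (assumption)."""
    if n < 3:
        raise ValueError("need at least 3 clouds so that k = n - 2 >= 1")
    return n - 2


def raid6_repair_traffic(n: int, file_size: Fraction = Fraction(1)) -> Fraction:
    """Conventional repair: download k chunks of size M/k, i.e. the whole file."""
    k = _check(n)
    return k * (file_size / k)


def fmsr_repair_traffic(n: int, file_size: Fraction = Fraction(1)) -> Fraction:
    """Assumed FMSR repair: one chunk of size M/(2k) from each of n - 1 survivors."""
    k = _check(n)
    chunk = file_size / (2 * k)  # each cloud is assumed to hold two such chunks
    return (n - 1) * chunk


def repair_saving(n: int) -> Fraction:
    """Fraction of repair traffic saved by FMSR relative to RAID-6."""
    return 1 - fmsr_repair_traffic(n) / raid6_repair_traffic(n)


if __name__ == "__main__":
    for n in (4, 6, 8, 10):
        print(n, fmsr_repair_traffic(n), repair_saving(n))
```

Under these assumptions, both schemes store M/k per cloud, so redundancy is unchanged, and the saving grows with n while staying below one half.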
Maintenance engineering, Cloud computing, Fault tolerance, Fault tolerant systems, Encoding, Transient analysis, Network coding
H. C. Chen, Y. Hu, P. P. Lee and Y. Tang, "NCCloud: A Network-Coding-Based Storage System in a Cloud-of-Clouds," in IEEE Transactions on Computers, vol. 63, no. 1, pp. 31-44, Jan. 2014.