Issue No.12 - Dec. (2012 vol.23)
Yunghsiang S. Han , National Taiwan University of Science and Technology, Taipei
Soji Omiwade , University of Houston, Houston
Rong Zheng , University of Houston, Houston
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TPDS.2012.67
We propose a decentralized progressive data retrieval (PDR) mechanism for data reconstruction in a network of Byzantine and crash-stop nodes. The scheme progressively retrieves stored data, such that it achieves the minimum communication cost possible. In particular, PDR gracefully adapts the cost of successful data retrieval to the number of Byzantine and crash-stop storage nodes. At the core of PDR is an incremental Reed-Solomon decoding (IRD) procedure that is highly computation efficient for data reconstruction. IRD's computation efficiency arises from its ability to utilize intermediate computation results. In addition, we provide an in-depth analysis of PDR and compare it to decentralized erasure coding and decentralized fountain coding algorithms for distributed storage systems. Moreover, our implementation results show that PDR has up to 35 times lower computation time over the state-of-the-art error-erasure decoding scheme for distributed storage systems. In our analysis, we also show that the code structure of PDR and the number of available storage nodes are independent of each other, and they can be used to control both the data dissemination and retrieval complexity.
Decoding, Reed Solomon codes, Encoding, Distributed databases, Computer crashes, Fault tolerance, byzantine failure, Fault tolerance, error control codes, Reed-Solomon codes
Yunghsiang S. Han, Soji Omiwade, Rong Zheng, "Progressive Data Retrieval for Distributed Networked Storage", IEEE Transactions on Parallel & Distributed Systems, vol.23, no. 12, pp. 2303-2314, Dec. 2012, doi:10.1109/TPDS.2012.67