2013 International Conference on Parallel and Distributed Systems (ICPADS) (2013)
Seoul, Korea (South)
Dec. 15, 2013 to Dec. 18, 2013
Most recent distributed file systems have adopted architecture with an independent metadata server cluster. However, potential multiple hotspots and flash crowds access patterns often cause a metadata service that violates performance Service Level Objectives. To maximize the throughput of the metadata service, an adaptive request load balancing framework is critical. We present a distributed cache framework above the distributed metadata management schemes to manage hotspots rather than managing all metadata to achieve request load balancing. This benefits the metadata hierarchical locality and the system scalability. Compared with data, metadata has its own distinct characteristics, such as small size and large quantity. The cost of useless metadata prefetching is much less than data prefetching. In light of this, we devise a time period-based prefetching strategy and a perfecting-based adaptive replacement cache algorithm to improve the performance of the distributed caching layer to adapt constantly changing workloads. Finally, we evaluate our approach with a hadoop distributed file system cluster.
Prefetching, Load management, Monitoring, Load modeling, Servers, Smoothing methods, Accuracy
Y. Sun, J. Liu, D. Ye and H. Zhong, "A Distributed Cache Framework for Metadata Service of Distributed File Systems," 2013 International Conference on Parallel and Distributed Systems (ICPADS), Seoul, Korea (South), 2013, pp. 51-58.