2012 IEEE Seventh International Conference on Networking, Architecture, and Storage (2012)
Xiamen, China China
June 28, 2012 to June 30, 2012
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/NAS.2012.32
Caching is widely known to be an effective method for improving I/O performance by storing frequently used data on higher speed storage components. However, most existing studies that focus on caching performance evaluate fairly small files populating a relatively small cache. Few reports are available that detail the performance of traditional cache replacement policies on extremely large caches. Do such traditional caching policies still work effectively when applied to systems with petabytes of data? In this paper, we comprehensively evaluate the performance of several cache policies, which include First-In-First-Out (FIFO), Least Recently Used (LRU) and Least Frequently Used (LFU), on the global satellite imagery distribution application maintained by the U.S. Geological Survey (USGS) Earth Resources Observation and Science Center (EROS). Evidence is presented suggesting traditional caching policies are capable of providing performance gains when applied to large data sets as with smaller data sets. Our evaluation is based on approximately three million real-world satellite images download requests representing global user download behavior since October 2008.
R. Fares, B. Romoser, Z. Zong, M. Nijim and X. Qin, "Performance Evaluation of Traditional Caching Policies on a Large System with Petabytes of Data," 2012 IEEE 7th International Conference on Networking, Architecture, and Storage (NAS), Xiamen, Fujian, 2012, pp. 227-234.