Issue No.12 - December (1998 vol.47)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/12.737683
<p><b>Abstract</b>—High instruction cache hit rates are key to high performance. One known technique to improve the hit rate of caches is to minimize cache interference by improving the layout of the basic blocks of the code. However, the performance impact of this technique has been reported for application code only, even though there is evidence that the operating system often uses the cache heavily and with less uniform patterns than applications. It is unknown how well existing optimizations perform for systems code and whether better optimizations can be found. We address this problem in this paper. This paper characterizes, in detail, the locality patterns of the operating system code and shows that there is substantial locality. Unfortunately, caches are not able to extract much of it: Rarely-executed special-case code disrupts spatial locality, loops with few iterations that call routines make loop locality hard to exploit, and plenty of loop-less code hampers temporal locality. Based on our observations, we propose an algorithm to expose these localities and reduce interference in the cache. For a range of cache sizes, associativities, lines sizes, and organizations, we show that we reduce total instruction miss rates by 31-86 percent, or up to 2.9 absolute points. Using a simple model, this corresponds to execution time reductions of the order of 10-25 percent. In addition, our optimized operating system combines well with optimized and unoptimized applications.</p>
Cache miss rates, instruction caches, code layout optimization.
Josep Torrellas, Chun Xia, Russell L. Daigle, "Optimizing the Instruction Cache Performance of the Operating System", IEEE Transactions on Computers, vol.47, no. 12, pp. 1363-1381, December 1998, doi:10.1109/12.737683