Issue No. 03 - March (2010 vol. 59)
ISSN: 0018-9340
pp: 332-344
Yuho Jin, University of Southern California, Los Angeles
Eun Jung Kim, Texas A&M University, College Station
Ki Hwan Yum, University of Texas at San Antonio, San Antonio
ABSTRACT
Switched networks have been adopted in on-chip communication for their scalability and efficient resource sharing. However, using a general network for a specific domain may result in unnecessarily high cost and low performance when the interconnects are not optimized for the domain. Designing an optimal network for a specific domain is challenging because it requires in-depth knowledge of both the interconnects and the application domain. Recently proposed Nonuniform Cache Architectures (NUCAs) use wormhole-routed 2D mesh networks in L2 caches. We observe that in NUCAs, network resources are underutilized even though they incur a considerable area cost (41 percent of cache), and that the network delay is significant (63 percent of cache access time). Motivated by these observations, we investigate both router architecture and network topology for the communication behaviors of large-scale cache systems. We present Fast-LRU replacement, in which cache replacement overlaps with data request delivery. Next, we propose a deadlock-free XYX routing algorithm for a mesh network and present a new halo network topology that reduces the number of required links. Finally, we introduce a single-cycle multicast router that requires only a small modification to the unicast router design. Simulation results show that our design improves the average IPC by 38 percent over the mesh design with Multicast Promotion replacement and uses 12 percent of the interconnection area of the mesh network.
INDEX TERMS
On-chip interconnection networks, nonuniform cache architecture, domain-specific design.
CITATION
Yuho Jin, Eun Jung Kim, Ki Hwan Yum, "Design and Analysis of On-Chip Networks for Large-Scale Cache Systems", IEEE Transactions on Computers, vol. 59, no. 3, pp. 332-344, March 2010, doi:10.1109/TC.2009.130