|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
2012 IEEE 20th International Symposium on Field-Programmable Custom Computing Machines
Impact of Cache Architecture and Interface on Performance and Area of FPGA-Based Processor/Parallel-Accelerator Systems
Toronto, Ontario Canada
April 29-May 01
ISBN: 978-0-7695-4699-5
| ASCII Text | x | ||
| Jongsok Choi, Kevin Nam, Andrew Canis, Jason Anderson, Stephen Brown, Tomasz Czajkowski, "Impact of Cache Architecture and Interface on Performance and Area of FPGA-Based Processor/Parallel-Accelerator Systems," Field-Programmable Custom Computing Machines, Annual IEEE Symposium on, pp. 17-24, 2012 IEEE 20th International Symposium on Field-Programmable Custom Computing Machines, 2012. | |||
| BibTex | x | ||
| @article{ 10.1109/FCCM.2012.13, author = {Jongsok Choi and Kevin Nam and Andrew Canis and Jason Anderson and Stephen Brown and Tomasz Czajkowski}, title = {Impact of Cache Architecture and Interface on Performance and Area of FPGA-Based Processor/Parallel-Accelerator Systems}, journal ={Field-Programmable Custom Computing Machines, Annual IEEE Symposium on}, volume = {0}, year = {2012}, isbn = {978-0-7695-4699-5}, pages = {17-24}, doi = {http://doi.ieeecomputersociety.org/10.1109/FCCM.2012.13}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - CONF JO - Field-Programmable Custom Computing Machines, Annual IEEE Symposium on TI - Impact of Cache Architecture and Interface on Performance and Area of FPGA-Based Processor/Parallel-Accelerator Systems SN - 978-0-7695-4699-5 SP17 EP24 A1 - Jongsok Choi, A1 - Kevin Nam, A1 - Andrew Canis, A1 - Jason Anderson, A1 - Stephen Brown, A1 - Tomasz Czajkowski, PY - 2012 VL - 0 JA - Field-Programmable Custom Computing Machines, Annual IEEE Symposium on ER - | |||
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/FCCM.2012.13
We describe new multi-ported cache designs suitable for use in FPGA-based processor/parallel-accelerator systems, and evaluate their impact on application performance and area. The baseline system comprises a MIPS soft processor and custom hardware accelerators with a shared memory architecture: on-FPGA L1 cache backed by off-chip DDR2 SDRAM. Within this general system model, we evaluate traditional cache design parameters (cache size, line size, associativity). In the parallel accelerator context, we examine the impact of the cache design and its interface. Specifically, we look at how the number of cache ports affects performance when multiple hardware accelerators operate (and access memory) in parallel, and evaluate two different hardware implementations of multi-ported caches using: 1) multi-pumping, and 2) a recently-published approach based on the concept of a live-value table. Results show that application performance depends strongly on the cache interface and architecture: for a system with 6 accelerators, depending on the cache design, speed up swings from 0.73�? to 6.14�?, on average, relative to a baseline sequential system (with a single accelerator and a direct-mapped, 2KB cache with 32B lines). Considering both performance and area, the best architecture is found to be a 4-port multi-pump direct-mapped cache with a 16KB cache size and a 128B line size.
Citation:
Jongsok Choi, Kevin Nam, Andrew Canis, Jason Anderson, Stephen Brown, Tomasz Czajkowski, "Impact of Cache Architecture and Interface on Performance and Area of FPGA-Based Processor/Parallel-Accelerator Systems," fccm, pp.17-24, 2012 IEEE 20th International Symposium on Field-Programmable Custom Computing Machines, 2012
Usage of this product signifies your acceptance of the Terms of Use.
