The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.08 - August (2010 vol.59)
pp: 1033-1046
Hongzhong Zheng , University of Illinois at Chicago, Chicago
Zhichun Zhu , University of Illinois at Chicago, Chicago
ABSTRACT
DRAM memory is playing an increasingly important role in the overall power profile of latest-generation servers with multicore processors. With many power saving techniques adopted into processor design, memory power consumption can now exceed processor power consumption when a system runs memory-intensive workloads. There is an urgent need to fully evaluate the memory power profile of contemporary DRAM memories and to re-investigate DRAM memory designs, configurations, and optimizations from both power and performance perspectives. This study fills the gap by studying the performance and power consumption of multicore systems with DDR3 memory under different configurations. It includes comprehensive results regarding memory power breakdown, including background, operation, read/write, and I/O power, as well as performance. Comparisons with DDR2 and FB-DIMM are also included. The results show clearly that DRAM system configurations, including page policy, power mode, device configuration, burst length, channel organization, and the selection of DRAM technology, affects the memory power consumption significantly besides the performance. The optimal choice of some configurations is application-dependent, suggesting that reconfigurable or hybrid configurations are worth further studies.
INDEX TERMS
Multicore processors, DRAM systems, power, performance.
CITATION
Hongzhong Zheng, Zhichun Zhu, "Power and Performance Trade-Offs in Contemporary DRAM System Designs for Multicore Processors", IEEE Transactions on Computers, vol.59, no. 8, pp. 1033-1046, August 2010, doi:10.1109/TC.2010.108
REFERENCES
[1] J. Lin, H. Zheng, Z. Zhu, E. Gorbatov, H. David, and Z. Zhang, "Software Thermal Management of DRAM Memory for Multicore Systems," Proc. ACM SIGMETRICS, pp. 337-348, 2008.
[2] V. Cuppu, B. Jacob, B. Davis, and T. Mudge, "A Performance Comparison of Contemporary DRAM Architectures," Proc. 26th Int'l Symp. Computer Architecture, pp. 222-233, 1999.
[3] V. Cuppu and B. Jacob, "Concurrency, Latency, or System Overhead: Which Has the Largest Impact on Uniprocessor DRAM-System Performance?" Proc. 28th Int'l Symp. Computer Architecture, pp. 62-71, June 2001.
[4] D.T. Wang, "Modern DRAM Memory Systems: Performance Analysis and a High Performance, Power-Constrained DRAM-Scheduling Algorithm," PhD dissertation, Dept. of Electrical and Computer Eng., Univ. of Maryland at College Park, 2005.
[5] B. Ganesh, A. Jaleel, D. Wang, and B. Jacob, "Fully-Buffered DIMM Memory Architectures: Understanding Mechanisms, Overheads and Scaling" Proc. 13th Int'l Symp. High-Performance Computer Architecture, pp. 109-120, Feb. 2007.
[6] A.R. Lebeck, X. Fan, H. Zeng, and C. Ellis, "Power Aware Page Allocation," Proc. Ninth Int'l Conf. Architectural Support for Programming Languages and Operating Systems, pp. 105-116, 2000.
[7] V. Delaluz, M. Kandemir, N. Vijaykrishnan, A. Sivasubramaniam, and M.J. Irwin, "DRAM Energy Management Using Software and Hardware Directed Power Mode Control," Proc. Seventh Int'l Symp. High-Performance Computer Architecture, 2001.
[8] H. Huang, P. Pillai, and K.G. Shin, "Design and Implementation of Power-Aware Virtual Memory," Proc. 2003 USENIX Ann. Technical Conf., pp. 57-70, 2003.
[9] X. Li, Z. Li, F. David, P. Zhou, Y. Zhou, S. Adve, and S. Kumar, "Performance Directed Energy Management for Main Memory and Disks," Proc. 11th Int'l Conf. Architectural Support for Programming Languages and Operating Systems, pp. 271-283, 2004.
[10] H. Huang, K.G. Shin, C. Lefurgy, and T. Keller, "Improving Energy Efficiency by Making DRAM Less Randomly Accessed," Proc. 2005 Int'l Symp. Low Power Electronics and Design, pp. 393-398, 2005.
[11] B. Diniz, D. Guedes, J. Wagner Meira, and R. Bianchini, "Limiting the Power Consumption of Main Memory," Proc. 34th Int'l Symp. Computer Architecture, pp. 290-301, 2007.
[12] H. Zheng, J. Lin, Z. Zhang, E. Gorbatov, H. David, and Z. Zhu, "Mini-Rank: Adaptive DRAM Architecture for Improving Memory Power Efficiency," Proc. 41st Int'l Symp. Microarchitecture, pp. 210-221, Nov. 2008.
[13] Micron Technology, Inc. "TN-41-01: Calculating Memory System Power for DDR3," http://download.micron.com/pdf/technotes/ ddr3TN41_01DDR3%20Power.pdf, Aug. 2007.
[14] Micron Technology, Inc. "MT41J128M8BY-187E," http:// download.micron.com/pdf/datasheets/ dram/ddr31Gb%20DDR3%20SDRAM.pdf, 2009.
[15] Z. Zhang, Z. Zhu, and X. Zhang, "A Permutation-Based Page Interleaving Scheme to Reduce Row-Buffer Conflicts and Exploit Data Locality," Proc. 33rd Int'l Symp. Microarchitecture, pp. 32-41, 2000.
[16] W. Lin, S.K. Reinhardt, and D. Burger, "Reducing DRAM Latencies with an Integrated Memory Hierarchy Design," Proc. Seventh Int'l Symp. High-Performance Computer Architecture, pp. 301-312, Jan. 2001.
[17] S. Rixner, W.J. Dally, U.J. Kapasi, P. Mattson, and J.D. Owens, "Memory Access Scheduling," Proc. 27th Int'l Symp. Computer Architecture, pp. 128-138, June 2000.
[18] S.A. McKee and W.A. Wulf, "Access Ordering and Memory-Conscious Cache Utilization," Proc. First IEEE Symp. High-Performance Computer Architecture, pp. 253-262, Jan. 1995.
[19] I. Hur and C. Lin, "Adaptive History-Based Memory Schedulers," Proc. 37th Int'l Symp. Microarchitecture, pp. 343-354, Dec. 2004.
[20] Z. Zhu and Z. Zhang, "A Performance Comparison of DRAM Memory System Optimizations for SMT Processors," Proc. 11th Int'l Symp. High-Performance Computer Architecture, pp. 213-224, 2005.
[21] K.J. Nesbit, N. Aggarwal, J. Laudon, and J.E. Smith, "Fair Queuing CMP Memory Systems," Proc. 39th Int'l Symp. Microarchitecture, pp. 208-222, Dec. 2006.
[22] O. Mutlu and T. Moscibroda, "Stall-Time Fair Memory Access Scheduling for Chip Multiprocessors," Proc. 40th Int'l Symp. Microarchitecture, Dec. 2007.
[23] Micron Technology, Inc. "DDR3 SDRAM System-Power Calculator," http://download.micron.com/downloads/misc ddr3_power_calc.xls, 2010.
[24] Micron Technology, Inc. "MT47H64M16HR-25E," http:// download.micron.com/pdf/datasheets/ dram/ddr21GbDDR2.pdf, 2010.
[25] N.L. Binkert, R.G. Dreslinski, L.R. Hsu, K.T. Lim, A.G. Saidi, and S.K. Reinhardt, "The m5 Simulator: Modeling Networked Systems," IEEE Micro, vol. 26, no. 4, pp. 52-60, July/Aug. 2006.
[26] Micron Technology, Inc. "DDR2 SDRAM System-Power Calculator," http://download.micron.com/downloads/misc ddr2_power_calc.xls, 2010.
[27] Intel Corp. "Intel Fully Buffered DIMM Specification Addendum," http://www.intel.com/technology/memory/FBDIMM/ specIntel_FBD_Spec_Addendum_rev_p9.pdf , 2006.
[28] T. Sherwood, E. Perelman, G. Hamerly, and B. Calder, "Automatically Characterizing Large Scale Program Behavior," Proc. 10th Int'l Conf. Architectural Support for Programming Languages and Operating Systems, pp. 45-57, Oct. 2002.
[29] A. Snavely, D.M. Tullsen, and G. Voelker, "Symbiotic Jobscheduling with Priorities for a Simultaneous Multithreading Processor," Proc. ACM SIGMETRICS, pp. 66-76, 2002.
[30] T.J. Dell, "A White Paper on the Benefits of Chipkill-Correct ECC for PC Server Main Memory," IBM Corporation, 1997.
[31] I. Hur and C. Lin, "A Comprehensive Approach to DRAM Power Management," Proc. 13th Int'l Symp. High-Performance Computer Architecure, 2008.
[32] W. Felter, K. Rajamani, T. Keller, and C. Rusu, "A Performance-Conserving Approach for Reducing Peak Power Consumption in Server Systems," Proc. 19th Int'l Conf. Supercomputing, pp. 293-302, 2005.
[33] X. Fan, W.-D. Weber, and L.A. Barroso, "Power Provisioning for a Warehouse-Sized Computer," Proc. 34th Int'l Symp. Computer Architecture, pp. 13-23, 2007.
[34] D. Burger, J.R. Goodman, and A. Kagi, "Memory Bandwidth Limitations of Future Microprocessors," Proc. 23rd Int'l Symp. Computer Architecture, pp. 78-89, 1996.
[35] X. Fan, C. Ellis, and A. Lebeck, "Memory Controller Policies for DRAM Power Management," Proc. 2001 Int'l Symp. Low Power Electronics and Design, pp. 129-134, 2001.
[36] V. Delaluz, A. Sivasubramaniam, M. Kandemir, N. Vijaykrishnan, and M.J. Irwin, "Scheduler-Based DRAM Energy Management," Proc. 39th Conf. Design Automation, pp. 697-702, 2002.
[37] P. Zhou, V. Pandey, J. Sundaresan, A. Raghuraman, Y. Zhou, and S. Kumar, "Dynamic Tracking of Page Miss Ratio Curve for Memory Management," Proc. 11th Int'l Conf. Architectural Support for Programming Languages and Operating Systems, pp. 177-188, 2004.
[38] V. Pandey, W. Jiang, Y. Zhou, and R. Bianchini, "DMA-Aware Memory Energy Management," Proc. 12th Int'l Symp. High-Performance Computer Architecture, pp. 133-144, Feb. 2006.
[39] M. Ghosh and H.-H.S. Lee, "Smart Refresh: An Enhanced Memory Controller Design for Reducing Energy in Conventional and 3D Die-Stacked DRAMs," Proc. 40th Int'l Symp. Microarchitecture, pp. 134-145, 2007.
39 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool