The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.03 - May-June (2013 vol.33)
pp: 16-27
Hadi Esmaeilzadeh , University of Washington
Adrian Sampson , University of Washington
Luis Ceze , University of Washington
Doug Burger , Microsoft Research
ABSTRACT
This work proposes an approximate algorithmic transformation and a new class of accelerators, called neural processing units (NPUs). NPUs leverage the approximate algorithmic transformation that converts regions of code from a Von Neumann model to a neural model. NPUs achieve an average 2.3× speedup and 3.0× energy savings for general-purpose approximate programs. This new class of accelerators shows that significant performance and efficiency gains are possible when the abstraction of full accuracy is relaxed in general-purpose computing.
INDEX TERMS
Computer architecture, Neural networks, Approximation algorithms, Algorithm design and analysis, Accelerators, NPUs, approximate computing, accelerators, neural networks, Parrot algorithmic transformation, neural processing units
CITATION
Hadi Esmaeilzadeh, Adrian Sampson, Luis Ceze, Doug Burger, "Neural Acceleration for General-Purpose Approximate Programs", IEEE Micro, vol.33, no. 3, pp. 16-27, May-June 2013, doi:10.1109/MM.2013.28
REFERENCES
1. H. Esmaeilzadeh et al., "Dark Silicon and the End of Multicore Scaling," Proc. 38th Ann. Int'l Symp. Computer Architecture (ISCA 11), ACM, 2011, pp. 365-376.
2. R. Hameed et al., "Understanding Sources of Inefficiency in General-Purpose Chips," Proc. 37th Ann. Int'l Symp. Computer Architecture (ISCA 10), ACM, 2010, pp. 37-47.
3. A. Sampson et al., "EnerJ: Approximate Data Types for Safe and General Low-Power Computation," Proc. 32nd ACM SIGPLAN Conf. Programming Language Design and Implementation (PLDI 11), 2011, pp. 164-174.
4. Y. Fang, H. Li, and X. Li, "A Fault Criticality Evaluation Framework of Digital Systems for Error Tolerant Video Applications," Proc. Asian Test Symp. (ATS 11), IEEE CS, 2011, pp. 329-334.
5. M. de Kruijf and K. Sankaralingam, "Exploring the Synergy of Emerging Workloads and Silicon Reliability Trends," Proc. IEEE 9th Workshop Silicon Errors in Logic—System Effects, 2009.
6. X. Li and D. Yeung, "Exploiting Soft Computing for Increased Fault Tolerance," Workshop Architectural Support for Gigascale Integration, 2006.
7. C. Alvarez et al., "Fuzzy Memoization for Floating-Point Multimedia Applications," IEEE Trans. Computers, July 2005, pp. 922-927.
8. M. de Kruijf, S. Nomura, and K. Sankaralingam, "Relax: An Architectural Framework for Software Recovery of Hardware Faults," Proc. 37th Ann. Int'l Symp. Computer Architecture (ISCA 10), 2010, pp. 497-508.
9. H. Esmaeilzadeh et al., "Architecture Support for Disciplined Approximate Programming," Proc. 17th Int'l Conf. Architectural Support for Programming Languages and Operating Systems, ACM, 2012, pp. 301-312.
10. L. Leem et al., "ERSA: Error Resilient System Architecture for Probabilistic Applications," Proc. Conf. Design, Automation and Test in Europe (DATE 10), European Design and Automation Assn., 2010, pp. 1560-1565.
11. L.N. Chakrapani et al., "Ultra-Efficient (Embedded) SOC Architectures Based on Probabilistic CMOS (PCMOS) Technology," Proc. Conf. Design, Automation and Test in Europe (DATE 06), European Design and Automation Assn., 2006, pp. 1110-1115.
12. S. Gupta et al., "Bundled Execution of Recurring Traces for Energy-Efficient General Purpose Processing," Proc. 44th Ann. IEEE/ACM Int'l Symp. Microarchitecture, ACM, 2011, pp. 12-23.
13. G. Venkatesh et al., "Conservation Cores: Reducing the Energy of Mature Computations," Proc. 15th Int'l Conf. Architectural Support for Programming Languages and Operating Systems, ACM, 2010, pp. 205-218.
14. V. Govindaraju et al., "Dynamically Specialized Datapaths for Energy Efficient Computing," Proc. IEEE 17th Int'l Symp. High Performance Computer Architecture, IEEE CS, 2011, pp. 503-514.
15. S. Sidiroglou-Douskos et al., "Managing Performance vs. Accuracy Trade-offs with Loop Perforation," Proc. 13th European Conf. Foundations of Software Eng., ACM, 2011, pp. 124-134.
16. W. Baek and T.M. Chilimbi, "Green: A Framework for Supporting Energy-Conscious Programming Using Controlled Approximation," Proc. ACM SIGPLAN Conf. Programming Language Design and Implementation (PLDI 10), ACM, 2010, pp. 198-209.
17. H. Esmaeilzadeh et al., "Neural Acceleration for General-Purpose Approximate Programs," Proc. 45th Ann. IEEE/ACM Int'l Symp. Microarchitecture, IEEE CS, 2012, pp. 449-460.
18. S. Misailovic et al., "Quality of Service Profiling," Proc. 32nd ACM/IEEE Int'l Conf. Software Engineering—Vol. 1 (ICSE 10), ACM, 2010, pp. 25-34.
19. A. Patel et al., "MARSSx86: A Full System Simulator for x86 CPUs," Proc. 48th ACM/EDAC/IEEE Design Automation Conf., IEEE CS, 2011, pp. 1050-1055.
20. S. Galal and M. Horowitz, "Energy-Efficient Floating-Point Unit Design," IEEE Trans. Computers, July 2011, pp. 913-922.
9 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool