The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.02 - March/April (2008 vol.28)
pp: 39-55
Erik Lindholm , NVIDIA
John Nickolls , NVIDIA
John Montrym , NVIDIA
ABSTRACT
To enable flexible, programmable graphics and high-performance computing, NVIDIA has developed the Tesla scalable unified graphics and parallel computing architecture. Its scalable parallel array of processors is massively multithreaded and programmable in C or via graphics APIs.
INDEX TERMS
Hot Chips 19, GPU, parallel processor, SIMT, SIMD, unified graphics and parallel computing architecture, graphics processing unit, cooperative thread array, Tesla
CITATION
Erik Lindholm, John Nickolls, Stuart Oberman, John Montrym, "NVIDIA Tesla: A Unified Graphics and Computing Architecture", IEEE Micro, vol.28, no. 2, pp. 39-55, March/April 2008, doi:10.1109/MM.2008.31
REFERENCES
1. J. Montrym and H. Moreton, "The GeForce 6800," IEEE Micro, vol. 25, no. 2, Mar./Apr. 2005, pp. 41-51.
2. CUDA Technology, NVIDIA, 2007, http://www.nvidia.comCUDA.
3. CUDA Programming Guide 1.1, NVIDIA, 2007; http://developer.download.nvidia.com/compute/ cuda/1_1NVIDIA_CUDA_Programming_Guide_1.1.pdf .
4. J. Nickolls, I. Buck, K. Skadron,, and M. Garland, "Scalable Parallel Programming with CUDA," ACM Queue, vol. 6, no. 2, Mar./Apr. 2008, pp. 40-53.
5. DX Specification, Microsoft; http://msdn.microsoft.comdirectx.
6. E. Lindholm, M.J. Kilgard, and H. Moreton, "A User-Programmable Vertex Engine," Proc. 28th Ann. Conf. Computer Graphics and Interactive Techniques (Siggraph 01), ACM Press, 2001, pp. 149-158.
7. G. Elder, "Radeon 9700," Eurographics/Siggraph Workshop Graphics Hardware, Hot 3D Session, 2002, http://www.graphicshardware.org/previous/ www_2002/presentationsHot3D-RADEON9700.ppt .
8. Microsoft DirectX 9 Programmable Graphics Pipeline, Microsoft Press, 2003.
9. J. Andrews and N. Baker, "Xbox 360 System Architecture," IEEE Micro, vol. 26, no. 2, Mar./Apr. 2006, pp. 25-37.
10. D. Blythe, "The Direct3D 10 System," ACM Trans. Graphics, vol. 25, no. 3, July 2006, pp. 724-734.
11. S.F. Oberman and M.Y. Siu, "A High-Performance Area-Efficient Multifunction Interpolator," Proc. 17th IEEE Symp. Computer Arithmetic (Arith-17), IEEE Press, 2005, pp. 272-279.
12. J.E. Stone et al., "Accelerating Molecular Modeling Applications with Graphics Processors," J. Computational Chemistry, vol. 28, no. 16, 2007, pp. 2618-2640.
13. L. Nyland and M. Harris, J. Prins, "Fast N-Body Simulation with CUDA," GPU Gems 3, H. Nguyen, ed. Addison-Wesley, 2007, pp. 677-695.
14. S.S. Stone et al., "How GPUs Can Improve the Quality of Magnetic Resonance Imaging," Proc. 1st Workshop on General Purpose Processing on Graphics Processing Units,, 2007; , http://www.gigascale.org/pubs1175.html.
15. A.L. Shimpi, and D. Wilson, "NVIDIA's GeForce 8800 (G80): GPUs Re-architected for DirectX 10," AnandTech, http://www.anandtech.com/videoshowdoc.aspx?i = 2870 . Nov. 2006.
399 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool