This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
NVIDIA Tesla: A Unified Graphics and Computing Architecture
March/April 2008 (vol. 28 no. 2)
pp. 39-55
Erik Lindholm, NVIDIA
John Nickolls, NVIDIA
John Montrym, NVIDIA
To enable flexible, programmable graphics and high-performance computing, NVIDIA has developed the Tesla scalable unified graphics and parallel computing architecture. Its scalable parallel array of processors is massively multithreaded and programmable in C or via graphics APIs.

1. J. Montrym and H. Moreton, "The GeForce 6800," IEEE Micro, vol. 25, no. 2, Mar./Apr. 2005, pp. 41-51.
2. CUDA Technology, NVIDIA, 2007, http://www.nvidia.comCUDA.
3. CUDA Programming Guide 1.1, NVIDIA, 2007; http://developer.download.nvidia.com/compute/ cuda/1_1NVIDIA_CUDA_Programming_Guide_1.1.pdf .
4. J. Nickolls, I. Buck, K. Skadron,, and M. Garland, "Scalable Parallel Programming with CUDA," ACM Queue, vol. 6, no. 2, Mar./Apr. 2008, pp. 40-53.
5. DX Specification, Microsoft; http://msdn.microsoft.comdirectx.
6. E. Lindholm, M.J. Kilgard, and H. Moreton, "A User-Programmable Vertex Engine," Proc. 28th Ann. Conf. Computer Graphics and Interactive Techniques (Siggraph 01), ACM Press, 2001, pp. 149-158.
7. G. Elder, "Radeon 9700," Eurographics/Siggraph Workshop Graphics Hardware, Hot 3D Session, 2002, http://www.graphicshardware.org/previous/ www_2002/presentationsHot3D-RADEON9700.ppt .
8. Microsoft DirectX 9 Programmable Graphics Pipeline, Microsoft Press, 2003.
9. J. Andrews and N. Baker, "Xbox 360 System Architecture," IEEE Micro, vol. 26, no. 2, Mar./Apr. 2006, pp. 25-37.
10. D. Blythe, "The Direct3D 10 System," ACM Trans. Graphics, vol. 25, no. 3, July 2006, pp. 724-734.
11. S.F. Oberman and M.Y. Siu, "A High-Performance Area-Efficient Multifunction Interpolator," Proc. 17th IEEE Symp. Computer Arithmetic (Arith-17), IEEE Press, 2005, pp. 272-279.
12. J.E. Stone et al., "Accelerating Molecular Modeling Applications with Graphics Processors," J. Computational Chemistry, vol. 28, no. 16, 2007, pp. 2618-2640.
13. L. Nyland and M. Harris, J. Prins, "Fast N-Body Simulation with CUDA," GPU Gems 3, H. Nguyen, ed. Addison-Wesley, 2007, pp. 677-695.
14. S.S. Stone et al., "How GPUs Can Improve the Quality of Magnetic Resonance Imaging," Proc. 1st Workshop on General Purpose Processing on Graphics Processing Units,, 2007; , http://www.gigascale.org/pubs1175.html.
15. A.L. Shimpi, and D. Wilson, "NVIDIA's GeForce 8800 (G80): GPUs Re-architected for DirectX 10," AnandTech, http://www.anandtech.com/videoshowdoc.aspx?i = 2870 . Nov. 2006.

Index Terms:
Hot Chips 19, GPU, parallel processor, SIMT, SIMD, unified graphics and parallel computing architecture, graphics processing unit, cooperative thread array, Tesla
Citation:
Erik Lindholm, John Nickolls, Stuart Oberman, John Montrym, "NVIDIA Tesla: A Unified Graphics and Computing Architecture," IEEE Micro, vol. 28, no. 2, pp. 39-55, March-April 2008, doi:10.1109/MM.2008.31
Usage of this product signifies your acceptance of the Terms of Use.