The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.04 - April (2008 vol.57)
pp: 490-504
ABSTRACT
A unified computation method of vector and transcendental function is proposed for handheld 3D graphics systems. It unifies vector operations like vector multiply, multiply-and-add, divide, divide-by-square-root and dot product and transcendental functions like trigonometric, inverse trigonometric, hyperbolic, inverse hyperbolic, power (x<sup>y</sup> with two variables) and logarithm to arbitrary base in a single four-way arithmetic platform. A number system called fixed-point hybrid number system (FXP-HNS), which combines the fixed-point number system (FXP) and logarithmic number system (LNS), is proposed for the power- and area-efficient unification. Low power and small area logarithmic and antilogarithmic conversion schemes are also proposed for the data conversion between FXP and LNS in the FXP-HNS and achieve 0.41 percent and 0.08 percent maximum conversion error, respectively. The unified arithmetic unit based on the proposed schemes is presented with fully pipelined architecture and achieves single cycle throughput with maximum four-cycle latency for all the supported operations.
INDEX TERMS
Computer Arithmetic, unified arithmetic unit, vector operations, transcendental functions, logarithmic number system, 3D computer graphics, handheld systems
CITATION
Byeong-Gyu Nam, Hyejung Kim, Hoi-Jun Yoo, "Power and Area-Efficient Unified Computation of Vector and Elementary Functions for Handheld 3D Graphics Systems", IEEE Transactions on Computers, vol.57, no. 4, pp. 490-504, April 2008, doi:10.1109/TC.2008.12
REFERENCES
[1] Khro nos Group, OpenGL-ES 2.0, http:/www.khronos.org, 2007.
[2] J.-H. Sohn, Y.-H. Park, C.-W. Yoon, R. Woo, S.-J. Park, and H.-J. Yoo, “Low-Power 3D Graphics Processors for Mobile Terminals,” IEEE Comm. Magazine, pp. 90-99, Dec. 2005.
[3] OpenGL ARB, OpenGL Programming Guide, third ed. Addison-Wesley, 1999.
[4] Analog Devices, ADSP-21000 Family Application Handbook, 1994.
[5] B.-G. Nam, H. Kim, and H.-J. Yoo, “A Low-Power Unified Arithmetic Unit for Programmable Handheld 3-D Graphics Systems,” Proc. IEEE Custom Integrated Circuits Conf., pp. 535-538, Sept. 2006.
[6] B.-G. Nam, H. Kim, and H.-J. Yoo, “A 210MHz 15mW Unified Vector and Transcendental Function Unit for Programmable Handheld 3-D Graphics Systems,” Proc. IEEE Asian Solid-State Circuits Conf., pp. 95-98, Nov. 2006.
[7] J.N. Mitchell Jr., “Computer Multiplication and Division Using Binary Logarithms,” IRE Trans. Electronic Computers, vol. 11, pp.512-517, Aug. 1962.
[8] R. Zhang, J.H. Han, A.T. Erdogan, and T. Arslan, “Low Power CORDIC IP Core Implementation,” Proc. Int'l Conf. Acoustic, Speech, and Signal Processing, pp. 956-959, May 2006.
[9] J.S. Walther, “A Unified Algorithm for Elementary Functions,” Proc. Spring Joint Computer Conf., pp. 379-385, 1971.
[10] W.F. Wong and E. Goto, “Fast Hardware-Based Algorithms for Elementary Function Computations Using Rectangular Multipliers,” IEEE Trans. Computers, vol. 43, no. 3, pp. 278-294, Mar. 1994.
[11] J.-A. Pineiro, S.F. Oberman, J.-M. Muller, and J.D. Bruguera, “High-Speed Function Approximation Using a Minimax Quadratic Interpolator,” IEEE Trans. Computers, vol. 54, no. 3, pp. 304-318, Mar. 2005.
[12] D. Das, K. Mukhopadhyaya, and B.P. Sinha, “Implementation of Four Common Functions on an LNS Co-Processor,” IEEE Trans. Computers, vol. 44, no. 1, pp. 155-161, Jan. 1995.
[13] D. Harris, “A Powering Unit for an OpenGL Lighting Engine,” Proc. 35th Asilomar Conf. Signals, Systems, and Computers, pp. 1641-1645, 2001.
[14] IEEE Standard for Binary Floating-Point Arithmetic, ANSI/IEEE Standard 754-1985, 1985.
[15] F.-S. Lai and C.-F.E. Wu, “A Hybrid Number System Processor with Geometric and Complex Arithmetic Capabilities,” IEEE Trans. Computers, vol. 40, no. 8, pp. 952-962, Aug. 1991.
[16] G.K. Kolli, “3D Graphics Optimization for ARM Architecture,” Proc. Game Developer Conf., 2002.
[17] J.-H. Sohn, R. Woo, and H.-J. Yoo, “A Programmable Vertex Shader with Fixed-Point SIMD Datapath for Low Power Wireless Applications,” Proc. ACM Siggraph/Eurographics Workshop Graphics Hardware, pp. 107-114, Aug. 2004.
[18] M.J. Schulte and E.E. Swartzlander Jr., “Hardware Designs for Exactly Rounded Elementary Functions,” IEEE Trans. Computers, vol. 43, no. 8, pp. 964-973, Aug. 1994.
[19] M. Combet, H. Zonneveld, and L. Verbeek, “Computation of the Base Two Logarithm of Binary Numbers,” IEEE Trans. Electronic Computers, vol. 14, pp. 863-867, Dec. 1965.
[20] E.L. Hall, D.D. Lynch, and S.J. Dwyer III, “Generation of Products and Quotients Using Approximate Binary Logarithms for Digital Filtering Applications,” IEEE Trans. Computers, vol. 19, no. 2, pp.97-105, Feb. 1970.
[21] K.H. Abed and R.E. Siferd, “CMOS VLSI Implementation of a Low-Power Logarithmic Converter,” IEEE Trans. Computers, vol. 52, no. 11, pp. 1421-1433, Nov. 2003.
[22] V. Oklobdzija, “An Algorithmic and Novel Design of a Leading Zero Detector Circuit: Comparison with Logic Synthesis,” IEEE Trans. VLSI Systems, vol. 2, no. 1, pp. 124-128, Mar. 1994.
[23] K.H. Abed and R.E. Siferd, “VLSI Implementation of a Low-Power Antilogarithmic Converter,” IEEE Trans. Computers, vol. 52, no. 9, pp. 1221-1228, Sept. 2003.
[24] Microsoft Corp., Microsoft DirectX Technology Overview, http://www.microsoft.com/windowsdirectx, 2007.
[25] J.-A. Pineiro and J.D. Bruguera, “High-Speed Double-Precision Computation of Reciprocal, Division, Square Root, and Inverse Square Root,” IEEE Trans. Computers, vol. 51, no. 12, pp. 1377-1388, Dec. 2002.
41 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool