Issue No.02 - February (2012 vol.61)
pp: 284-288
Earl E. Swartzlander, University of Texas at Austin, Austin
Hani H.M. Saleh, Intel, Austin
This paper describes two fused floating-point operations and applies them to the implementation of fast Fourier transform (FFT) processors. The fused operations are a two-term dot product and an add-subtract unit. FFT processors use "butterfly" operations that consist of multiplications, additions, and subtractions of complex-valued data. Both radix-2 and radix-4 butterflies are implemented efficiently with the two fused floating-point operations. When placed and routed using a high-performance standard cell technology, the fused FFT butterflies are about 15 percent faster and 30 percent smaller than a conventional implementation. In addition, the numerical results of the fused implementations are slightly more accurate, since they use fewer rounding operations.
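To make the two fused operations concrete, the following is a minimal behavioral sketch in Python, not the authors' hardware design. It assumes double-precision values, finite inputs, and a radix-2 decimation-in-time butterfly (X = x + w*y, Y = x - w*y); the function names fused_dot2, fused_add_sub, and radix2_butterfly are hypothetical. The single rounding of the dot product is modeled by computing the exact result with rational arithmetic and rounding once at the end.

```python
from fractions import Fraction


def fused_dot2(a, b, c, d):
    """Model of a fused two-term dot product: a*b + c*d, rounded once.

    Assumption: finite inputs. The exact value is formed with rational
    arithmetic, then converted to a double in a single rounding step.
    """
    exact = Fraction(a) * Fraction(b) + Fraction(c) * Fraction(d)
    return float(exact)


def fused_add_sub(a, b):
    """Model of a fused add-subtract unit: returns (a + b, a - b)."""
    return a + b, a - b


def radix2_butterfly(x, y, w):
    """Radix-2 butterfly on (real, imag) tuples using the two fused models.

    Computes X = x + w*y and Y = x - w*y, where w is the twiddle factor.
    """
    xr, xi = x
    yr, yi = y
    wr, wi = w
    # Complex product w*y: each part is one two-term dot product.
    tr = fused_dot2(wr, yr, -wi, yi)  # real part: wr*yr - wi*yi
    ti = fused_dot2(wr, yi, wi, yr)   # imag part: wr*yi + wi*yr
    # Each add-subtract produces one component of both outputs.
    sum_r, diff_r = fused_add_sub(xr, tr)
    sum_i, diff_i = fused_add_sub(xi, ti)
    return (sum_r, sum_i), (diff_r, diff_i)


if __name__ == "__main__":
    # Twiddle factor w = -j applied to y = 3 + 4j gives 4 - 3j.
    print(radix2_butterfly((1.0, 2.0), (3.0, 4.0), (0.0, -1.0)))

    # One rounding versus two: a*a - 1 with a = 1 + 2**-30.
    a = 1.0 + 2.0 ** -30
    print(fused_dot2(a, a, -1.0, 1.0))  # keeps the 2**-60 term
    print(a * a - 1.0)                  # product is rounded first, term is lost
```

In this model a radix-2 butterfly uses two dot products and two add-subtract operations, which illustrates why the fused version performs fewer rounding steps than one built from separate multipliers and adders.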
Floating-point arithmetic, fused floating-point operations, fast Fourier transform, Radix-2 FFT butterfly, Radix-4 FFT butterfly.
Earl E. Swartzlander, Hani H.M. Saleh, "FFT Implementation with Fused Floating-Point Operations", IEEE Transactions on Computers, vol.61, no. 2, pp. 284-288, February 2012, doi:10.1109/TC.2010.271