Subscribe
Issue No.09 - Sept. (2012 vol.61)
pp: 1243-1255
Dong Wang , Inst. of Microelectron., Tsinghua Univ., Beijing, China
M. D. Ercegovac , UCLA Comput. Sci. Dept., Univ. of California, Los Angeles, CA, USA
ABSTRACT
We present a novel design of a radix-16 combined unit for complex division and square root in fixed-point format. A new digit-recurrence algorithm with two-step operand prescaling is developed for complex square root to avoid postscaling of the result. A combined recurrence algorithm is generalized and a scalable hardware architecture is proposed. Designs with different operand precision are implemented in Altera Stratix-II FPGA and cost and performance are evaluated and compared with reference designs of complex division or square root implemented combined or separately. The results show advantages of the proposed combined design in cost and performance.
INDEX TERMS
prescalers, field programmable gate arrays, fixed point arithmetic, Altera Stratix-II FPGA, radix-16 combined complex division-square root, fixed-point format, digit-recurrence algorithm, two-step operand prescaling, hardware architecture, Algorithm design and analysis, Field programmable gate arrays, Table lookup, Hardware, Signal processing algorithms, Zirconium, Digital signal processing, radix-16 algorithms., Complex division, complex square root, digit-recurrence, operand prescaling, postscaling, combined design, field-programmable gate array (FPGA)
CITATION
Dong Wang, M. D. Ercegovac, "A Radix-16 Combined Complex Division/Square Root Unit with Operand Prescaling", IEEE Transactions on Computers, vol.61, no. 9, pp. 1243-1255, Sept. 2012, doi:10.1109/TC.2011.143
REFERENCES
 [1] Exocortex.DSP, an Open Source C# Complex Number and FFT Library for Microsoft .NET, http://www.exocortex.orgdsp/, 2011. [2] CoStLy: Complex Standard Functions Library, http://iamlasun8. mathematik.uni-karlsruhe.de/ ~ae16CoStLy.htmll, 2011. [3] D. Bindel, J. Demmel, W. Kahan, and O. Marques, “On Computing Givens Rotations Reliably and Efficiently,” ACM Trans. Math. Software, vol. 28, pp. 206-238, 2002. [4] N. Hemkumar and J. Cavallaro, “A Systolic VLSI Architecture for Complex SVD,” Proc. IEEE Int'l Symp. Circuits and Systems, pp. 1061-1064, 1992. [5] M.A. Elliot, G.A. Walker, A. Swift, and K. Vandenborne, “Spectral Quantitation by Principal Component Analysis Using Complex Singular Value Decomposition,” Magnetic Resonance in Medicine, vol. 41, pp. 450-455, 1999. [6] J. Mitroy and I.A. Ivallov, “Quantum Defect Theory for the Study of Hadronic Atoms,” J. Physics, vol. 27, pp. 1-13, 2001. [7] J. Salo, J. Fagerholm, A.T. Friberg, and M.M. Salomaa, “Unified Description of Nondiffracting X and Y Waves,” Physics Rev., vol. 62, pp. 4261-4275, 2000. [8] R.L. Smith, “Algorithm 116: Complex division,” Comm. the ACM, vol. 5, no. 8, p. 435, 1962. [9] G.W. Stewart, “A Note on Complex Division,” ACM Trans. Math. Software, vol. 11, no. 3, pp. 238-241, 1985. [10] M.D. Ercegovac and J.-M. Muller, “Complex Division with Prescaling of Operands,” Proc. IEEE Int'l Conf. Application-Specific Systems, Architectures and Processors, pp. 304-314, 2003. [11] M.D. Ercegovac and J.-M. Muller, “Design of a Complex Divider,” Proc. SPIE Advanced Signal Processing Algorithms, Architectures, and Implementations XII, pp. 51-59, 2004. [12] M.D. Ercegovac and J.-M. Muller, “Complex Square Root with Operand Prescaling,” Proc. IEEE Int'l Conf. Applications-Specific Systems, Architectures and Processors, pp. 293-303, 2004. [13] M.D. Ercegovac and J.-M. Muller, “Complex Square Root with Operand Prescaling,” The J. Very Large-Scale Integration Signal Processing, vol. 49, pp. 19-30, 2007. [14] M.D. Ercegovac and T. Lang, Division and Square Root: Digit-Recurrence Algorithms and Implementations. Kluwer Academic Publishers, 1994. [15] D. Wang, M.D. Ercegovac, and N. Zheng, “A Radix-8 Complex Divider for FPGA Implementations,” Proc. 19th Int'l Conf. Field Programmable Logic and Applications, (FPL '09), 2009. [16] D. Wang and M.D. Ercegovac, “A Design of Complex Square Root for FPGA Implementation,” Proc. SPIE Advanced Signal Processing Algorithms, Architectures, and Implementations XII, Aug. 2009. [17] F. Edman and V. Oewall, “Fixed-Point Implementation of a Robust Complex Valued Divider Architecture,” Proc. European Conf. Circuit Theory and Design, Aug. 2005. [18] J. Liu, B. Weaver, and Y. Zakharov, “FPGA Implementation of Multiplication-Free Complex Division,” Electronic Letters, vol. 44, no. 2, pp. 95-96, Jan. 2008. [19] P. Dormiani, M.D. Ercegovac, and J.-M. Muller, “Design and Implementation of a Radix-4 Complex Division Unit with Prescaling,” Proc. IEEE 20th Int'l Conf. Application-Specific Systems, Architectures and Processors, July 2009. [20] T. Lang and P. Montuschi, “Very High Radix Square Root with Prescaling and Rounding and a Combined Division/Square Root Unit,” IEEE Trans. Computers, vol. 48, no. 8, pp. 827-841, Aug. 1999. [21] D. Lewis et al, “The Stratix-II Logic and Routing Architecture,” Proc. Int'l Symp. Field Programmable Gate Arrays, pp. 14-20, 2005. [22] lpm_mult Megafunction User Guide V1.0, Altera Corp., www. altera.com, Sept. 2004. [23] E. Maeland, “On the Comparison of Interpolation Methods,” IEEE Trans. Medical Imaging, vol. 7, no. 3, pp. 213-217, Sept. 1988. [24] C. Lin, M. Sheu, H. Chiang, C. Liaw, and Z. Wu, “The Efficient VLSI Design of Bi-Cubic Convolution Interpolation for Digital Image Processing,” Proc. IEEE Int'l Symp. Circuits and Systems, pp. 480-483, 2008. [25] D. Das Sarma and D. Matula, “Faithful Bipartite ROM Reciprocal Tables,” Proc. 12th Symp. Computer Arithmetic, S. Knowles and W. McAllister, eds., pp. 17-28, 1995. [26] M. Schulte and J. Stine, “Approximating Elementary Functions with Symmetric Bipartite Tables,” IEEE Trans. Computers, vol. 48, no. 8, pp. 842-847, Aug. 1999. [27] F. Dinechin and A. Tisserand, “Multipartite Table Method,” IEEE Trans. Computers, vol. 54, no. 3, pp. 319-330, Mar. 2005. [28] M.D. Ercegovac and T. Lang, “Fast Multiplication without Carry-Propagate Addition,” IEEE Trans. Computers, vol. 13, no. 11, pp. 1385-1390, Nov. 1990. [29] M.D. Ercegovac and T. Lang, “On-the-Fly Rounding,” IEEE Trans. Computers, vol. 41, no. 12, pp. 1497-1503, Dec. 1992. [30] M.D. Ercegovac and T. Lang, Digital Arithmetic. Morgan Kaufmann Publishers, 2004. [31] Polynomial Curve Fitting Using Matlab, http://www.mathworks. com/access/helpdesk/ help/techdoc/refpolyfit. html, 2011. [32] S. Hitotumatu, “Complex Arithmetic Trough CORDIC,” KODAI Math. Seminar Reports, vol. 26, nos. 2/3, pp. 176-186, 1975.