loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
2009 IEEE International Conference on Acoustics, Speech and Signal Processing
Generating high performance pruned FFT implementations
Taipei, Taiwan
April 19-April 24
ISBN: 978-1-4244-2353-8
Franz Franchetti, Carnegie Mellon University, Department of Electrical and Computer Engineering, Pittsburgh, PA, United States
Markus Puschel, Carnegie Mellon University, Department of Electrical and Computer Engineering, Pittsburgh, PA, United States
We derive a recursive general-radix pruned Cooley-Tukey fast Fourier transform (FFT) algorithm in Kronecker product notation. The algorithm is compatible with vectorization and parallelization required on state-of-the-art multicore CPUs. We include the pruned FFT algorithm into the program generation system Spiral, and automatically generate optimized implementations of the pruned FFT for the Intel Core2Duo multicore processor. Experimental results show that using the pruned FFT can indeed speed up the fastest available FFT implementations by up to 30% when the problem size and the pattern of unused inputs and outputs are known in advance.
Citation:
Franz Franchetti, Markus Puschel, "Generating high performance pruned FFT implementations," icassp, pp.549-552, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, 2009
Usage of this product signifies your acceptance of the Terms of Use.