
This Article  
 
Share  
Bibliographic References  
Add to:  
Digg Furl Spurl Blink Simpy Del.icio.us Y!MyWeb  
Search  
 
ASCII Text  x  
Hatem Ltaief, Jakub Kurzak, Jack Dongarra, "Parallel TwoSided Matrix Reduction to Band Bidiagonal Form on Multicore Architectures," IEEE Transactions on Parallel and Distributed Systems, vol. 21, no. 4, pp. 417423, April, 2010.  
BibTex  x  
@article{ 10.1109/TPDS.2009.79, author = {Hatem Ltaief and Jakub Kurzak and Jack Dongarra}, title = {Parallel TwoSided Matrix Reduction to Band Bidiagonal Form on Multicore Architectures}, journal ={IEEE Transactions on Parallel and Distributed Systems}, volume = {21}, number = {4}, issn = {10459219}, year = {2010}, pages = {417423}, doi = {http://doi.ieeecomputersociety.org/10.1109/TPDS.2009.79}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, }  
RefWorks Procite/RefMan/Endnote  x  
TY  JOUR JO  IEEE Transactions on Parallel and Distributed Systems TI  Parallel TwoSided Matrix Reduction to Band Bidiagonal Form on Multicore Architectures IS  4 SN  10459219 SP417 EP423 EPD  417423 A1  Hatem Ltaief, A1  Jakub Kurzak, A1  Jack Dongarra, PY  2010 KW  Bidiagonal reduction KW  singular value decomposition KW  tile algorithms KW  multicores. VL  21 JA  IEEE Transactions on Parallel and Distributed Systems ER   
[1] http:/www.top500.org, 2009.
[2] http://www.intel.com/cd/software/products/ asmona/eng307757.htm, 2009.
[3] E. Anderson, Z. Bai, C. Bischof, S. Blackford, J. Demmel, J. Dongarra, J.D. Croz, A. Greenbaum, S. Hammarling, A. McKenney, and D. Sorensen, LAPACK Users' Guide, third ed. Soc. Industrial and Applied Math., 1999.
[4] J.L. Barlow, N. Bosner, and Z. Drmač, "A New Stable Bidiagonal Reduction Algorithm," Linear Algebra and Its Applications, vol. 397, no. 1, pp. 3584, Mar. 2005.
[5] M.W. Berry, J.J. Dongarra, and Y. Kim, "LAPACK Working Note 68: A Highly Parallel Algorithm for the Reduction of a Nonsymmetric Matrix to Block UpperHessenberg Form," Technical Report UTCS94221, Dept. of Computer Science, Univ. of Tennessee, Feb. 1994.
[6] N. Bosner and J.L. Barlow, "Block and Parallel Versions of OneSided Bidiagonalization," SIAM J. Matrix Analysis and Applications, vol. 29, no. 3, pp. 927953, 2007.
[7] A. Buttari, J. Langou, J. Kurzak, and J. Dongarra, "Parallel Tiled QR Factorization for Multicore Architectures," Concurrency and Computation, vol. 20, no. 13, pp. 15731590, 2008.
[8] T.F. Chan, "An Improved Algorithm for Computing the Singular Value Decomposition," ACM Trans. Math. Software, vol. 8, no. 1, pp. 7283, Mar. 1982.
[9] J. Choi, J. Demmel, I. Dhillon, J. Dongarra, S. Ostrouchov, A. Petitet, K. Stanley, D. Walker, and R.C. Whaley, "ScaLAPACK, a Portable Linear Algebra Library for Distributed Memory ComputersDesign Issues and Performance," Computer Physics Comm., vol. 97, nos. 1/2, pp. 115, 1996.
[10] D.M. Christopher, K. Eugenia, and M. Takemasa, "Estimating and Correcting Global Weather Model Error," Monthly Weather Rev., vol. 135, no. 2, pp. 281299, 2007.
[11] E. Elmroth and F.G. Gustavson, "New Serial and Parallel Recursive QR Factorization Algorithms for SMP Systems," Proc. Fourth Int'l Workshop Applied Parallel Computing, Large Scale Scientific and Industrial Problems (PARA '98), pp. 120128, June 1998.
[12] E. Elmroth and F.G. Gustavson, "Applying Recursion to Serial and Parallel QR Factorization Leads to Better Performance," IBM J. Research and Development, vol. 44, no. 4, pp. 605624, 2000.
[13] E. Elmroth and F.G. Gustavson, "HighPerformance Library Software for QR Factorization," Proc. Fifth Int'l Workshop, Applied Parallel Computing, New Paradigms for HPC in Industry and Academia (PARA '00), pp. 5363. June 2000, http://dx.doi.org/10.10073540707344_9 .
[14] G.H. Golub and C.F. Van Loan, Matrix Computation, John Hopkins Studies in the Math. Sciences, third ed. Johns Hopkins Univ. Press, 1996.
[15] G.H. Golub and W. Kahan, "Calculating the Singular Values and the Pseudo Inverse of a Matrix," SIAM J. Numerical Analysis, vol. 2, pp. 205224, 1965.
[16] B. Grosser and B. Lang, "Efficient Parallel Reduction to Bidiagonal Form," Parallel Computing, vol. 25, no. 8, pp. 969986, 1999.
[17] B.C. Gunter and R.A. van de Geijn, "Parallel OutofCore Computation and Updating of the QR Factorization," ACM Trans. Math. Software, vol. 31, no. 1, pp. 6078, Mar. 2005.
[18] J. Kurzak, A. Buttari, and J.J. Dongarra, "Solving Systems of Linear Equation on the CELL Processor Using Cholesky Factorization," IEEE Trans. Parallel and Distributed Systems, vol. 19, no. 9, pp. 11751186, Sept. 2008.
[19] J. Kurzak, A. Buttari, and J.J. Dongarra, "Solving Systems of Linear Equations on the CELL Processor Using Cholesky Factorization," IEEE Trans. Parallel and Distributed Systems, vol. 19, no. 9, pp. 111, Sept. 2008.
[20] J. Kurzak and J.J. Dongarra, "QR Factorization for the CELL Processor," J. Scientific Programming, special issue on high performance computing on CELL B.E. processors, pp. 112, 2008.
[21] B. Lang, "Parallel Reduction of Banded Matrices to Bidiagonal Form," Parallel Computing, vol. 22, no. 1, pp. 118, 1996.
[22] H. Ltaief, J. Kurzak, and J. Dongarra, "LAPACK Working Note 208: Parallel Block Hessenberg Reduction Using AlgorithmsbyTiles for Multicore Architectures Revisited," Technical Report UTCS08624, Dept. of Computer Science, Univ. of Tennessee, Aug. 2008.
[23] E.S. QuintanaOrtí and R.A. van de Geijn, "Updating an LU Factorization with Pivoting," ACM Trans. Math. Software, vol. 35, no. 2, July 2008.
[24] G. QuintanaOrtí, E.S. QuintanaOrtí, E. Chan, R.A. van de Geijn, and F.G. Van Zee, "Scheduling of QR Factorization Algorithms on SMP and MultiCore Architectures," Proc. Int'l Conf. Parallel, Distributed and NetworkBased Processing (PDP), pp. 301310, 2008.
[25] R. Ralha, "OneSided Reduction to Bidiagonal Form," Linear Algebra and Its Applications, vol. 358, pp. 219238, Jan. 2003.
[26] R. Schreiber and C. Van Loan, "A Storage Efficient WY Representation for Products of Householder Transformations," SIAM J. Scientific and Statistical Computing, vol. 10, pp. 5357, 1989.
[27] G.W. Stewart, Matrix Algorithms Volume I: Matrix Decompositions. SIAM, 1998.
[28] L.N. Trefethen and D. Bau, Numerical Linear Algebra. SIAM, 1997.
[29] E.L. Yip, "Fortran Subroutines for OutofCore Solutions of Large Complex Linear Systems," Technical Report CR159142, NASA, Nov. 1979.
[30] K. Yotov, T. Roeder, K. Pingali, J. Gunnels, and F. Gustavson, "An Experimental Comparison of CacheOblivious and CacheConscious Programs," Proc. 19th Ann. ACM Symp. Parallel Algorithms and Architectures (SPAA '07), pp. 93104, 2007.