An Implementation of the Tile QR Factorization for a GPU and Multiple CPUs
Jakub Kurzak, Rajib Nath, Peng Du, and Jack Dongarra
The tile QR factorization provides an efficient and scalable way to factor a dense matrix in parallel on multi-core processors. This article presents an efficient implementation of the algorithm on a system with a powerful GPU and many multi-core CPUs.
Published 2010-09-15 04:00:00 as ut-cs-10-657 (ID:59)