Skip to content Skip to main navigation Report an accessibility issue

EECS Publication

Accelerating the Reduction to Upper Hessenberg Form through Hybrid GPU-Based Computing

Stanimire Tomov and Jack Dongarra

We present a Hessenberg reduction (HR) algorithm for hybrid multi-core + GPU systems that gets more than 16x performance improvement over the current LAPACK algorithm running just on current multicores (in double precision arithmetic). This enormous acceleration is due to proper matching of algorithmic requirements to architectural strengths of the hybrid components. The reduction itself is an important linear algebra problem, especially with its relevance to eigenvalue problems. The results described in this paper are significant because Hessenberg reduction has not yet been accelerated on multi-core architectures, and it plays a significant role in solving nonsymmetric eigenvalue problems. The approach can be applied to the symmetric problem and in general, to two-sided matrix transformations. The work further motivates and highlights the strengths of hybrid computing: to harness the strengths of the components of a hybrid architecture to get significant computational acceleration which otherwise may have been impossible.

Published  2009-05-24 04:00:00  as  ut-cs-09-642 (ID:74)

ut-cs-09-642.pdf

« Back to Listing