All Publications
- POMPEI: Programming with OpenMP4 for Exascale Investigations, December 2017, ut-eecs-17-754.pdf
- Roadmap for the Development of a Linear Algebra Library for Exascale Computing: SLATE: Software for Linear Algebra Targeting Exascale, September 2017, ut-eecs-17-752.pdf
- C++ API for BLAS and LAPACK, September 2017, ut-eecs-17-753.pdf http://www.netlib.org/blas/ http://www.netlib.org/lapack/
- PLASMA 17 Performance Report, June 2017, ut-eecs-17-750.pdf
- PLASMA 17.1 Functionality Report, June 2017, ut-eecs-17-751.pdf
- Small Tensor Operations on Advanced Architectures for High-Order Applications, April 2017, ut-eecs-17-749.pdf
- Fast Cholesky Factorization on GPUs for Batch and Native Modes in MAGMA, January 2017, ut-eecs-16-748.pdf
- High-performance Cholesky Factorization for GPU-only Execution, December 2016, ut-eecs-16-747.pdf