All Publications
- Analysis of Dynamically Scheduled Tile Algorithms for Dense Linear Algebra on Multicore Architectures, March 2011, ut-cs-11-666.pdf
- Heuristics for Optimizing Matrix-Based Erasure Codes for Fault-Tolerant Storage Systems, December 2010, ut-cs-10-664.pdf
- Kernel Assisted Collective Intra-node Communication Among Multicore and Manycore CPUs, November 2010, ut-cs-10-663.pdf
- Molecular Coordination of Hierarchical Self-Assembly, November 2010, ut-cs-10-662.pdf
- Reducing the time to tune parallel dense linear algebra routines with partial execution and performance modelling, October 2010, ut-cs-10-661.pdf
- An Implementation of the Tile QR Factorization for a GPU and Multiple CPUs, September 2010, ut-cs-10-657.pdf
- Faster, Cheaper, Better - A Hybridization Methodology to Develop Linear Algebra Software for GPUs, September 2010, ut-cs-10-658.pdf
- DAGuE: A generic distributed DAG engine for high performance computing, September 2010, ut-cs-10-659.pdf