Sukkari, Dalal; Ltaief, Hatem; Esposito, Aniello; Keyes, David A QDWH-based SVD software framework on distributed-memory manycore systems. (English) Zbl 1471.65023 ACM Trans. Math. Softw. 45, No. 2, Article No. 18, 21 p. (2019). MSC: 65F15 65Y10 PDFBibTeX XMLCite \textit{D. Sukkari} et al., ACM Trans. Math. Softw. 45, No. 2, Article No. 18, 21 p. (2019; Zbl 1471.65023) Full Text: DOI Link
Charara, Ali; Keyes, David; Ltaief, Hatem Batched triangular dense linear algebra kernels for very small matrix sizes on GPUs. (English) Zbl 1471.65028 ACM Trans. Math. Softw. 45, No. 2, Article No. 15, 28 p. (2019). MSC: 65F99 65Y10 65Y15 PDFBibTeX XMLCite \textit{A. Charara} et al., ACM Trans. Math. Softw. 45, No. 2, Article No. 15, 28 p. (2019; Zbl 1471.65028) Full Text: DOI Link
Boukaram, Wajih; Turkiyyah, George; Keyes, David Hierarchical matrix operations on GPUs. Matrix-vector multiplication and compression. (English) Zbl 1471.65027 ACM Trans. Math. Softw. 45, No. 1, Article No. 3, 28 p. (2019). MSC: 65F99 65Y10 PDFBibTeX XMLCite \textit{W. Boukaram} et al., ACM Trans. Math. Softw. 45, No. 1, Article No. 3, 28 p. (2019; Zbl 1471.65027) Full Text: DOI arXiv
Sukkari, Dalal; Ltaief, Hatem; Keyes, David A high performance QDWH-SVD solver using hardware accelerators. (English) Zbl 1369.65058 ACM Trans. Math. Softw. 43, No. 1, Article No. 6, 25 p. (2016). MSC: 65F15 15A23 65Y10 65Y20 PDFBibTeX XMLCite \textit{D. Sukkari} et al., ACM Trans. Math. Softw. 43, No. 1, Article No. 6, 25 p. (2016; Zbl 1369.65058) Full Text: DOI Link
Abdelfattah, Ahmad; Keyes, David; Ltaief, Hatem KBLAS: an optimized library for dense matrix-vector multiplication on GPU accelerators. (English) Zbl 1369.65042 ACM Trans. Math. Softw. 42, No. 3, Article No. 18, 31 p. (2016). MSC: 65Fxx 65Y10 65Y15 65Y20 PDFBibTeX XMLCite \textit{A. Abdelfattah} et al., ACM Trans. Math. Softw. 42, No. 3, Article No. 18, 31 p. (2016; Zbl 1369.65042) Full Text: DOI arXiv