Tomov, Stanimire; Dongarra, Jack; Baboulin, Marc Towards dense linear algebra for hybrid GPU accelerated manycore systems. (English) Zbl 1204.68268 Parallel Comput. 36, No. 5-6, 232-240 (2010). MSC: 68W10 68M99 65F99 65Y05 PDFBibTeX XMLCite \textit{S. Tomov} et al., Parallel Comput. 36, No. 5--6, 232--240 (2010; Zbl 1204.68268) Full Text: DOI Link
Göddeke, Dominik; Strzodka, Robert; Turek, Stefan Performance and accuracy of hardware-oriented native-, emulated- and mixed-precision solvers in FEM simulations. (English) Zbl 1188.68084 Int. J. Parallel Emergent Distrib. Syst. 22, No. 4, 221-256 (2007). MSC: 68M20 68M99 PDFBibTeX XMLCite \textit{D. Göddeke} et al., Int. J. Parallel Emergent Distrib. Syst. 22, No. 4, 221--256 (2007; Zbl 1188.68084) Full Text: DOI
Nievergelt, Yves Scalar fused multiply-add instructions produce floating-point matrix arithmetic provably accurate to the penultimate digit. (English) Zbl 1069.68505 ACM Trans. Math. Softw. 29, No. 1, 27-48 (2003). MSC: 68M07 68W99 65Y99 68M99 PDFBibTeX XMLCite \textit{Y. Nievergelt}, ACM Trans. Math. Softw. 29, No. 1, 27--48 (2003; Zbl 1069.68505) Full Text: DOI