PACO 2019: 3rd Workshop on Power-Aware Computing

Name: PACO 2019: 3rd Workshop on Power-Aware Computing
Start: 2019-11-05T08:00:00+01:00
End: 2019-11-06T13:00:00+01:00
Location: Max Planck Institute for Dynamics of Complex Technical Systems

Europe/Berlin timezone

Iterative Refinement in Three Precisions

5 Nov 2019, 13:45

45m

Main/groundfloor-V0.05/2+3 - Prigogine (Max Planck Institute for Dynamics of Complex Technical Systems)

Sandtorstr. 1, 39106 Magdeburg, Saxony-Anhalt, Germany

Keynote

Erin Carson (Charles University)

Support for floating-point arithmetic in multiple precisions is becoming increasingly common in emerging architectures.
For example, half precision is now available on NVIDIA V100 GPUs, where it runs twice as fast as single precision with proportional savings in energy consumption. Further, the V100's half-precision tensor cores can provide up to a 16x speedup over double precision.

We present a general algorithm for solving an $n$-by-$n$ nonsingular linear system $Ax = b$ based on iterative refinement in three precisions. The working precision is combined with possibly different precisions for solving for the correction term and for computing the residuals. Our rounding error analysis of the algorithm provides sufficient conditions for convergence and bounds for the attainable normwise forward error and normwise and componentwise backward errors, generalizing and unifying many existing rounding error analyses for iterative refinement.
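The structure of such an algorithm can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the helper names (`lu_factor`, `lu_solve`, `refine`), the precision choices (half for the factorization and correction solve, single as working precision, double for residuals), and the residual scaling before the half-precision solve are all assumptions made here for illustration.

```python
import numpy as np

def lu_factor(A, dtype):
    """LU factorization with partial pivoting, carried out in `dtype`."""
    LU = A.astype(dtype)            # astype returns a copy
    n = LU.shape[0]
    perm = np.arange(n)
    for k in range(n - 1):
        p = k + int(np.argmax(np.abs(LU[k:, k])))   # pivot row
        if p != k:
            LU[[k, p]] = LU[[p, k]]
            perm[[k, p]] = perm[[p, k]]
        LU[k + 1:, k] /= LU[k, k]                   # multipliers (L)
        LU[k + 1:, k + 1:] -= np.outer(LU[k + 1:, k], LU[k, k + 1:])
    return LU, perm

def lu_solve(LU, perm, b):
    """Solve with the stored factors, in the precision of the factors."""
    n = LU.shape[0]
    y = b[perm].astype(LU.dtype)
    for i in range(n):                              # forward substitution
        y[i] -= np.dot(LU[i, :i], y[:i])
    for i in range(n - 1, -1, -1):                  # back substitution
        y[i] = (y[i] - np.dot(LU[i, i + 1:], y[i + 1:])) / LU[i, i]
    return y

def refine(A, b, uf=np.float16, u=np.float32, ur=np.float64, iters=10):
    """Iterative refinement sketch: factorize in uf, update in u,
    compute residuals in ur (all three choices are assumptions)."""
    LU, perm = lu_factor(A, uf)
    x = lu_solve(LU, perm, b).astype(u)             # initial low-precision solve
    A_r, b_r = A.astype(ur), b.astype(ur)
    for _ in range(iters):
        r = b_r - A_r @ x.astype(ur)                # residual in precision ur
        norm = np.max(np.abs(r))
        if norm == 0:
            break
        # Scale the residual so it does not underflow in half precision
        # (an assumption of this sketch), then undo the scaling in u.
        d = lu_solve(LU, perm, r / norm).astype(u) * u(norm)
        x = x + d                                   # update in working precision u
    return x

# Small, well-conditioned example (hypothetical test data).
rng = np.random.default_rng(0)
n = 8
A = rng.random((n, n)) + n * np.eye(n)
x_true = np.ones(n)
b = A @ x_true
x = refine(A, b)
print("max error:", np.max(np.abs(x - x_true)))
```

The factorization, the most expensive step, is done once in the lowest precision, and each refinement step reuses the factors. The fixed iteration count stands in for a proper convergence test, which a real solver would need.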

We show further that by solving the correction equations with GMRES preconditioned by the LU factors, the restriction on the condition number can be weakened, allowing the solution of systems that are extremely ill-conditioned with respect to the working precision. Compared with a standard $Ax = b$ solver that uses LU factorization in single precision, these results suggest that on architectures where half precision is efficiently implemented, it will be possible to solve certain linear systems $Ax = b$ in less time and with greater accuracy.
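As a sketch of this idea, the GMRES-based correction step can be written as follows. The notation is assumed here and does not come from the abstract: $\hat{L}$ and $\hat{U}$ are the computed LU factors, $r_i$ is the residual, $d_i$ is the correction, and row permutations are omitted.

$$
A \approx \hat{L}\hat{U}, \qquad
r_i = b - A x_i, \qquad
\hat{U}^{-1}\hat{L}^{-1} A \, d_i = \hat{U}^{-1}\hat{L}^{-1} r_i, \qquad
x_{i+1} = x_i + d_i .
$$

GMRES is applied to the preconditioned system in the third relation instead of solving $\hat{L}\hat{U} d_i = r_i$ directly. The preconditioned matrix $\hat{U}^{-1}\hat{L}^{-1}A$ is expected to be much better conditioned than $A$ even when the low-precision factors are inaccurate, which is what relaxes the restriction on the condition number.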

We present recent performance results on the latest GPU architectures, which show that this approach can yield practical speedups, and discuss recent work on extending it to iterative refinement for least squares problems.

Authors: Erin Carson (Charles University), Nicholas J. Higham (School of Mathematics, University of Manchester)
