Hierarchical matrices (H-matrices) lie in between the dense and sparse cases. It is therefore natural to tackle the LU factorization of H-matrices via a task-parallel approach, which has recently yielded successful results for related linear algebra problems. In this work, we describe how to discover the data-flow parallelism intrinsic to the operation at execution time, via the analysis...
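The data-flow idea behind such task-parallel factorizations can be illustrated with a small sketch: record, for each task of a tiled LU, which tiles it reads and writes, and make it depend on the last writer of any tile it touches. The kernel names and the graph construction below are illustrative of the general tiled-LU pattern, not of the H-matrix method of the abstract.

```python
# Sketch: discovering data-flow dependencies among the tasks of a tiled
# (blocked) LU factorization. Each task is registered with the tiles it
# reads/writes; a task depends on the last writer of any tile it touches
# (read-after-write and write-after-write hazards; anti-dependencies are
# ignored in this simplified sketch).

def lu_task_graph(n_tiles):
    deps = {}          # task -> set of tasks it must wait for
    last_writer = {}   # tile (i, j) -> task that last wrote it

    def add(task, reads, writes):
        deps[task] = {last_writer[t] for t in reads + writes if t in last_writer}
        for t in writes:
            last_writer[t] = task

    for k in range(n_tiles):
        add(('getrf', k), [], [(k, k)])                      # factor diagonal tile
        for j in range(k + 1, n_tiles):
            add(('trsm_row', k, j), [(k, k)], [(k, j)])      # update row panel
            add(('trsm_col', k, j), [(k, k)], [(j, k)])      # update column panel
        for i in range(k + 1, n_tiles):
            for j in range(k + 1, n_tiles):
                add(('gemm', k, i, j), [(i, k), (k, j)], [(i, j)])  # trailing update
    return deps
```

Executing the tasks in any order compatible with `deps` exposes exactly the parallelism of the data flow, which is what a runtime discovers at execution time.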
Field-Programmable Gate Arrays (FPGAs) used as hardware accelerators offer great flexibility and performance, and are recently emerging as a more energy-efficient alternative to other many-core devices.
The traditional methods for FPGA design involve the use of low-level Hardware Description Languages such as VHDL or Verilog. These impose a vastly different programming model than standard...
The modeling of physical phenomena as Linear Time-Invariant (LTI) systems is a common practice across science and industry. It is often the case that the order of these models is so large that it renders them impractical for simulating the studied system. In such cases, practitioners can turn to Model Order Reduction (MOR) techniques, which, starting from the original model, produce a...
Preconditioned Krylov-subspace methods for the solution of general sparse linear systems are the computational bottleneck in many science and engineering problems. In particular, the application of the preconditioner in each iteration of the solver is the stage that concentrates most of the processing time. This stage often involves the solution of a number of sparse triangular linear...
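The sparse triangular solves at the heart of that stage amount to a sparse forward (or backward) substitution. A minimal sketch, assuming a lower-triangular matrix in CSR format whose last entry per row is the diagonal (the storage convention here is illustrative, not prescribed by the abstract):

```python
# Sketch: forward substitution for a sparse lower-triangular system L x = b.
# L is stored in CSR form (row_ptr, col_idx, vals); by assumption, the last
# nonzero of each row is the diagonal entry.

def sparse_lower_trsv(row_ptr, col_idx, vals, b):
    n = len(b)
    x = [0.0] * n
    for i in range(n):
        s = b[i]
        # subtract contributions of the already-computed unknowns
        for k in range(row_ptr[i], row_ptr[i + 1] - 1):
            s -= vals[k] * x[col_idx[k]]
        diag = vals[row_ptr[i + 1] - 1]
        x[i] = s / diag
    return x
```

The loop-carried dependence of `x[i]` on earlier unknowns is precisely what makes this kernel hard to parallelize and a frequent bottleneck.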
Efficiently processing unbalanced and irregular matrices on manycore architectures is a challenging problem. Building on a load-balancing Sparse Matrix-Vector Multiplication (SpMV) based on the coordinate format (COO), we have designed an SpMV kernel that delivers attractive performance across a wide range of matrices. In this contribution, we present the load-balancing COO SpMV kernel, elaborate...
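The load-balancing idea can be sketched as follows: split the *nonzeros* (not the rows) evenly across workers, so that a few very long rows cannot unbalance the work. The chunking scheme and names below are illustrative; the actual GPU kernel of the contribution is more elaborate.

```python
# Sketch of a COO SpMV (y = A x) where work is partitioned by nonzero count
# rather than by row, the essence of a load-balanced COO kernel.

def coo_spmv(rows, cols, vals, x, n_rows, n_chunks=4):
    y = [0.0] * n_rows
    nnz = len(vals)
    # equally sized nonzero chunks: irregular row lengths cannot skew the split
    bounds = [nnz * c // n_chunks for c in range(n_chunks + 1)]
    for c in range(n_chunks):
        for k in range(bounds[c], bounds[c + 1]):
            # on a GPU, rows split across chunks would be combined with
            # atomic adds; sequentially a plain += suffices
            y[rows[k]] += vals[k] * x[cols[k]]
    return y
```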
The use of multiprecision numerics is becoming increasingly attractive, as modern processor architectures often achieve significantly higher performance and throughput rates when using lower precision than IEEE double precision. Error analysis investigates how the rounding errors introduced by the different precision formats propagate through the algorithms and potentially impact the...
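The effect such an analysis studies is easy to demonstrate: repeated accumulation in a lower-precision format lets per-operation rounding errors pile up. A small sketch, emulating IEEE binary32 arithmetic in pure Python via a round-trip through the packed representation:

```python
import struct

def to_f32(x):
    # round a double to the nearest IEEE binary32 (single precision) value
    return struct.unpack('f', struct.pack('f', x))[0]

def accumulate(n, term, single=False):
    """Sum `term` n times; when `single` is set, round after every addition
    to emulate single-precision accumulation and expose error growth."""
    s = 0.0
    for _ in range(n):
        s = s + term
        if single:
            s = to_f32(s)
    return s

# Summing 0.1 a hundred thousand times: the double-precision result stays
# very close to 10000, while the emulated single-precision sum drifts by a
# clearly visible margin.
```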
Power-aware computing introduces an additional dimension in the development of efficient parallel codes for heterogeneous computing architectures. Along with experimental frameworks that facilitate such measurements, there is a need for data analysis strategies and programming guidelines that jointly consider execution speed and energy consumption, among...
Modeling the execution time and the energy efficiency of the Sparse Matrix-Vector product (SpMV) on a current CPU architecture is especially complex due to i) irregular memory accesses; ii) indirect memory referencing; and iii) low arithmetic intensity. While analytical models may yield accurate estimates for the total number of cache hits/misses, they often fail to predict...
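A back-of-the-envelope analytical model of the kind alluded to treats SpMV as purely memory-bound and estimates runtime from the data traffic divided by the sustained bandwidth. The traffic accounting below (CSR storage, x streamed once) is a deliberately optimistic illustration, not the predictor developed in the contribution:

```python
def spmv_csr_time_estimate(n_rows, nnz, bw_gbs, idx_bytes=4, val_bytes=8):
    """Crude memory-bound runtime estimate (seconds) for CSR SpMV:
    traffic = matrix values + column indices + row pointers + the input
    and output vectors (optimistically assuming x is read only once)."""
    traffic = (nnz * (val_bytes + idx_bytes)       # values and column indices
               + (n_rows + 1) * idx_bytes          # row pointers
               + 2 * n_rows * val_bytes)           # x read once, y written once
    return traffic / (bw_gbs * 1e9)
```

Precisely because real access patterns are irregular and x is rarely streamed just once, such closed-form estimates tend to be lower bounds, which motivates the more refined modeling discussed above.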
At the PACO workshop 2017, we presented the idea of decoupling the memory precision from the arithmetic precision, storing a block-Jacobi preconditioner such that the precision format of each diagonal block is tailored to its numerical characteristics. The idea is to reduce the pressure on the memory bandwidth while preserving regularity of the preconditioner and the convergence of the...
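A minimal sketch of such a per-block precision choice, assuming the decision is driven by a condition-number estimate of each diagonal block (the `safety` threshold and the explicit Gauss-Jordan inverse are illustrative stand-ins for the numerical criterion of the full method):

```python
def one_norm(M):
    # 1-norm of a dense matrix: maximum absolute column sum
    n = len(M)
    return max(sum(abs(M[i][j]) for i in range(n)) for j in range(n))

def inverse(M):
    # Gauss-Jordan inverse with partial pivoting; the diagonal blocks of a
    # block-Jacobi preconditioner are small, so this is affordable here
    n = len(M)
    A = [row[:] + [1.0 if i == j else 0.0 for j in range(n)]
         for i, row in enumerate(M)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(A[r][col]))
        A[col], A[piv] = A[piv], A[col]
        p = A[col][col]
        A[col] = [v / p for v in A[col]]
        for r in range(n):
            if r != col:
                f = A[r][col]
                A[r] = [v - f * w for v, w in zip(A[r], A[col])]
    return [row[n:] for row in A]

def choose_precision(block, safety=1e-4):
    """Store a block in fp32 only if it is well conditioned enough that
    single-precision rounding cannot dominate; thresholds are illustrative."""
    kappa = one_norm(block) * one_norm(inverse(block))
    eps32 = 2.0 ** -24  # unit roundoff of IEEE binary32
    return 'fp32' if kappa * eps32 < safety else 'fp64'
```

Well-conditioned blocks are then fetched in the compact format, halving their memory traffic, while ill-conditioned blocks keep full precision so the preconditioner quality is preserved.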
The design of energy-efficient applications requires proper energy measurement inside the compute servers. The Megware SlideSX chassis for HPC servers provides an energy measurement directly between the power supply and the mainboard of the system, with a sampling rate of up to 100 Hz. This enables users to detect the energy-consuming parts of their applications. In order to obtain the energy...