Score: 1

Enabling mixed-precision in spectral element codes

Published: March 3, 2025 | arXiv ID: 2503.02134v2

By: Yanxiang Chen , Pablo de Oliveira Castro , Paolo Bientinesi and more

Potential Business Impact:

Makes supercomputers run faster and use less power.

Business Areas:

Quantum Computing Science and Engineering

Mixed-precision computing has the potential to significantly reduce the cost of exascale computations, but determining when and how to implement it in programs can be challenging. In this article, we propose a methodology for enabling mixed-precision with the help of computer arithmetic tools, roofline model, and computer arithmetic techniques. As case studies, we consider Nekbone, a mini-application for the Computational Fluid Dynamics (CFD) solver Nek5000, and a modern Neko CFD application. With the help of the Verificarlo tool and computer arithmetic techniques, we introduce a strategy to address stagnation issues in the preconditioned Conjugate Gradient method in Nekbone and apply these insights to implement a mixed-precision version of Neko. We evaluate the derived mixed-precision versions of these codes by combining metrics in three dimensions: accuracy, time-to-solution, and energy-to-solution. Notably, mixed-precision in Nekbone reduces time-to-solution by roughly 1.62x and energy-to-solution by 2.43x on MareNostrum 5, while in the real-world Neko application, the gain is up to 1.3x in both time and energy, with the accuracy that matches double-precision results.

Scaling the memory wall using mixed-precision -- HPG-MxP on an exascale machine

Distributed, Parallel, and Cluster Computing

Makes supercomputers run science problems 1.6x faster.

15 Jul 2025 0

87%

Leveraging Hardware-Aware Computation in Mixed-Precision Matrix Multiply: A Tile-Centric Approach

Distributed, Parallel, and Cluster Computing

Makes computers solve problems faster and use less power.

20 Aug 2025 0

86%

Performance and Numerical Aspects of Decompositional Factorizations with FP64 Floating-Point Emulation in INT8

Numerical Analysis

Makes computers faster and use less power.

28 Sep 2025 0

View PDF Login to Bookmark

Page Count

16 pages

Enabling mixed-precision in spectral element codes

Makes supercomputers run faster and use less power.

Technical Abstract

Scaling the memory wall using mixed-precision -- HPG-MxP on an exascale machine

Leveraging Hardware-Aware Computation in Mixed-Precision Matrix Multiply: A Tile-Centric Approach

Performance and Numerical Aspects of Decompositional Factorizations with FP64 Floating-Point Emulation in INT8