Preprints, Working Papers, ...

Mixed Precision Low Rank Approximations and their Application to Block Low Rank LU Factorization

Abstract : We introduce a novel approach to exploit mixed precision arithmetic for low-rank approximations. Our approach is based on the observation that singular vectors associated with small singular values can be stored in lower precisions while preserving high accuracy overall. We provide an explicit criterion to determine which level of precision is needed for each singular vector. We apply this approach to block low-rank (BLR) matrices, most of whose off-diagonal blocks have low rank. We propose a new BLR LU factorization algorithm that exploits the mixed precision representation of the blocks. We carry out the rounding error analysis of this algorithm and prove that the use of mixed precision arithmetic does not compromise the numerical stability of BLR LU factorization. Moreover, our analysis determines which level of precision is needed for each floating-point operation (flop), and therefore guides us towards an implementation that is both robust and efficient. We evaluate the potential of this new algorithm on a range of matrices coming from real-life problems in industrial and academic applications. We show that a large fraction of the entries in the LU factors and of the flops required to perform the BLR LU factorization can be safely switched to lower precisions, leading to significant reductions of the storage and flop costs, by up to a factor of three, using fp64, fp32, and bfloat16 arithmetic.
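The idea of storing singular vectors in different precisions can be illustrated with a minimal sketch. The abstract does not state the paper's explicit criterion, so the rule below is only an assumption made for illustration: a singular vector with singular value sigma is placed in the lowest precision whose unit roundoff u satisfies u * sigma <= eps * sigma_max, and is dropped entirely when sigma <= eps * sigma_max. The helper name `assign_precisions` and the tolerance parameter `eps` are hypothetical.

```python
# Unit roundoffs u = 2**(-p) for formats with p significand bits.
UNIT_ROUNDOFF = {
    "bfloat16": 2.0 ** -8,
    "fp32": 2.0 ** -24,
    "fp64": 2.0 ** -53,
}


def assign_precisions(singular_values, eps):
    """Label each singular vector with a storage precision.

    Hypothetical helper: the selection rule is an assumed heuristic,
    not the criterion derived in the paper.
    """
    if not singular_values:
        return []
    sigma_max = max(singular_values)
    labels = []
    for sigma in singular_values:
        # Below the target accuracy: truncate, as in a plain low-rank approximation.
        if sigma <= eps * sigma_max:
            labels.append("dropped")
            continue
        # Try the cheapest format first and keep the first one that is accurate enough.
        for name in ("bfloat16", "fp32", "fp64"):
            if UNIT_ROUNDOFF[name] * sigma <= eps * sigma_max:
                labels.append(name)
                break
        else:
            # eps is below the fp64 unit roundoff: fall back to the highest precision.
            labels.append("fp64")
    return labels


labels = assign_precisions([1.0, 1e-3, 1e-7, 1e-12], eps=1e-9)
print(labels)
```

With a rapidly decaying spectrum, only the leading vectors stay in fp64 under this assumed rule, which is the kind of storage saving the abstract describes.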

https://hal.archives-ouvertes.fr/hal-03251738
Contributor : Matthieu Gerest
Submitted on : Tuesday, June 8, 2021 - 12:00:37 PM
Last modification on : Tuesday, July 13, 2021 - 3:27:39 AM

File

mixedBLR.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-03251738, version 2

Citation

Patrick Amestoy, Olivier Boiteau, Alfredo Buttari, Matthieu Gerest, Fabienne Jézéquel, et al. Mixed Precision Low Rank Approximations and their Application to Block Low Rank LU Factorization. 2021. ⟨hal-03251738v2⟩
