LRAMM -- Low precision approximates GEMM via RSVD

Hongyaoxing Gu

arXiv:2405.16917·math.NA·May 28, 2024

LRAMM -- Low precision approximates GEMM via RSVD

Hongyaoxing Gu

PDF

Open Access

TL;DR

This paper introduces LRAMM, a high-performance approximate matrix multiplication method that combines mixed-precision quantization with RSVD and low-rank decomposition to improve efficiency while maintaining acceptable accuracy.

Contribution

LRAMM is a novel algorithm integrating mixed-precision quantization and RSVD, optimizing matrix multiplication speed with controlled error margins.

Findings

01

Significant speedup in matrix multiplication tasks.

02

Maintains accuracy within low-precision error bounds.

03

Effective use of low-rank decomposition for approximation.

Abstract

Matrix multiplication computation acceleration has been a research hotspot across various domains. Due to the characteristics of some applications, approximate matrix multiplication can achieve significant performance improvements without losing much precision. In this paper, we propose LRAMM - a high-performance matrix multiplication approximation algorithm that combines mixed-precision quantized matrix multiplication with RSVD techniques, further enhancing efficiency within the error range of low-precision matrix multiplication by utilizing matrix low-rank decomposition technology.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Numerical Analysis Techniques · Advanced Measurement and Metrology Techniques · Image and Object Detection Techniques