High-Rate Nested-Lattice Quantized Matrix Multiplication with Small Lookup Tables
Iris Kaplan, Or Ordentlich

TL;DR
This paper introduces a hierarchical nested lattice quantization method for matrix multiplication that allows high-rate quantization with small lookup tables, reducing memory requirements while maintaining low distortion.
Contribution
It proposes a novel rate-splitting hierarchical nested lattice quantization framework enabling high-rate quantization with small LUTs, improving efficiency over prior low-rate methods.
Findings
LUT size is reduced exponentially with the number of layers.
Analytic bounds show negligible loss compared to standard nested lattice quantizers.
Numerical results confirm low distortion with small LUTs.
Abstract
Recent work have shown that the quantization for matrix multiplication problem can be optimally solved by quantizing each column in each matrix using a nested lattice code, and then multiplying the de-quantized matrices. It was further demonstrated that when product codes of sub-dimension and rate are used, the de-quantization and inner product operations can be implemented with querying a lookup table (LUT) of size , but this is only useful when is sufficiently small. This in turn limits LUT-based inner product decoding to low-rate quantizers. In this work, we develop a rate hierarchical nested lattice quantization framework, which quantizes each vector to layers, and admits LUT-based inner product decoding using an LUT of size , allowing for high-rate quantization. We provide analytic bounds on the loss of the developed scheme compared…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Complexity and Algorithms in Graphs · Sparse and Compressive Sensing Techniques
