PrismQuant: Rate-Distortion-Optimal Vector Quantization for Gaussian-Mixture Sources
Bumsu Park, Chanho Park, Youngmok Park, Namyoon Lee

TL;DR
PrismQuant introduces a rate-distortion optimal vector quantization method for Gaussian-mixture sources, effectively handling heterogeneous local geometries and outperforming transformer-based codecs in real-world data.
Contribution
The paper develops a constructive RD theory for Gaussian-mixture sources and proposes PrismQuant, a practical coding scheme that approaches theoretical bounds.
Findings
PrismQuant closely approaches the theoretical RD bound on synthetic data.
PrismQuant outperforms transformer-based codecs on real-world CSI data.
The method achieves a vanishing asymptotic gap to the RD limit.
Abstract
For a Gaussian source under mean-squared error (MSE), classical transform coding is rate--distortion (RD) optimal: the Karhunen--Loeve transform (KLT) diagonalizes the covariance, reverse waterfilling allocates the bits, and scalar quantization closes the loop. This elegant story breaks down for multimodal sources, where no single covariance can capture heterogeneous local geometries, and the RD function loses its closed form. We revisit this problem through Gaussian-mixture sources and develop a constructive RD theory for them. Our key finding is that the mixture structure incurs only a component label cost. Conditioned on the active mixture component, each branch is Gaussian; the challenge is allocating bits across heterogeneous branches. We prove that the genie-aided conditional RD function is governed by a single global reverse-waterfilling level shared across all components and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
