Enhancing Vector Quantization with Distributional Matching: A Theoretical and Empirical Study

Xianghong Fang; Litao Guo; Hengchao Chen; Yuxuan Zhang; XiaofanXia; Dingjie Song; Yexin Liu; Hao Wang; Harry Yang; Yuan Yuan; Qiang Sun

arXiv:2506.15078·cs.CV·June 19, 2025

Enhancing Vector Quantization with Distributional Matching: A Theoretical and Empirical Study

Xianghong Fang, Litao Guo, Hengchao Chen, Yuxuan Zhang, XiaofanXia, Dingjie Song, Yexin Liu, Hao Wang, Harry Yang, Yuan Yuan, Qiang Sun

PDF

Open Access

TL;DR

This paper introduces a distributional matching approach using Wasserstein distance to improve vector quantization in autoregressive models, addressing training instability and codebook collapse, with validated empirical and theoretical results.

Contribution

It proposes a novel distributional matching method with Wasserstein distance to enhance codebook utilization and reduce quantization errors in vector quantization.

Findings

01

Near 100% codebook utilization achieved.

02

Significant reduction in quantization error.

03

Validated effectiveness through empirical and theoretical analysis.

Abstract

The success of autoregressive models largely depends on the effectiveness of vector quantization, a technique that discretizes continuous features by mapping them to the nearest code vectors within a learnable codebook. Two critical issues in existing vector quantization methods are training instability and codebook collapse. Training instability arises from the gradient discrepancy introduced by the straight-through estimator, especially in the presence of significant quantization errors, while codebook collapse occurs when only a small subset of code vectors are utilized during training. A closer examination of these issues reveals that they are primarily driven by a mismatch between the distributions of the features and code vectors, leading to unrepresentative code vectors and significant data information loss during compression. To address this, we employ the Wasserstein distance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Data Compression Techniques · Neural Networks and Applications