Balance of Number of Embedding and their Dimensions in Vector Quantization
Hang Chen, Sankepally Sainath Reddy, Ziwei Chen, Dianbo Liu

TL;DR
This paper investigates the balance between embedding dimensions and codebook size in vector quantization, proposing an adaptive method that dynamically optimizes these parameters to improve VQ-VAE performance.
Contribution
It introduces a novel adaptive dynamic quantization approach using Gumbel-Softmax to automatically select optimal codebook configurations during training.
Findings
Augmenting codebook size while reducing embedding dimensions enhances VQ-VAE effectiveness.
The proposed method outperforms static hyperparameter settings across benchmark datasets.
Dynamic quantization significantly improves model flexibility and performance.
Abstract
The dimensionality of the embedding and the number of available embeddings ( also called codebook size) are critical factors influencing the performance of Vector Quantization(VQ), a discretization process used in many models such as the Vector Quantized Variational Autoencoder (VQ-VAE) architecture. This study examines the balance between the codebook sizes and dimensions of embeddings in VQ, while maintaining their product constant. Traditionally, these hyper parameters are static during training; however, our findings indicate that augmenting the codebook size while simultaneously reducing the embedding dimension can significantly boost the effectiveness of the VQ-VAE. As a result, the strategic selection of codebook size and embedding dimensions, while preserving the capacity of the discrete codebook space, is critically important. To address this, we propose a novel adaptive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Compression Techniques
MethodsVQ-VAE
