Switchable Token-Specific Codebook Quantization For Face Image Compression
Yongbo Wang, Haonan Wang, Guodong Mu, Ruixin Zhang, Jiaqi Chen, Jingyun Zhang, Jun Wang, Yuan Xie, Zhizhong Zhang, Shouhong Ding

TL;DR
This paper introduces a switchable token-specific codebook quantization method for face image compression, which learns category-specific codebooks to improve reconstruction quality at low bits per pixel, outperforming existing global codebook strategies.
Contribution
It proposes a novel approach that assigns independent codebooks to tokens based on image categories, enhancing expressive capacity and reconstruction performance in face image compression.
Findings
Achieves 93.51% accuracy on face recognition datasets at 0.05 bpp.
Outperforms traditional global codebook methods in low bpp scenarios.
Demonstrates generalizability to existing codebook-based learning approaches.
Abstract
With the ever-increasing volume of visual data, the efficient and lossless transmission, along with its subsequent interpretation and understanding, has become a critical bottleneck in modern information systems. The emerged codebook-based solution utilize a globally shared codebook to quantize and dequantize each token, controlling the bpp by adjusting the number of tokens or the codebook size. However, for facial images, which are rich in attributes, such global codebook strategies overlook both the category-specific correlations within images and the semantic differences among tokens, resulting in suboptimal performance, especially at low bpp. Motivated by these observations, we propose a Switchable Token-Specific Codebook Quantization for face image compression, which learns distinct codebook groups for different image categories and assigns an independent codebook to each token. By…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
