Information-theoretic Generalization Analysis for VQ-VAEs: A Role of Latent Variables

Futoshi Futami; Masahiro Fujisawa

arXiv:2505.19470·stat.ML·November 7, 2025

Information-theoretic Generalization Analysis for VQ-VAEs: A Role of Latent Variables

Futoshi Futami, Masahiro Fujisawa

PDF

Open Access 1 Video

TL;DR

This paper extends information-theoretic analysis to VQ-VAEs, deriving bounds on generalization error and data distribution divergence, highlighting the role of latent variables in unsupervised learning.

Contribution

It introduces a novel data-dependent prior and derives new bounds linking latent variables, generalization, and data generation in VQ-VAEs.

Findings

01

Generalization error bound depends on latent variables and encoder complexity.

02

Upper bound on Wasserstein distance explains the impact of LV regularization.

03

Latent variables significantly influence data generation quality.

Abstract

Latent variables (LVs) play a crucial role in encoder-decoder models by enabling effective data compression, prediction, and generation. Although their theoretical properties, such as generalization, have been extensively studied in supervised learning, similar analyses for unsupervised models such as variational autoencoders (VAEs) remain insufficiently underexplored. In this work, we extend information-theoretic generalization analysis to vector-quantized (VQ) VAEs with discrete latent spaces, introducing a novel data-dependent prior to rigorously analyze the relationship among LVs, generalization, and data generation. We derive a novel generalization error bound of the reconstruction loss of VQ-VAEs, which depends solely on the complexity of LVs and the encoder, independent of the decoder. Additionally, we provide the upper bound of the 2-Wasserstein distance between the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Information-theoretic Generalization Analysis for VQ-VAEs: A Role of Latent Variables· slideslive

Taxonomy

TopicsIndustrial Vision Systems and Defect Detection