Information-Geometric Decomposition of Generalization Error in Unsupervised Learning
Gilhan Kim

TL;DR
This paper presents an exact information-geometric decomposition of the generalization error in unsupervised learning, applied specifically to epsilon-PCA, revealing optimal model complexity and phase transitions.
Contribution
It introduces an exact decomposition of the KL divergence-based generalization error into interpretable components for e-flat models, with a detailed analysis of epsilon-PCA.
Findings
Decomposition into model error, data bias, and variance components.
Optimal cutoff eigenvalue at epsilon, balancing error and bias.
Identification of phase regimes: retain-all, interior, and collapse.
Abstract
We decompose the Kullback--Leibler generalization error (GE) -- the expected KL divergence from the data distribution to the trained model -- of unsupervised learning into three non-negative components: model error, data bias, and variance. The decomposition is exact for any e-flat model class and follows from two identities of information geometry: the generalized Pythagorean theorem and a dual e-mixture variance identity. As an analytically tractable demonstration, we apply the framework to -PCA, a regularized principal component analysis in which the empirical covariance is truncated at rank and discarded directions are pinned at a fixed noise floor . Although rank-constrained -PCA is not itself e-flat, it admits a technical reformulation with the same total GE on isotropic Gaussian data, under which each component of the decomposition takes closed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
