Evidence Slopes and Effective Dimension in Singular Linear Models
Kalyaan Rao

TL;DR
This paper investigates the limitations of traditional Bayesian model selection methods in singular models, demonstrating how the real log canonical threshold (RLCT) provides a more accurate measure of effective dimension and improves evidence estimation.
Contribution
It analytically and empirically characterizes the error growth of Laplace/BIC in singular models and introduces RLCT-aware corrections for accurate evidence estimation.
Findings
Laplace/BIC error grows linearly with (d/2 - RLCT) log n.
RLCT-aware correction recovers correct evidence slope.
Evidence slopes can estimate effective dimension in linear models.
Abstract
Bayesian model selection commonly relies on Laplace approximation or the Bayesian Information Criterion (BIC), which assume that the effective model dimension equals the number of parameters. Singular learning theory replaces this assumption with the real log canonical threshold (RLCT), an effective dimension that can be strictly smaller in overparameterized or rank-deficient models. We study linear-Gaussian rank models and linear subspace (dictionary) models in which the exact marginal likelihood is available in closed form and the RLCT is analytically tractable. In this setting, we show theoretically and empirically that the error of Laplace/BIC grows linearly with (d/2 minus lambda) times log n, where d is the ambient parameter dimension and lambda is the RLCT. An RLCT-aware correction recovers the correct evidence slope and is invariant to overcomplete reparameterizations that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Bayesian Inference · Bayesian Methods and Mixture Models · Statistical Methods and Inference
