On model misspecification and KL separation for Gaussian graphical models
Varun Jog, Po-Ling Loh

TL;DR
This paper derives bounds on the KL divergence between Gaussian graphical models based on their edge differences, highlighting the importance of accurate edge structure estimation for distribution approximation.
Contribution
It establishes tight bounds on KL divergence in terms of graph edge differences and provides sample size requirements for correct model selection.
Findings
KL divergence is bounded below by a constant when graphs differ by at least one edge.
Sample size bounds are derived for accurate model selection using maximum likelihood.
Accurate edge estimation is crucial for close approximation of the true distribution.
Abstract
We establish bounds on the KL divergence between two multivariate Gaussian distributions in terms of the Hamming distance between the edge sets of the corresponding graphical models. We show that the KL divergence is bounded below by a constant when the graphs differ by at least one edge; this is essentially the tightest possible bound, since classes of graphs exist for which the edge discrepancy increases but the KL divergence remains bounded above by a constant. As a natural corollary to our KL lower bound, we also establish a sample size requirement for correct model selection via maximum likelihood estimation. Our results rigorize the notion that it is essential to estimate the edge structure of a Gaussian graphical model accurately in order to approximate the true distribution to close precision.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
