On model misspecification and KL separation for Gaussian graphical   models

Varun Jog; Po-Ling Loh

arXiv:1501.02320·cs.IT·April 6, 2015

On model misspecification and KL separation for Gaussian graphical models

Varun Jog, Po-Ling Loh

PDF

TL;DR

This paper derives bounds on the KL divergence between Gaussian graphical models based on their edge differences, highlighting the importance of accurate edge structure estimation for distribution approximation.

Contribution

It establishes tight bounds on KL divergence in terms of graph edge differences and provides sample size requirements for correct model selection.

Findings

01

KL divergence is bounded below by a constant when graphs differ by at least one edge.

02

Sample size bounds are derived for accurate model selection using maximum likelihood.

03

Accurate edge estimation is crucial for close approximation of the true distribution.

Abstract

We establish bounds on the KL divergence between two multivariate Gaussian distributions in terms of the Hamming distance between the edge sets of the corresponding graphical models. We show that the KL divergence is bounded below by a constant when the graphs differ by at least one edge; this is essentially the tightest possible bound, since classes of graphs exist for which the edge discrepancy increases but the KL divergence remains bounded above by a constant. As a natural corollary to our KL lower bound, we also establish a sample size requirement for correct model selection via maximum likelihood estimation. Our results rigorize the notion that it is essential to estimate the edge structure of a Gaussian graphical model accurately in order to approximate the true distribution to close precision.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.