Learning Gaussian Tree Models: Analysis of Error Exponents and Extremal Structures
Vincent Y. F. Tan, Animashree Anandkumar, Alan S. Willsky

TL;DR
This paper analyzes the error rates in learning Gaussian tree models from data, revealing that star and chain structures are respectively the hardest and easiest to learn, depending on correlation strengths and graph topology.
Contribution
It characterizes the error exponents for structure learning in Gaussian trees and identifies extremal structures that minimize or maximize learning difficulty.
Findings
Star structure minimizes the error exponent, making it hardest to learn.
Chain structure maximizes the error exponent, making it easiest to learn.
Correlation decay explains the difficulty in distinguishing distant nodes.
Abstract
The problem of learning tree-structured Gaussian graphical models from independent and identically distributed (i.i.d.) samples is considered. The influence of the tree structure and the parameters of the Gaussian distribution on the learning rate as the number of samples increases is discussed. Specifically, the error exponent corresponding to the event that the estimated tree structure differs from the actual unknown tree structure of the distribution is analyzed. Finding the error exponent reduces to a least-squares problem in the very noisy learning regime. In this regime, it is shown that the extremal tree structure that minimizes the error exponent is the star for any fixed set of correlation coefficients on the edges of the tree. If the magnitudes of all the correlation coefficients are less than 0.63, it is also shown that the tree structure that maximizes the error exponent is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
