Multi-variate correlation and mixtures of product measures
Tim Austin

TL;DR
This paper extends the theory of total correlation and dual total correlation to general random variables and reveals that small TC or DTC implies the distribution is close to a product measure or a mixture of such measures, respectively.
Contribution
It generalizes the definitions of TC and DTC beyond finite variables and establishes new structural results linking small DTC to mixtures of product measures.
Findings
Small TC implies distribution is close to a product measure.
Small DTC implies distribution is a mixture of near-product measures.
Provides new theoretical insights into the structure of distributions with low correlation.
Abstract
Total correlation (`TC') and dual total correlation (`DTC') are two classical ways to quantify the correlation among an -tuple of random variables. They both reduce to mutual information when . The first part of this paper sets up the theory of TC and DTC for general random variables, not necessarily finite-valued. This generality has not been exposed in the literature before. The second part considers the structural implications when a joint distribution has small TC or DTC. If , then is close to a product measure according to a suitable transportation metric: this follows directly from Marton's classical transportation-entropy inequality. If , then the structural consequence is more complicated: is a mixture of a controlled number of terms, most of them close to product measures in the transportation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
