Reducing the Variance of Variational Estimates of Mutual Information by   Limiting the Critic's Hypothesis Space to RKHS

P Aditya Sreekar; Ujjwal Tiwari; Anoop Namboodiri

arXiv:2011.08651·cs.LG·November 18, 2020

Reducing the Variance of Variational Estimates of Mutual Information by Limiting the Critic's Hypothesis Space to RKHS

P Aditya Sreekar, Ujjwal Tiwari, Anoop Namboodiri

PDF

1 Repo

TL;DR

This paper proposes constraining the critic's hypothesis space to RKHS to reduce variance in variational MI estimates, improving reliability especially for high MI scenarios.

Contribution

It introduces a novel regularization approach by limiting the critic's hypothesis space to RKHS, supported by theoretical analysis and empirical validation across multiple MI estimators.

Findings

01

Reduced variance in MI estimates with RKHS constraint

02

Effective bias-variance tradeoff achieved through regularization

03

Improved estimation accuracy demonstrated on four variational bounds

Abstract

Mutual information (MI) is an information-theoretic measure of dependency between two random variables. Several methods to estimate MI, from samples of two random variables with unknown underlying probability distributions have been proposed in the literature. Recent methods realize parametric probability distributions or critic as a neural network to approximate unknown density ratios. The approximated density ratios are used to estimate different variational lower bounds of MI. While these methods provide reliable estimation when the true MI is low, they produce high variance estimates in cases of high MI. We argue that the high variance characteristic is due to the uncontrolled complexity of the critic's hypothesis space. In support of this argument, we use the data-driven Rademacher complexity of the hypothesis space associated with the critic's architecture to analyse…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

blackPython/mi_estimator
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.