MINE: Mutual Information Neural Estimation

Mohamed Ishmael Belghazi; Aristide Baratin; Sai Rajeswar; Sherjil; Ozair; Yoshua Bengio; Aaron Courville; R Devon Hjelm

arXiv:1801.04062·cs.LG·August 17, 2021·308 cites

MINE: Mutual Information Neural Estimation

Mohamed Ishmael Belghazi, Aristide Baratin, Sai Rajeswar, Sherjil, Ozair, Yoshua Bengio, Aaron Courville, R Devon Hjelm

PDF

Open Access 5 Repos

TL;DR

MINE introduces a neural network-based method for efficiently estimating mutual information in high-dimensional data, enabling improved applications in generative modeling and supervised classification.

Contribution

The paper presents a scalable, trainable neural estimator for mutual information that is consistent and applicable to various machine learning tasks.

Findings

01

MINE is linearly scalable with data dimensionality and sample size.

02

Applying MINE improves the performance of adversarially trained generative models.

03

Using MINE enhances the effectiveness of the Information Bottleneck in classification tasks.

Abstract

We argue that the estimation of mutual information between high dimensional continuous random variables can be achieved by gradient descent over neural networks. We present a Mutual Information Neural Estimator (MINE) that is linearly scalable in dimensionality as well as in sample size, trainable through back-prop, and strongly consistent. We present a handful of applications on which MINE can be used to minimize or maximize mutual information. We apply MINE to improve adversarially trained generative models. We also use MINE to implement Information Bottleneck, applying it to supervised classification; our results demonstrate substantial improvement in flexibility and performance in these settings.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning