Deep Sufficient Representation Learning via Mutual Information

Siming Zheng; Yuanyuan Lin; Jian Huang

arXiv:2207.10772·stat.ML·July 25, 2022

Deep Sufficient Representation Learning via Mutual Information

Siming Zheng, Yuanyuan Lin, Jian Huang

PDF

Open Access

TL;DR

This paper introduces MSRL, a mutual information-based method using deep neural networks to learn sufficient data representations, with proven consistency, error bounds, and superior performance in experiments.

Contribution

It presents a novel deep neural network approach for mutual information-based sufficient representation learning with theoretical guarantees and empirical validation.

Findings

01

MSRL is consistent in approximating conditional densities.

02

Non-asymptotic error bounds are established for MSRL.

03

MSRL outperforms existing nonlinear dimension reduction methods.

Abstract

We propose a mutual information-based sufficient representation learning (MSRL) approach, which uses the variational formulation of the mutual information and leverages the approximation power of deep neural networks. MSRL learns a sufficient representation with the maximum mutual information with the response and a user-selected distribution. It can easily handle multi-dimensional continuous or categorical response variables. MSRL is shown to be consistent in the sense that the conditional probability density function of the response variable given the learned representation converges to the conditional probability density function of the response variable given the predictor. Non-asymptotic error bounds for MSRL are also established under suitable conditions. To establish the error bounds, we derive a generalized Dudley's inequality for an order-two U-process indexed by deep neural…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Domain Adaptation and Few-Shot Learning · Statistical Methods and Inference