A Differential Entropy Estimator for Training Neural Networks

Georg Pichler; Pierre Colombo; Malik Boudiaf; G\"unther; Koliander; Pablo Piantanida

arXiv:2202.06618·cs.LG·June 22, 2022·1 cites

A Differential Entropy Estimator for Training Neural Networks

Georg Pichler, Pierre Colombo, Malik Boudiaf, G\"unther, Koliander, Pablo Piantanida

PDF

Open Access 1 Repo

TL;DR

This paper introduces KNIFE, a novel differentiable kernel-based estimator for differential entropy, enabling its use as a regularizer in neural network training across various high-dimensional tasks.

Contribution

The paper presents KNIFE, a fully parameterized, differentiable estimator for differential entropy and mutual information, overcoming previous limitations and broadening their application in neural network training.

Findings

01

KNIFE effectively estimates differential entropy in high-dimensional data.

02

Using KNIFE improves neural network training in tasks like domain adaptation and fair classification.

03

The method is validated on synthetic and real-world datasets, demonstrating versatility and accuracy.

Abstract

Mutual Information (MI) has been widely used as a loss regularizer for training neural networks. This has been particularly effective when learn disentangled or compressed representations of high dimensional data. However, differential entropy (DE), another fundamental measure of information, has not found widespread use in neural network training. Although DE offers a potentially wider range of applications than MI, off-the-shelf DE estimators are either non differentiable, computationally intractable or fail to adapt to changes in the underlying distribution. These drawbacks prevent them from being used as regularizers in neural networks training. To address shortcomings in previously proposed estimators for DE, here we introduce KNIFE, a fully parameterized, differentiable kernel-based estimator of DE. The flexibility of our approach also allows us to construct KNIFE-based estimators…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

g-pichler/knife
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis · Face recognition and analysis

MethodsNetwork On Network