Combating the Instability of Mutual Information-based Losses via   Regularization

Kwanghee Choi; Siyeong Lee

arXiv:2011.07932·cs.LG·June 22, 2022

Combating the Instability of Mutual Information-based Losses via Regularization

Kwanghee Choi, Siyeong Lee

PDF

Open Access 1 Repo

TL;DR

This paper addresses the instability issues in mutual information-based neural network training by introducing a novel regularization technique, demonstrating improved stability and performance across various benchmarks.

Contribution

The authors propose a new regularization method that stabilizes MI-based loss training and introduce a benchmark for evaluating MI estimation and downstream task performance.

Findings

01

Regularization stabilizes MI-based loss training.

02

Regularized losses outperform unregularized counterparts.

03

Benchmark evaluates MI estimation and downstream task performance.

Abstract

Notable progress has been made in numerous fields of machine learning based on neural network-driven mutual information (MI) bounds. However, utilizing the conventional MI-based losses is often challenging due to their practical and mathematical limitations. In this work, we first identify the symptoms behind their instability: (1) the neural network not converging even after the loss seemed to converge, and (2) saturating neural network outputs causing the loss to diverge. We mitigate both issues by adding a novel regularization term to the existing losses. We theoretically and experimentally demonstrate that added regularization stabilizes training. Finally, we present a novel benchmark that evaluates MI-based losses on both the MI estimation power and its capability on the downstream tasks, closely following the pre-existing supervised and contrastive learning settings. We evaluate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

siyeong-lee/deconstructing-mine
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Machine Learning and ELM · Adversarial Robustness in Machine Learning

MethodsContrastive Learning