On the Information Plane of Autoencoders

Nicol\'as I. Tapia; Pablo A. Est\'evez

arXiv:2005.07783·cs.LG·October 6, 2020

On the Information Plane of Autoencoders

Nicol\'as I. Tapia, Pablo A. Est\'evez

PDF

1 Repo

TL;DR

This paper provides a theoretical analysis of the Information Plane in autoencoders, clarifying how information compression depends on bottleneck size, and introduces a new MI estimation adjustment to improve experimental accuracy.

Contribution

It derives a theoretical convergence for the IP of autoencoders and proposes a new MI estimator adjustment to align experiments with theory.

Findings

01

Large bottleneck autoencoders do not compress input information.

02

Small bottleneck autoencoders cause compression in encoder layers.

03

The proposed MI adjustment improves experimental IP estimates.

Abstract

The training dynamics of hidden layers in deep learning are poorly understood in theory. Recently, the Information Plane (IP) was proposed to analyze them, which is based on the information-theoretic concept of mutual information (MI). The Information Bottleneck (IB) theory predicts that layers maximize relevant information and compress irrelevant information. Due to the limitations in MI estimation from samples, there is an ongoing debate about the properties of the IP for the supervised learning case. In this work, we derive a theoretical convergence for the IP of autoencoders. The theory predicts that ideal autoencoders with a large bottleneck layer size do not compress input information, whereas a small size causes compression only in the encoder layers. For the experiments, we use a Gram-matrix based MI estimator recently proposed in the literature. We propose a new rule to adjust…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nicolasigor/entropy
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.