TL;DR
This paper investigates the role of the compression phase in autoencoders' information processing, finding that it is not essential for generalization across different autoencoder types, challenging previous assumptions.
Contribution
The study demonstrates that the compression phase is not universally necessary for generalization, using experiments with various autoencoders and mutual information estimation.
Findings
Vanilla autoencoders show a compression phase with sufficient data.
Sparsity regularization amplifies the compression phase.
Some autoencoders, like variational autoencoders, generalize well without compression.
Abstract
The outstanding performance of deep learning in various fields has been a fundamental query, which can be potentially examined using information theory that interprets the learning process as the transmission and compression of information. Information plane analyses of the mutual information between the input-hidden-output layers demonstrated two distinct learning phases of fitting and compression. It is debatable if the compression phase is necessary to generalize the input-output relations extracted from training data. In this study, we investigated this through experiments with various species of autoencoders and evaluated their information processing phase with an accurate kernel-based estimator of mutual information. Given sufficient training data, vanilla autoencoders demonstrated the compression phase, which was amplified after imposing sparsity regularization for hidden…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
