Compression of Structured Data with Autoencoders: Provable Benefit of Nonlinearities and Depth
Kevin K\"ogler, Alexander Shevchenko, Hamed Hassani, Marco Mondelli

TL;DR
This paper provides theoretical insights into autoencoders' ability to capture data structure, revealing limitations in shallow models for sparse data and proposing methods to improve compression performance through nonlinearities and depth.
Contribution
It proves that shallow autoencoders ignore sparsity in data, identifies a phase transition in gradient descent solutions, and introduces denoising and multi-layer techniques to enhance compression of sparse data.
Findings
Gradient descent ignores sparsity in shallow autoencoders.
A phase transition in the minimizer shape depends on data sparsity.
Adding denoising and multiple layers improves compression performance.
Abstract
Autoencoders are a prominent model in many empirical branches of machine learning and lossy data compression. However, basic theoretical questions remain unanswered even in a shallow two-layer setting. In particular, to what degree does a shallow autoencoder capture the structure of the underlying data distribution? For the prototypical case of the 1-bit compression of sparse Gaussian data, we prove that gradient descent converges to a solution that completely disregards the sparse structure of the input. Namely, the performance of the algorithm is the same as if it was compressing a Gaussian source - with no sparsity. For general data distributions, we give evidence of a phase transition phenomenon in the shape of the gradient descent minimizer, as a function of the data sparsity: below the critical sparsity level, the minimizer is a rotation taken uniformly at random (just like in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Compression Techniques · Neural Networks and Applications
