Compression of Structured Data with Autoencoders: Provable Benefit of   Nonlinearities and Depth

Kevin K\"ogler; Alexander Shevchenko; Hamed Hassani; Marco Mondelli

arXiv:2402.05013·cs.LG·February 8, 2024·1 cites

Compression of Structured Data with Autoencoders: Provable Benefit of Nonlinearities and Depth

Kevin K\"ogler, Alexander Shevchenko, Hamed Hassani, Marco Mondelli

PDF

Open Access

TL;DR

This paper provides theoretical insights into autoencoders' ability to capture data structure, revealing limitations in shallow models for sparse data and proposing methods to improve compression performance through nonlinearities and depth.

Contribution

It proves that shallow autoencoders ignore sparsity in data, identifies a phase transition in gradient descent solutions, and introduces denoising and multi-layer techniques to enhance compression of sparse data.

Findings

01

Gradient descent ignores sparsity in shallow autoencoders.

02

A phase transition in the minimizer shape depends on data sparsity.

03

Adding denoising and multiple layers improves compression performance.

Abstract

Autoencoders are a prominent model in many empirical branches of machine learning and lossy data compression. However, basic theoretical questions remain unanswered even in a shallow two-layer setting. In particular, to what degree does a shallow autoencoder capture the structure of the underlying data distribution? For the prototypical case of the 1-bit compression of sparse Gaussian data, we prove that gradient descent converges to a solution that completely disregards the sparse structure of the input. Namely, the performance of the algorithm is the same as if it was compressing a Gaussian source - with no sparsity. For general data distributions, we give evidence of a phase transition phenomenon in the shape of the gradient descent minimizer, as a function of the data sparsity: below the critical sparsity level, the minimizer is a rotation taken uniformly at random (just like in the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Data Compression Techniques · Neural Networks and Applications