Dimensionality compression and expansion in Deep Neural Networks

Stefano Recanatesi; Matthew Farrell; Madhu Advani; Timothy Moore,; Guillaume Lajoie; Eric Shea-Brown

arXiv:1906.00443·cs.LG·October 29, 2019·42 cites

Dimensionality compression and expansion in Deep Neural Networks

Stefano Recanatesi, Matthew Farrell, Madhu Advani, Timothy Moore,, Guillaume Lajoie, Eric Shea-Brown

PDF

Open Access

TL;DR

This paper investigates how deep neural networks learn low-dimensional manifolds from high-dimensional data, revealing a two-phase process of expansion and compression that relates to their generalization ability.

Contribution

It introduces a novel analysis of neural network training dynamics, showing how intrinsic dimensionality evolves and influences generalization, supported by new dimensionality estimation techniques.

Findings

01

Neural networks undergo a two-phase process: expansion then compression of data representations.

02

Noise from stochastic gradient descent acts as a regularizer balancing representation dimensionality.

03

Low-dimensional representations are linked to better generalization performance.

Abstract

Datasets such as images, text, or movies are embedded in high-dimensional spaces. However, in important cases such as images of objects, the statistical structure in the data constrains samples to a manifold of dramatically lower dimensionality. Learning to identify and extract task-relevant variables from this embedded manifold is crucial when dealing with high-dimensional problems. We find that neural networks are often very effective at solving this task and investigate why. To this end, we apply state-of-the-art techniques for intrinsic dimensionality estimation to show that neural networks learn low-dimensional manifolds in two phases: first, dimensionality expansion driven by feature generation in initial layers, and second, dimensionality compression driven by the selection of task-relevant features in later layers. We model noise generated by Stochastic Gradient Descent and show…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Image and Signal Denoising Methods · Advanced Image Processing Techniques