Depth and Representation in Vision Models
Benjamin L. Badger

TL;DR
This paper investigates how deep convolutional models encode input information across layers, revealing that deeper layers produce more abstract, less invertible representations that transform inputs to match training data manifolds, linking recognition and generation tasks.
Contribution
It demonstrates the non-invertibility and abstraction in deep layer representations and shows how training enhances early layer clarity but not late layers, supporting the inseparability of recognition and generation.
Findings
Deeper layers have less accurate input reconstructions before training.
Increased training improves early layer representations but not late layers.
Deep representations transform inputs to match training data manifolds.
Abstract
Deep learning models develop successive representations of their input in sequential layers, the last of which maps the final representation to the output. Here we investigate the informational content of these representations by observing the ability of convolutional image classification models to autoencode the model's input using embeddings existing in various layers. We find that the deeper the layer, the less accurate that layer's representation of the input is before training. Inaccurate representation results from non-uniqueness in which various distinct inputs give approximately the same embedding. Non-unique representation is a consequence of both exact and approximate non-invertibility of transformations present in the forward pass. Learning to classify natural images leads to an increase in representation clarity for early but not late layers, which instead form abstract…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications
