Internal Representations of Vision Models Through the Lens of Frames on Data Manifolds
Henry Kvinge, Grayson Jorgenson, Davis Brown, Charles Godfrey, Tegan, Emerson

TL;DR
This paper introduces a novel 'neural frame' approach to analyze how deep learning models process local data variations, revealing insights into invariance, generalization, and the effects of training choices.
Contribution
It proposes a new geometric framework for understanding internal representations in vision models, linking perturbations to model behavior across layers.
Findings
Training with augmentation induces model invariance.
Trade-off observed between adversarial robustness and generalization.
Layer-wise processing of local variations offers new insights.
Abstract
While the last five years have seen considerable progress in understanding the internal representations of deep learning models, many questions remain. This is especially true when trying to understand the impact of model design choices, such as model architecture or training algorithm, on hidden representation geometry and dynamics. In this work we present a new approach to studying such representations inspired by the idea of a frame on the tangent bundle of a manifold. Our construction, which we call a neural frame, is formed by assembling a set of vectors representing specific types of perturbations of a data point, for example infinitesimal augmentations, noise perturbations, or perturbations produced by a generative model, and studying how these change as they pass through a network. Using neural frames, we make observations about the way that models process, layer-by-layer,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Model Reduction and Neural Networks · Generative Adversarial Networks and Image Synthesis
