Function space analysis of deep learning representation layers
Oren Elisha, Shai Dekel

TL;DR
This paper introduces a function space method to analyze deep learning layers, showing how the Besov smoothness index quantifies feature space geometry and correlates with network depth and data quality.
Contribution
It presents a novel application of Besov smoothness analysis to deep learning representations, linking geometric properties to network performance and generalization.
Findings
Besov smoothness increases across layers in well-trained networks.
Adding mis-labeling decreases the Besov smoothness of representations.
The approach helps understand deep learning generalization and feature geometry.
Abstract
In this paper we propose a function space approach to Representation Learning and the analysis of the representation layers in deep learning architectures. We show how to compute a weak-type Besov smoothness index that quantifies the geometry of the clustering in the feature space. This approach was already applied successfully to improve the performance of machine learning algorithms such as the Random Forest and tree-based Gradient Boosting. Our experiments demonstrate that in well-known and well-performing trained networks, the Besov smoothness of the training set, measured in the corresponding hidden layer feature map representation, increases from layer to layer. We also contribute to the understanding of generalization by showing how the Besov smoothness of the representations, decreases as we add more mis-labeling to the training data. We hope this approach will contribute to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Generative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning
