An unsupervised tour through the hidden pathways of deep neural networks
Diego Doimo

TL;DR
This thesis explores the internal mechanisms of deep neural networks, introducing unsupervised methods to analyze their representations, the evolution of probability densities across layers, and insights into generalization and redundancy.
Contribution
It presents novel unsupervised tools for understanding neural representations, including a method for intrinsic dimension estimation and analysis of density evolution and redundancy in deep networks.
Findings
Layer-wise density peaks mirror semantic hierarchies
Redundant neurons emerge with regularization and zero training error
Intrinsic dimension can be estimated efficiently without data decimation
Abstract
The goal of this thesis is to improve our understanding of the internal mechanisms by which deep artificial neural networks create meaningful representations and are able to generalize. We focus on the challenge of characterizing the semantic content of the hidden representations with unsupervised learning tools, partially developed by us and described in this thesis, which allow harnessing the low-dimensional structure of the data. Chapter 2. introduces Gride, a method that allows estimating the intrinsic dimension of the data as an explicit function of the scale without performing any decimation of the data set. Our approach is based on rigorous distributional results that enable the quantification of uncertainty of the estimates. Moreover, our method is simple and computationally efficient since it relies only on the distances among nearest data points. In Chapter 3, we study the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
