Emergence of hierarchical modes from deep learning
Chan Li, Haiping Huang

TL;DR
This paper introduces a mode decomposition learning method that interprets deep neural network weights as hierarchical latent modes, reducing training costs and enhancing interpretability by revealing a compact, disentangled latent space.
Contribution
It proposes a novel mode decomposition approach that explains network performance through hierarchical modes, which grow logarithmically with network width and simplify understanding of deep learning models.
Findings
Modes increase logarithmically with network width
Mode decomposition reduces training costs significantly
Modes provide a more disentangled, interpretable latent space
Abstract
Large-scale deep neural networks consume expensive training costs, but the training results in less-interpretable weight matrices constructing the networks. Here, we propose a mode decomposition learning that can interpret the weight matrices as a hierarchy of latent modes. These modes are akin to patterns in physics studies of memory networks, but the least number of modes increases only logarithmically with the network width, and becomes even a constant when the width further grows. The mode decomposition learning not only saves a significant large amount of training costs, but also explains the network performance with the leading modes, displaying a striking piecewise power-law behavior. The modes specify a progressively compact latent space across the network hierarchy, making a more disentangled subspaces compared to standard training. Our mode decomposition learning is also…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Reservoir Computing · Quantum many-body systems · Spectroscopy and Quantum Chemical Studies
