Emergence of hierarchical modes from deep learning

Chan Li; Haiping Huang

arXiv:2208.09859·cs.LG·April 13, 2023

Emergence of hierarchical modes from deep learning

Chan Li, Haiping Huang

PDF

Open Access

TL;DR

This paper introduces a mode decomposition learning method that interprets deep neural network weights as hierarchical latent modes, reducing training costs and enhancing interpretability by revealing a compact, disentangled latent space.

Contribution

It proposes a novel mode decomposition approach that explains network performance through hierarchical modes, which grow logarithmically with network width and simplify understanding of deep learning models.

Findings

01

Modes increase logarithmically with network width

02

Mode decomposition reduces training costs significantly

03

Modes provide a more disentangled, interpretable latent space

Abstract

Large-scale deep neural networks consume expensive training costs, but the training results in less-interpretable weight matrices constructing the networks. Here, we propose a mode decomposition learning that can interpret the weight matrices as a hierarchy of latent modes. These modes are akin to patterns in physics studies of memory networks, but the least number of modes increases only logarithmically with the network width, and becomes even a constant when the width further grows. The mode decomposition learning not only saves a significant large amount of training costs, but also explains the network performance with the leading modes, displaying a striking piecewise power-law behavior. The modes specify a progressively compact latent space across the network hierarchy, making a more disentangled subspaces compared to standard training. Our mode decomposition learning is also…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Reservoir Computing · Quantum many-body systems · Spectroscopy and Quantum Chemical Studies