Learning Structures for Deep Neural Networks
Jinhui Yuan, Fei Pan, Chunting Zhou, Tao Qin, Tie-Yan Liu

TL;DR
This paper introduces an unsupervised structure learning method for deep neural networks based on the efficient coding principle, maximizing mutual information and output entropy to automatically determine network architecture.
Contribution
It proposes a novel unsupervised approach using global group sparse coding to learn network structure, including depth and connectivity, without labeled data.
Findings
Achieves classification accuracy comparable to expert-designed CNNs.
Automatically discovers local connectivity and invariance structures.
Balances performance gain with network depth.
Abstract
In this paper, we focus on the unsupervised setting for structure learning of deep neural networks and propose to adopt the efficient coding principle, rooted in information theory and developed in computational neuroscience, to guide the procedure of structure learning without label information. This principle suggests that a good network structure should maximize the mutual information between inputs and outputs, or equivalently maximize the entropy of outputs under mild assumptions. We further establish connections between this principle and the theory of Bayesian optimal classification, and empirically verify that larger entropy of the outputs of a deep neural network indeed corresponds to a better classification accuracy. Then as an implementation of the principle, we show that sparse coding can effectively maximize the entropy of the output signals, and accordingly design an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Anomaly Detection Techniques and Applications · Machine Learning and Algorithms
