NestedNet: Learning Nested Sparse Structures in Deep Neural Networks
Eunwoo Kim, Chanho Ahn, Songhwai Oh

TL;DR
NestedNet introduces a multi-level sparse neural network architecture that shares parameters across levels, enabling resource adaptability and multi-task learning within a single model, improving efficiency and versatility.
Contribution
It proposes a novel nested sparse network framework with shared parameters, allowing resource-aware adaptation and multi-task learning in deep neural networks.
Findings
Performs competitively in compression and distillation tasks
Achieves resource adaptability for diverse device constraints
Enables multi-task learning within a single network
Abstract
Recently, there have been increasing demands to construct compact deep architectures to remove unnecessary redundancy and to improve the inference speed. While many recent works focus on reducing the redundancy by eliminating unneeded weight parameters, it is not possible to apply a single deep architecture for multiple devices with different resources. When a new device or circumstantial condition requires a new deep architecture, it is necessary to construct and train a new network from scratch. In this work, we propose a novel deep learning framework, called a nested sparse network, which exploits an n-in-1-type nested structure in a neural network. A nested sparse network consists of multiple levels of networks with a different sparsity ratio associated with each level, and higher level networks share parameters with lower level networks to enable stable nested learning. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Machine Learning and Data Classification · Machine Learning and ELM
