NestedNet: Learning Nested Sparse Structures in Deep Neural Networks

Eunwoo Kim; Chanho Ahn; Songhwai Oh

arXiv:1712.03781 · cs.CV · March 28, 2018 · 1 cites

TL;DR

NestedNet introduces a multi-level sparse neural network architecture that shares parameters across levels, enabling resource adaptability and multi-task learning within a single model, improving efficiency and versatility.

Contribution

The paper proposes a nested sparse network framework in which the levels share parameters, allowing resource-aware adaptation and multi-task learning in deep neural networks.

Findings

01. Performs competitively in compression and distillation tasks

02. Achieves resource adaptability for diverse device constraints

03. Enables multi-task learning within a single network

Abstract

Recently, there have been increasing demands to construct compact deep architectures to remove unnecessary redundancy and to improve the inference speed. While many recent works focus on reducing the redundancy by eliminating unneeded weight parameters, it is not possible to apply a single deep architecture for multiple devices with different resources. When a new device or circumstantial condition requires a new deep architecture, it is necessary to construct and train a new network from scratch. In this work, we propose a novel deep learning framework, called a nested sparse network, which exploits an n-in-1-type nested structure in a neural network. A nested sparse network consists of multiple levels of networks with a different sparsity ratio associated with each level, and higher level networks share parameters with lower level networks to enable stable nested learning. The…
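
The nested structure described in the abstract can be illustrated with a minimal sketch. It makes several assumptions that the text above does not confirm: each level is represented by a 0/1 mask over one shared weight vector, the masks are chosen by weight magnitude, and levels are ordered from sparsest to densest. The names `nested_masks`, `forward`, and `keep_fractions` are hypothetical and are not taken from the paper.

```python
import random

def nested_masks(weights, keep_fractions):
    """Build one 0/1 mask per level; each mask contains the previous one."""
    # Rank weight indices by magnitude, largest first.
    # (Magnitude ranking is an illustrative assumption, not the paper's stated criterion.)
    order = sorted(range(len(weights)), key=lambda i: -abs(weights[i]))
    masks = []
    for frac in sorted(keep_fractions):
        # Taking a longer prefix of the same ranking guarantees nesting.
        kept = set(order[:max(1, round(frac * len(weights)))])
        masks.append([1 if i in kept else 0 for i in range(len(weights))])
    return masks

def forward(weights, mask, x):
    """Dot product using only the weights that are active at the chosen level."""
    return sum(w * m * xi for w, m, xi in zip(weights, mask, x))

random.seed(0)
weights = [random.uniform(-1, 1) for _ in range(8)]  # one shared parameter vector
masks = nested_masks(weights, keep_fractions=[0.25, 0.5, 1.0])
x = [1.0] * 8
outputs = [forward(weights, m, x) for m in masks]  # one output per level
```

Because every level reads from the same `weights` list, picking a level for a given device amounts to picking a mask, with no separate network to store or retrain.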

Taxonomy

Topics: Advanced Neural Network Applications · Machine Learning and Data Classification · Machine Learning and ELM