Embedding Differentiable Sparsity into Deep Neural Network

Yongjin Lee

arXiv:2006.13716·cs.LG·June 25, 2020

Embedding Differentiable Sparsity into Deep Neural Network

Yongjin Lee

PDF

Open Access

TL;DR

This paper introduces a method to embed differentiable sparsity into deep neural networks, enabling simultaneous learning of network structure and weights with exact zero parameters during training.

Contribution

It presents a novel approach that allows neural networks to learn both sparse structures and weights simultaneously through differentiable sparsity embedding.

Findings

01

Supports both structured and unstructured sparsity

02

Allows exact zero parameters during training

03

Enables simultaneous learning of structure and weights

Abstract

In this paper, we propose embedding sparsity into the structure of deep neural networks, where model parameters can be exactly zero during training with the stochastic gradient descent. Thus, it can learn the sparsified structure and the weights of networks simultaneously. The proposed approach can learn structured as well as unstructured sparsity.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Model Reduction and Neural Networks · Generative Adversarial Networks and Image Synthesis