Neural Network Compression Via Sparse Optimization

Tianyi Chen; Bo Ji; Yixin Shi; Tianyu Ding; Biyi Fang; Sheng Yi; Xiao; Tu

arXiv:2011.04868·cs.LG·November 12, 2020·6 cites

Neural Network Compression Via Sparse Optimization

Tianyi Chen, Bo Ji, Yixin Shi, Tianyu Ding, Biyi Fang, Sheng Yi, Xiao, Tu

PDF

Open Access

TL;DR

This paper introduces a neural network compression method based on sparse stochastic optimization, achieving significant FLOPs reduction with minimal accuracy loss, and is adaptable to various applications.

Contribution

It presents a novel model compression framework leveraging sparse stochastic optimization, which is easier to implement across different scenarios than existing heuristic methods.

Findings

01

Achieved up to 7.2x FLOPs reduction on VGG16 for CIFAR10.

02

Achieved up to 2.9x FLOPs reduction on ResNet50 for ImageNet.

03

Demonstrated effectiveness on benchmark compression tasks.

Abstract

The compression of deep neural networks (DNNs) to reduce inference cost becomes increasingly important to meet realistic deployment requirements of various applications. There have been a significant amount of work regarding network compression, while most of them are heuristic rule-based or typically not friendly to be incorporated into varying scenarios. On the other hand, sparse optimization yielding sparse solutions naturally fits the compression requirement, but due to the limited study of sparse optimization in stochastic learning, its extension and application onto model compression is rarely well explored. In this work, we propose a model compression framework based on the recent progress on sparse stochastic optimization. Compared to existing model compression techniques, our method is effective and requires fewer extra engineering efforts to incorporate with varying…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and Algorithms