DeepTwist: Learning Model Compression via Occasional Weight Distortion

Dongsoo Lee; Parichay Kapoor; Byeongwook Kim

arXiv:1810.12823 · cs.LG · October 31, 2018 · 21 cites

Open Access

TL;DR

DeepTwist is a simple model compression framework that occasionally distorts weights during training without modifying the underlying training algorithm. The authors report improved compression rates for pruning, quantization, and low-rank approximation.

Contribution

The paper proposes occasional weight distortion as a compression mechanism that works alongside existing training algorithms rather than modifying them.

Findings

01 Improves compression rates for pruning, quantization, and low-rank approximation.

02 Reduces the need for retraining and hyper-parameter tuning.

03 Provides regularization benefits.
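
To make the compressed formats named in the findings concrete, here is a minimal sketch of two weight distortion functions, one for quantization and one for low-rank approximation. The function names `quantize_uniform` and `low_rank` are hypothetical, and uniform quantization and truncated SVD are common textbook choices assumed for illustration; this page does not say which distortion functions the paper actually uses.

```python
import numpy as np


def quantize_uniform(w, bits):
    """Snap each weight to the nearest of 2**bits evenly spaced levels.

    Assumed illustration only; not necessarily the paper's quantizer.
    """
    levels = 2 ** bits - 1          # number of intervals between levels
    lo, hi = w.min(), w.max()
    if hi == lo:                    # constant tensor: nothing to quantize
        return w.copy()
    step = (hi - lo) / levels
    return lo + np.round((w - lo) / step) * step


def low_rank(w, rank):
    """Replace a 2-D weight matrix by its best rank-`rank` approximation.

    Uses a truncated SVD; an assumed illustration, not the paper's method.
    """
    u, s, vt = np.linalg.svd(w, full_matrices=False)
    return (u[:, :rank] * s[:rank]) @ vt[:rank, :]


rng = np.random.default_rng(0)
w = rng.normal(size=(6, 5))
w_q = quantize_uniform(w, bits=2)   # at most 4 distinct values
w_lr = low_rank(w, rank=2)          # same shape as w, rank at most 2
```

Both functions return an array of the same shape as the input, so either could be applied to a weight tensor in place of the original values.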

Abstract

Model compression has been introduced to reduce the required hardware resources while maintaining the model accuracy. Lots of techniques for model compression, such as pruning, quantization, and low-rank approximation, have been suggested along with different inference implementation characteristics. Adopting model compression is, however, still challenging because the design complexity of model compression is rapidly increasing due to additional hyper-parameters and computation overhead in order to achieve a high compression ratio. In this paper, we propose a simple and efficient model compression framework called DeepTwist which distorts weights in an occasional manner without modifying the underlying training algorithms. The ideas of designing weight distortion functions are intuitive and straightforward given formats of compressed weights. We show that our proposed framework…
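
The idea described in the abstract, training as usual and only occasionally distorting the weights into a compressed format, can be sketched as follows. This is a toy illustration under stated assumptions, not the authors' implementation: the linear least-squares model, the names `prune_by_magnitude` and `train_with_occasional_distortion`, the distortion interval `distort_every`, and the `sparsity` value are all hypothetical, and magnitude pruning stands in for whichever distortion function is chosen.

```python
import numpy as np


def prune_by_magnitude(w, sparsity):
    """Assumed distortion function: zero the smallest-magnitude weights."""
    k = int(sparsity * w.size)      # how many weights to zero
    if k == 0:
        return w.copy()
    flat = w.copy().ravel()
    # Indices of the k entries with the smallest absolute value.
    idx = np.argpartition(np.abs(flat), k - 1)[:k]
    flat[idx] = 0.0
    return flat.reshape(w.shape)


def train_with_occasional_distortion(x, y, steps=300, lr=0.1,
                                     distort_every=50, sparsity=0.5, seed=0):
    """Plain gradient descent on least squares with periodic distortion."""
    rng = np.random.default_rng(seed)
    w = rng.normal(scale=0.1, size=x.shape[1])
    for step in range(1, steps + 1):
        # Ordinary training step; the update rule itself is unchanged.
        grad = 2.0 * x.T @ (x @ w - y) / len(y)
        w -= lr * grad
        # Every `distort_every` steps, overwrite w with its distorted version.
        if step % distort_every == 0:
            w = prune_by_magnitude(w, sparsity)
    return w


print(prune_by_magnitude(np.array([0.1, -3.0, 0.5, 2.0]), 0.5))
# prints [ 0. -3.  0.  2.]
```

In this sketch `steps` is a multiple of `distort_every`, so the last step is a distortion step and the returned weights are already in the pruned format. Between distortions the weights are free to move, including those previously set to zero.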

Taxonomy

Topics: Speech Recognition and Synthesis · Anomaly Detection Techniques and Applications · Advanced Data Compression Techniques