Deep neural networks are robust to weight binarization and other   non-linear distortions

Paul Merolla; Rathinakumar Appuswamy; John Arthur; Steve K. Esser,; Dharmendra Modha

arXiv:1606.01981·cs.NE·June 8, 2016·79 cites

Deep neural networks are robust to weight binarization and other non-linear distortions

Paul Merolla, Rathinakumar Appuswamy, John Arthur, Steve K. Esser,, Dharmendra Modha

PDF

Open Access

TL;DR

Deep neural networks are highly robust to various weight distortions beyond quantization, maintaining performance even under significant noise and non-linear transformations, which can be exploited to improve efficiency and robustness.

Contribution

The paper demonstrates that neural networks are robust to a wide range of weight distortions and introduces new training methods that enhance this robustness, achieving state-of-the-art results.

Findings

01

Networks maintain low error with high noise levels

02

Training with alternative weight projections is effective

03

Proposed stochastic projection achieves new state-of-the-art on CIFAR-10

Abstract

Recent results show that deep neural networks achieve excellent performance even when, during training, weights are quantized and projected to a binary representation. Here, we show that this is just the tip of the iceberg: these same networks, during testing, also exhibit a remarkable robustness to distortions beyond quantization, including additive and multiplicative noise, and a class of non-linear projections where binarization is just a special case. To quantify this robustness, we show that one such network achieves 11% test error on CIFAR-10 even with 0.68 effective bits per weight. Furthermore, we find that a common training heuristic--namely, projecting quantized weights during backpropagation--can be altered (or even removed) and networks still achieve a base level of robustness during testing. Specifically, training with weight projections other than quantization also works,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning