Noise as a Resource for Learning in Knowledge Distillation

Elahe Arani; Fahad Sarfraz; Bahram Zonooz

arXiv:1910.05057·cs.LG·December 16, 2020

Noise as a Resource for Learning in Knowledge Distillation

Elahe Arani, Fahad Sarfraz, Bahram Zonooz

PDF

TL;DR

This paper explores how adding noise during knowledge distillation can enhance model training, robustness, and efficiency, inspired by neuroscience insights, and introduces three methods to leverage noise for improved deep learning outcomes.

Contribution

It introduces three novel noise-based methods for knowledge distillation addressing performance gap, robustness, and label noise challenges.

Findings

01

Noise improves knowledge distillation effectiveness.

02

Proposed methods outperform baseline approaches.

03

Noise facilitates training of robust and efficient models.

Abstract

While noise is commonly considered a nuisance in computing systems, a number of studies in neuroscience have shown several benefits of noise in the nervous system from enabling the brain to carry out computations such as probabilistic inference as well as carrying additional information about the stimuli. Similarly, noise has been shown to improve the performance of deep neural networks. In this study, we further investigate the effect of adding noise in the knowledge distillation framework because of its resemblance to collaborative subnetworks in the brain regions. We empirically show that injecting constructive noise at different levels in the collaborative learning framework enables us to train the model effectively and distill desirable characteristics in the student model. In doing so, we propose three different methods that target the common challenges in deep neural networks:…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsKnowledge Distillation