Noise as a Resource for Learning in Knowledge Distillation
Elahe Arani, Fahad Sarfraz, Bahram Zonooz

TL;DR
This paper explores how adding noise during knowledge distillation can enhance model training, robustness, and efficiency, inspired by neuroscience insights, and introduces three methods to leverage noise for improved deep learning outcomes.
Contribution
It introduces three novel noise-based methods for knowledge distillation addressing performance gap, robustness, and label noise challenges.
Findings
Noise improves knowledge distillation effectiveness.
Proposed methods outperform baseline approaches.
Noise facilitates training of robust and efficient models.
Abstract
While noise is commonly considered a nuisance in computing systems, a number of studies in neuroscience have shown several benefits of noise in the nervous system from enabling the brain to carry out computations such as probabilistic inference as well as carrying additional information about the stimuli. Similarly, noise has been shown to improve the performance of deep neural networks. In this study, we further investigate the effect of adding noise in the knowledge distillation framework because of its resemblance to collaborative subnetworks in the brain regions. We empirically show that injecting constructive noise at different levels in the collaborative learning framework enables us to train the model effectively and distill desirable characteristics in the student model. In doing so, we propose three different methods that target the common challenges in deep neural networks:…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsKnowledge Distillation
