Deep Model Compression: Distilling Knowledge from Noisy Teachers

Bharat Bhusan Sau; Vineeth N. Balasubramanian

arXiv:1610.09650·cs.LG·November 3, 2016·99 cites

Deep Model Compression: Distilling Knowledge from Noisy Teachers

Bharat Bhusan Sau, Vineeth N. Balasubramanian

PDF

Open Access

TL;DR

This paper introduces a noise-regularized teacher-student framework for deep model compression that improves runtime and training efficiency, demonstrated through experiments on CIFAR-10, SVHN, and MNIST datasets.

Contribution

It extends the teacher-student framework by incorporating noise-based regularization to enhance student network performance and efficiency.

Findings

01

Improved student network performance with noise regularization.

02

Effective on multiple datasets including CIFAR-10, SVHN, and MNIST.

03

Promising empirical results demonstrating the approach's potential.

Abstract

The remarkable successes of deep learning models across various applications have resulted in the design of deeper networks that can solve complex problems. However, the increasing depth of such models also results in a higher storage and runtime complexity, which restricts the deployability of such very deep models on mobile and portable devices, which have limited storage and battery capacity. While many methods have been proposed for deep model compression in recent years, almost all of them have focused on reducing storage complexity. In this work, we extend the teacher-student framework for deep model compression, since it has the potential to address runtime and train time complexity too. We propose a simple methodology to include a noise-based regularizer while training the student from the teacher, which provides a healthy improvement in the performance of the student network.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Machine Learning and Data Classification · Algorithms and Data Compression