Learning Student Networks via Feature Embedding

Hanting Chen; Yunhe Wang; Chang Xu; Chao Xu; Dacheng Tao

arXiv:1812.06597 · cs.LG · December 18, 2018 · 6 cites

Open Access

TL;DR

This paper introduces a feature embedding approach for knowledge distillation that enables training lightweight student networks without additional parameters, maintaining high performance with lower computational and storage costs.

Contribution

It proposes a novel feature embedding method with a locality preserving loss that transfers knowledge from teacher to student without auxiliary layers.

Findings

01. Outperforms state-of-the-art methods on benchmark datasets

02. Reduces computational complexity significantly

03. Maintains high accuracy comparable to teacher networks

Abstract

Deep convolutional neural networks have been widely used in numerous applications, but their demanding storage and computational requirements hinder their deployment on mobile devices. Knowledge distillation aims to optimize a portable student network by transferring knowledge from a well-trained, heavy teacher network. Traditional teacher-student methods rely on additional fully-connected layers to bridge the intermediate layers of the teacher and student networks, which introduces a large number of auxiliary parameters. In contrast, this paper aims to propagate information from teacher to student without introducing new variables that need to be optimized. We regard the teacher-student paradigm from a new perspective of feature embedding. By introducing the locality preserving loss, the student network is encouraged to generate low-dimensional features which could…
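
The abstract is cut off before it defines the loss, so the exact formulation is not available here. As a rough illustration of what a locality preserving objective generally looks like, the sketch below follows the classic Locality Preserving Projections idea: samples that are neighbours in the teacher's high-dimensional feature space are penalised for being far apart in the student's low-dimensional space. The function names, the nearest-neighbour rule, the Gaussian weighting, and the hyperparameters `k` and `sigma` are all assumptions made for this example and may differ from the paper's definition. It uses plain Python lists for clarity; a real training loop would compute the same quantity on tensors so that gradients reach the student.

```python
import math


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def locality_preserving_loss(teacher_feats, student_feats, k=2, sigma=1.0):
    """Hypothetical LPP-style loss; not necessarily the paper's exact formula.

    teacher_feats: list of n high-dimensional teacher feature vectors.
    student_feats: list of n low-dimensional student feature vectors.
    k, sigma: assumed hyperparameters (neighbourhood size, kernel width).
    """
    n = len(teacher_feats)
    if n < 2:
        return 0.0  # no pairs to compare
    k = min(k, n - 1)
    total = 0.0
    for i in range(n):
        # Rank the other samples by distance in the teacher's feature space.
        others = [j for j in range(n) if j != i]
        others.sort(key=lambda j: sq_dist(teacher_feats[i], teacher_feats[j]))
        for j in others[:k]:
            # Closer teacher-space neighbours get a larger weight.
            w = math.exp(-sq_dist(teacher_feats[i], teacher_feats[j]) / sigma ** 2)
            # Penalise student features that pull those neighbours apart.
            total += w * sq_dist(student_feats[i], student_feats[j])
    return total / (2 * n * k)


# Toy batch: samples 0/1 and samples 2/3 are neighbours in teacher space.
teacher = [[0.0, 0.0, 0.0], [0.1, 0.0, 0.0], [5.0, 5.0, 5.0], [5.1, 5.0, 5.0]]
good = [[0.0], [0.1], [3.0], [3.1]]  # keeps teacher neighbours together
bad = [[0.0], [3.0], [0.1], [3.1]]   # splits teacher neighbours apart

loss_good = locality_preserving_loss(teacher, good, k=1)
loss_bad = locality_preserving_loss(teacher, bad, k=1)
print(loss_good < loss_bad)
```

Note that the loss only compares student features with each other, weighted by teacher-space distances, so the two feature spaces never need to share a dimensionality. That is consistent with the abstract's point that no auxiliary bridging layers or extra trainable parameters are required.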

Taxonomy

Topics: Advanced Neural Network Applications · Machine Learning and ELM · Machine Learning and Data Classification