Robust Student Network Learning

Tianyu Guo; Chang Xu; Shiyi He; Boxin Shi; Chao Xu; and Dacheng Tao

arXiv:1807.11158·cs.LG·August 1, 2018·1 cites

Robust Student Network Learning

Tianyu Guo, Chang Xu, Shiyi He, Boxin Shi, Chao Xu, and Dacheng Tao

PDF

Open Access

TL;DR

This paper proposes a teacher-student learning framework that enhances the robustness of lightweight student neural networks against perturbations while maintaining high accuracy, addressing a gap in existing methods focused mainly on accuracy and compression.

Contribution

It introduces a novel approach that improves student network robustness by optimizing prediction confidence and gradient-based objectives during training.

Findings

01

Student networks become more robust to perturbations.

02

The method achieves a good balance of accuracy and compactness.

03

Experimental results validate the effectiveness of the approach.

Abstract

Deep neural networks bring in impressive accuracy in various applications, but the success often relies on the heavy network architecture. Taking well-trained heavy networks as teachers, classical teacher-student learning paradigm aims to learn a student network that is lightweight yet accurate. In this way, a portable student network with significantly fewer parameters can achieve a considerable accuracy which is comparable to that of teacher network. However, beyond accuracy, robustness of the learned student network against perturbation is also essential for practical uses. Existing teacher-student learning frameworks mainly focus on accuracy and compression ratios, but ignore the robustness. In this paper, we make the student network produce more confident predictions with the help of the teacher network, and analyze the lower bound of the perturbation that will destroy the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Machine Learning and Data Classification · Domain Adaptation and Few-Shot Learning