Understanding Robustness in Teacher-Student Setting: A New Perspective

Zhuolin Yang; Zhaoxi Chen; Tiffany Cai; Xinyun Chen; Bo Li; Yuandong; Tian

arXiv:2102.13170·cs.LG·March 2, 2021

Understanding Robustness in Teacher-Student Setting: A New Perspective

Zhuolin Yang, Zhaoxi Chen, Tiffany Cai, Xinyun Chen, Bo Li, Yuandong, Tian

PDF

Open Access

TL;DR

This paper investigates the robustness of teacher-student neural network models against adversarial examples, revealing that student specialization within the data subspace correlates with robustness and providing insights for improving model resilience.

Contribution

It extends Tian (2019) to low-rank data, showing how student specialization relates to robustness and differences between teacher and student nodes outside the data subspace.

Findings

01

Student specialization correlates with model robustness.

02

Teacher and student nodes differ outside the data subspace, potentially leading to adversarial examples.

03

Various training methods show different robustness levels linked to student specialization.

Abstract

Adversarial examples have appeared as a ubiquitous property of machine learning models where bounded adversarial perturbation could mislead the models to make arbitrarily incorrect predictions. Such examples provide a way to assess the robustness of machine learning models as well as a proxy for understanding the model training process. Extensive studies try to explain the existence of adversarial examples and provide ways to improve model robustness (e.g. adversarial training). While they mostly focus on models trained on datasets with predefined labels, we leverage the teacher-student framework and assume a teacher model, or oracle, to provide the labels for given instances. We extend Tian (2019) in the case of low-rank input data and show that student specialization (trained student neuron is highly correlated with certain teacher neuron at the same layer) still happens within the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning