Understanding Robustness in Teacher-Student Setting: A New Perspective
Zhuolin Yang, Zhaoxi Chen, Tiffany Cai, Xinyun Chen, Bo Li, Yuandong, Tian

TL;DR
This paper investigates the robustness of teacher-student neural network models against adversarial examples, revealing that student specialization within the data subspace correlates with robustness and providing insights for improving model resilience.
Contribution
It extends Tian (2019) to low-rank data, showing how student specialization relates to robustness and differences between teacher and student nodes outside the data subspace.
Findings
Student specialization correlates with model robustness.
Teacher and student nodes differ outside the data subspace, potentially leading to adversarial examples.
Various training methods show different robustness levels linked to student specialization.
Abstract
Adversarial examples have appeared as a ubiquitous property of machine learning models where bounded adversarial perturbation could mislead the models to make arbitrarily incorrect predictions. Such examples provide a way to assess the robustness of machine learning models as well as a proxy for understanding the model training process. Extensive studies try to explain the existence of adversarial examples and provide ways to improve model robustness (e.g. adversarial training). While they mostly focus on models trained on datasets with predefined labels, we leverage the teacher-student framework and assume a teacher model, or oracle, to provide the labels for given instances. We extend Tian (2019) in the case of low-rank input data and show that student specialization (trained student neuron is highly correlated with certain teacher neuron at the same layer) still happens within the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
