Make a Strong Teacher with Label Assistance: A Novel Knowledge Distillation Approach for Semantic Segmentation
Shoumeng Qiu, Jie Chen, Xinrun Li, Ru Wan, Xiangyang Xue, and Jian Pu

TL;DR
This paper presents a novel label-assisted knowledge distillation method for semantic segmentation that enhances lightweight teacher models without complex architectures or extra sensors, improving generalization across multiple datasets and models.
Contribution
Introduces a label noise-based training strategy for teachers and a dual-path consistency training to improve knowledge distillation for semantic segmentation.
Findings
Boosts teacher model performance with label noise and dual-path training.
Achieves superior results across five challenging datasets.
Enhances flexibility in teacher-student model selection.
Abstract
In this paper, we introduce a novel knowledge distillation approach for the semantic segmentation task. Unlike previous methods that rely on power-trained teachers or other modalities to provide additional knowledge, our approach does not require complex teacher models or information from extra sensors. Specifically, for the teacher model training, we propose to noise the label and then incorporate it into input to effectively boost the lightweight teacher performance. To ensure the robustness of the teacher model against the introduced noise, we propose a dual-path consistency training strategy featuring a distance loss between the outputs of two paths. For the student model training, we keep it consistent with the standard distillation for simplicity. Our approach not only boosts the efficacy of knowledge distillation but also increases the flexibility in selecting teacher and student…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEducational Technology and Assessment · Intelligent Tutoring Systems and Adaptive Learning · Natural Language Processing Techniques
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Max Pooling · Average Pooling · Dilated Convolution · Fully Convolutional Network · Pyramid Pooling Module · 1x1 Convolution · Auxiliary Classifier · PSPNet · Batch Normalization
