Triple-View Knowledge Distillation for Semi-Supervised Semantic Segmentation

Ping Li; Junjie Chen; Li Yuan; Xianghua Xu; Mingli Song

arXiv:2309.12557·cs.CV·July 1, 2025·5 cites

Triple-View Knowledge Distillation for Semi-Supervised Semantic Segmentation

Ping Li, Junjie Chen, Li Yuan, Xianghua Xu, Mingli Song

PDF

Open Access

TL;DR

This paper introduces TriKD, a semi-supervised semantic segmentation framework using three diverse encoders and a dual-frequency decoder to improve feature diversity and reduce redundancy, achieving better accuracy and efficiency.

Contribution

The paper proposes a novel triple-view encoder with different architectures and a dual-frequency decoder with attention, enhancing feature diversity and reducing memory costs in semi-supervised segmentation.

Findings

01

Outperforms existing methods on Pascal VOC 2012 and Cityscapes benchmarks.

02

Achieves a good balance between segmentation accuracy and inference speed.

03

Effectively utilizes diverse features through triple-view distillation and frequency domain attention.

Abstract

To alleviate the expensive human labeling, semi-supervised semantic segmentation employs a few labeled images and an abundant of unlabeled images to predict the pixel-level label map with the same size. Previous methods often adopt co-training using two convolutional networks with the same architecture but different initialization, which fails to capture the sufficiently diverse features. This motivates us to use tri-training and develop the triple-view encoder to utilize the encoders with different architectures to derive diverse features, and exploit the knowledge distillation skill to learn the complementary semantics among these encoders. Moreover, existing methods simply concatenate the features from both encoder and decoder, resulting in redundant features that require large memory cost. This inspires us to devise a dual-frequency decoder that selects those important features by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques

MethodsKnowledge Distillation