Intra-class Patch Swap for Self-Distillation

Hongjun Choi; Eun Som Jeon; Ankita Shukla; Pavan Turaga

arXiv:2505.14124·cs.CV·May 21, 2025

Intra-class Patch Swap for Self-Distillation

Hongjun Choi, Eun Som Jeon, Ankita Shukla, Pavan Turaga

PDF

1 Repo

TL;DR

This paper introduces a teacher-free self-distillation method using intra-class patch swap augmentation, which generates intra-class sample pairs within a single model to improve performance across multiple vision tasks.

Contribution

The paper proposes a simple, model-agnostic intra-class patch swap augmentation for self-distillation that eliminates the need for auxiliary components or complex training procedures.

Findings

01

Outperforms existing self-distillation methods and teacher-based KD approaches.

02

Effective across image classification, semantic segmentation, and object detection.

03

Highlights the importance of augmentation design in self-distillation success.

Abstract

Knowledge distillation (KD) is a valuable technique for compressing large deep learning models into smaller, edge-suitable networks. However, conventional KD frameworks rely on pre-trained high-capacity teacher networks, which introduce significant challenges such as increased memory/storage requirements, additional training costs, and ambiguity in selecting an appropriate teacher for a given student model. Although a teacher-free distillation (self-distillation) has emerged as a promising alternative, many existing approaches still rely on architectural modifications or complex training procedures, which limit their generality and efficiency. To address these limitations, we propose a novel framework based on teacher-free distillation that operates using a single student network without any auxiliary components, architectural modifications, or additional learnable parameters. Our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hchoi71/intra-class-patch-swap
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsALIGN