Non-target Divergence Hypothesis: Toward Understanding Domain Gaps in   Cross-Modal Knowledge Distillation

Yilong Chen; Zongyi Xu; Xiaoshui Huang; Shanshan Zhao and; Xinqi Jiang; Xinyu Gao; Xinbo Gao

arXiv:2409.02438·cs.CV·September 5, 2024

Non-target Divergence Hypothesis: Toward Understanding Domain Gaps in Cross-Modal Knowledge Distillation

Yilong Chen, Zongyi Xu, Xiaoshui Huang, Shanshan Zhao and, Xinqi Jiang, Xinyu Gao, Xinbo Gao

PDF

Open Access

TL;DR

This paper investigates how domain gaps between modalities affect cross-modal knowledge distillation, revealing that smaller distribution differences in non-target classes improve performance, supported by theoretical bounds and extensive experiments.

Contribution

It introduces the Non-Target Divergence Hypothesis (NTDH), providing a theoretical framework and empirical validation for understanding domain gaps in cross-modal knowledge distillation.

Findings

01

Domain gaps cause distribution differences in non-target classes.

02

Smaller non-target class differences lead to better distillation performance.

03

Theoretical bounds validate the impact of domain gaps on error.

Abstract

Compared to single-modal knowledge distillation, cross-modal knowledge distillation faces more severe challenges due to domain gaps between modalities. Although various methods have proposed various solutions to overcome these challenges, there is still limited research on how domain gaps affect cross-modal knowledge distillation. This paper provides an in-depth analysis and evaluation of this issue. We first introduce the Non-Target Divergence Hypothesis (NTDH) to reveal the impact of domain gaps on cross-modal knowledge distillation. Our key finding is that domain gaps between modalities lead to distribution differences in non-target classes, and the smaller these differences, the better the performance of cross-modal knowledge distillation. Subsequently, based on Vapnik-Chervonenkis (VC) theory, we derive the upper and lower bounds of the approximation error for cross-modal knowledge…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

MethodsKnowledge Distillation