A Generalization Theory of Cross-Modality Distillation with Contrastive Learning

Hangyu Lin; Chen Liu; Chengming Xu; Zhengqi Gao; Yanwei Fu; Yuan Yao

arXiv:2405.03355·cs.LG·May 29, 2024

Open Access

TL;DR

This paper introduces a theoretical framework for cross-modality contrastive distillation, providing convergence analysis and demonstrating improved performance across various modalities and tasks.

Contribution

It formulates a general contrastive distillation framework and offers the first convergence analysis linking modality distance to downstream task error.

Findings

01. Outperforms existing methods by 2-3% across multiple modalities.

02. Provides theoretical insights into the impact of modality distance on test error.

03. Validates the framework through extensive experiments on recognition and segmentation tasks.

Abstract

Cross-modality distillation arises as an important topic for data modalities containing limited knowledge such as depth maps and high-quality sketches. Such techniques are of great importance, especially for memory and privacy-restricted scenarios where labeled training data is generally unavailable. To solve the problem, existing label-free methods leverage a few pairwise unlabeled data to distill the knowledge by aligning features or statistics between the source and target modalities. For instance, one typically aims to minimize the L2 distance or contrastive loss between the learned features of pairs of samples in the source (e.g. image) and the target (e.g. sketch) modalities. However, most algorithms in this domain only focus on the experimental results but lack theoretical insight. To bridge the gap between the theory and practical method of cross-modality distillation, we first…
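
The two label-free objectives mentioned in the abstract, an L2 distance and a contrastive loss between features of paired source and target samples, can be illustrated with a minimal NumPy sketch. This is not the paper's implementation: the function names, the InfoNCE-style form of the contrastive loss, and the `temperature` value are assumptions chosen for illustration, and the feature matrices stand in for the outputs of a frozen source encoder and a trainable target encoder.

```python
import numpy as np


def l2_normalize(x, eps=1e-12):
    """Scale each row to unit length (eps avoids division by zero)."""
    return x / (np.linalg.norm(x, axis=1, keepdims=True) + eps)


def l2_alignment_loss(source_feats, target_feats):
    """Mean squared L2 distance between paired features.

    Row i of source_feats (e.g. image) is paired with row i of
    target_feats (e.g. sketch).
    """
    diff = source_feats - target_feats
    return float(np.mean(np.sum(diff ** 2, axis=1)))


def contrastive_loss(source_feats, target_feats, temperature=0.1):
    """InfoNCE-style loss (an assumed form, not taken from the paper).

    For target sample i, the paired source sample i is the positive and
    every other source sample in the batch is a negative.
    """
    s = l2_normalize(source_feats)
    t = l2_normalize(target_feats)
    logits = t @ s.T / temperature                        # (n, n) similarities
    logits = logits - logits.max(axis=1, keepdims=True)   # numerical stability
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return float(-np.mean(np.diag(log_probs)))            # positives on the diagonal


# Hypothetical features: 8 paired samples, 16 dimensions each.
rng = np.random.default_rng(0)
source = rng.normal(size=(8, 16))
aligned_target = source.copy()                  # perfectly aligned pairs
shuffled_target = np.roll(source, 1, axis=0)    # every pair mismatched

loss_l2_aligned = l2_alignment_loss(source, aligned_target)
loss_nce_aligned = contrastive_loss(source, aligned_target)
loss_nce_shuffled = contrastive_loss(source, shuffled_target)
```

In this sketch both losses are small when the target features match their paired source features, and the contrastive loss grows when the pairing is broken, which is the signal a distillation procedure of this kind would use to train the target encoder.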

Taxonomy

Topics: Neural Networks and Applications · Fault Detection and Control Systems · Metaheuristic Optimization Algorithms Research

Methods: Contrastive Learning · Focus