Cross-Modal Knowledge Transfer Without Task-Relevant Source Data

Sk Miraj Ahmed; Suhas Lohit; Kuan-Chuan Peng; Michael J. Jones and; Amit K. Roy-Chowdhury

arXiv:2209.04027·cs.CV·September 12, 2022

Cross-Modal Knowledge Transfer Without Task-Relevant Source Data

Sk Miraj Ahmed, Suhas Lohit, Kuan-Chuan Peng, Michael J. Jones and, Amit K. Roy-Chowdhury

PDF

Open Access

TL;DR

This paper introduces SOCKET, a novel method for transferring knowledge from a source model trained on RGB data to a target modality like depth or infrared, without access to source data, by reducing modality gap using paired data and feature normalization.

Contribution

The paper presents SOCKET, the first approach for source-free cross-modal knowledge transfer that effectively handles modality gaps without relying on source data.

Findings

01

SOCKET outperforms existing methods in classification tasks involving different modalities.

02

The method effectively reduces the modality gap using paired data and feature normalization.

03

Extensive experiments demonstrate significant improvements over prior source-free approaches.

Abstract

Cost-effective depth and infrared sensors as alternatives to usual RGB sensors are now a reality, and have some advantages over RGB in domains like autonomous navigation and remote sensing. As such, building computer vision and deep learning systems for depth and infrared data are crucial. However, large labeled datasets for these modalities are still lacking. In such cases, transferring knowledge from a neural network trained on a well-labeled large dataset in the source modality (RGB) to a neural network that works on a target modality (depth, infrared, etc.) is of great value. For reasons like memory and privacy, it may not be possible to access the source data, and knowledge transfer needs to work with only the source models. We describe an effective solution, SOCKET: SOurce-free Cross-modal KnowledgE Transfer for this challenging task of transferring knowledge from one source…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques