Cross-Modal Knowledge Transfer Without Task-Relevant Source Data
Sk Miraj Ahmed, Suhas Lohit, Kuan-Chuan Peng, Michael J. Jones and, Amit K. Roy-Chowdhury

TL;DR
This paper introduces SOCKET, a novel method for transferring knowledge from a source model trained on RGB data to a target modality like depth or infrared, without access to source data, by reducing modality gap using paired data and feature normalization.
Contribution
The paper presents SOCKET, the first approach for source-free cross-modal knowledge transfer that effectively handles modality gaps without relying on source data.
Findings
SOCKET outperforms existing methods in classification tasks involving different modalities.
The method effectively reduces the modality gap using paired data and feature normalization.
Extensive experiments demonstrate significant improvements over prior source-free approaches.
Abstract
Cost-effective depth and infrared sensors as alternatives to usual RGB sensors are now a reality, and have some advantages over RGB in domains like autonomous navigation and remote sensing. As such, building computer vision and deep learning systems for depth and infrared data are crucial. However, large labeled datasets for these modalities are still lacking. In such cases, transferring knowledge from a neural network trained on a well-labeled large dataset in the source modality (RGB) to a neural network that works on a target modality (depth, infrared, etc.) is of great value. For reasons like memory and privacy, it may not be possible to access the source data, and knowledge transfer needs to work with only the source models. We describe an effective solution, SOCKET: SOurce-free Cross-modal KnowledgE Transfer for this challenging task of transferring knowledge from one source…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques
