SUMMIT: Source-Free Adaptation of Uni-Modal Models to Multi-Modal   Targets

Cody Simons; Dripta S. Raychaudhuri; Sk Miraj Ahmed; Suya You,; Konstantinos Karydis; Amit K. Roy-Chowdhury

arXiv:2308.11880·cs.CV·August 24, 2023

SUMMIT: Source-Free Adaptation of Uni-Modal Models to Multi-Modal Targets

Cody Simons, Dripta S. Raychaudhuri, Sk Miraj Ahmed, Suya You,, Konstantinos Karydis, Amit K. Roy-Chowdhury

PDF

Open Access 1 Repo 1 Video

TL;DR

SUMMIT enables source-free adaptation of independently trained uni-modal models to multi-modal data in unlabeled target domains, using a switching framework that intelligently combines agreement filtering and entropy weighting, improving semantic segmentation performance.

Contribution

This work introduces a novel source-free adaptation method for uni-modal models to multi-modal targets, relaxing source data and paired data assumptions.

Findings

01

Achieves up to 12% mIoU improvement over baselines.

02

Performs comparably or better than methods with source data access.

03

Validated across seven challenging domain adaptation scenarios.

Abstract

Scene understanding using multi-modal data is necessary in many applications, e.g., autonomous navigation. To achieve this in a variety of situations, existing models must be able to adapt to shifting data distributions without arduous data annotation. Current approaches assume that the source data is available during adaptation and that the source consists of paired multi-modal data. Both these assumptions may be problematic for many applications. Source data may not be available due to privacy, security, or economic concerns. Assuming the existence of paired multi-modal data for training also entails significant data collection costs and fails to take advantage of widely available freely distributed pre-trained uni-modal models. In this work, we relax both of these assumptions by addressing the problem of adapting a set of models trained independently on uni-modal data to a target…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

csimo005/summit
pytorchOfficial

Videos

SUMMIT: Source-Free Adaptation of Uni-Modal Models to Multi-Modal Targets· youtube

Taxonomy

TopicsMultimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning