SUMMIT: Source-Free Adaptation of Uni-Modal Models to Multi-Modal Targets
Cody Simons, Dripta S. Raychaudhuri, Sk Miraj Ahmed, Suya You,, Konstantinos Karydis, Amit K. Roy-Chowdhury

TL;DR
SUMMIT enables source-free adaptation of independently trained uni-modal models to multi-modal data in unlabeled target domains, using a switching framework that intelligently combines agreement filtering and entropy weighting, improving semantic segmentation performance.
Contribution
This work introduces a novel source-free adaptation method for uni-modal models to multi-modal targets, relaxing source data and paired data assumptions.
Findings
Achieves up to 12% mIoU improvement over baselines.
Performs comparably or better than methods with source data access.
Validated across seven challenging domain adaptation scenarios.
Abstract
Scene understanding using multi-modal data is necessary in many applications, e.g., autonomous navigation. To achieve this in a variety of situations, existing models must be able to adapt to shifting data distributions without arduous data annotation. Current approaches assume that the source data is available during adaptation and that the source consists of paired multi-modal data. Both these assumptions may be problematic for many applications. Source data may not be available due to privacy, security, or economic concerns. Assuming the existence of paired multi-modal data for training also entails significant data collection costs and fails to take advantage of widely available freely distributed pre-trained uni-modal models. In this work, we relax both of these assumptions by addressing the problem of adapting a set of models trained independently on uni-modal data to a target…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
SUMMIT: Source-Free Adaptation of Uni-Modal Models to Multi-Modal Targets· youtube
Taxonomy
TopicsMultimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
