Test-Time Adaptation for Combating Missing Modalities in Egocentric   Videos

Merey Ramazanova; Alejandro Pardo; Bernard Ghanem; Motasem; Alfarra

arXiv:2404.15161·cs.CV·March 4, 2025

Test-Time Adaptation for Combating Missing Modalities in Egocentric Videos

Merey Ramazanova, Alejandro Pardo, Bernard Ghanem, Motasem, Alfarra

PDF

Open Access 1 Video

TL;DR

This paper introduces MiDl, a test-time adaptation method that enables models to handle missing modalities in egocentric videos without retraining, improving performance through mutual information minimization and self-distillation.

Contribution

The paper presents the first self-supervised, online test-time adaptation approach for missing modalities, eliminating the need for retraining models.

Findings

01

Significant performance gains on multiple datasets.

02

Effective handling of missing modalities without retraining.

03

Compatibility with various pretrained models.

Abstract

Understanding videos that contain multiple modalities is crucial, especially in egocentric videos, where combining various sensory inputs significantly improves tasks like action recognition and moment localization. However, real-world applications often face challenges with incomplete modalities due to privacy concerns, efficiency needs, or hardware issues. Current methods, while effective, often necessitate retraining the model entirely to handle missing modalities, making them computationally intensive, particularly with large training datasets. In this study, we propose a novel approach to address this issue at test time without requiring retraining. We frame the problem as a test-time adaptation task, where the model adjusts to the available unlabeled data at test time. Our method, MiDl~(Mutual information with self-Distillation), encourages the model to be insensitive to the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Test-Time Adaptation for Combating Missing Modalities in Egocentric Videos· slideslive

Taxonomy

TopicsMultimedia Communication and Technology · Video Analysis and Summarization