Uncovering the Hidden Dynamics of Video Self-supervised Learning under   Distribution Shifts

Pritam Sarkar; Ahmad Beirami; Ali Etemad

arXiv:2306.02014·cs.CV·October 31, 2023·2 cites

Uncovering the Hidden Dynamics of Video Self-supervised Learning under Distribution Shifts

Pritam Sarkar, Ahmad Beirami, Ali Etemad

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper investigates how six popular video self-supervised learning methods behave under various natural distribution shifts, revealing their strengths and weaknesses to guide future development of robust video representations.

Contribution

The study provides a comprehensive analysis of six VSSL methods under multiple distribution shifts using a new benchmark, highlighting their robustness and limitations.

Findings

01

v-MAE and supervised learning are more robust to context shifts.

02

v-MAE excels as a temporal learner.

03

Contrastive methods perform well against viewpoint shifts.

Abstract

Video self-supervised learning (VSSL) has made significant progress in recent years. However, the exact behavior and dynamics of these models under different forms of distribution shift are not yet known. In this paper, we comprehensively study the behavior of six popular self-supervised methods (v-SimCLR, v-MoCo, v-BYOL, v-SimSiam, v-DINO, v-MAE) in response to various forms of natural distribution shift, i.e., (i) context shift, (ii) viewpoint shift, (iii) actor shift, (iv) source shift, (v) generalizability to unknown classes (zero-shot), and (vi) open-set recognition. To perform this extensive study, we carefully craft a test bed consisting of 17 in-distribution and out-of-distribution benchmark pairs using available public datasets and a series of evaluation protocols to stress-test the different methods under the intended shifts. Our study uncovers a series of intriguing findings…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

pritamqu/OOD-VSSL
pytorch

Videos

Uncovering the Hidden Dynamics of Video Self-supervised Learning under Distribution Shifts· slideslive

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Cancer-related molecular mechanisms research · Human Pose and Action Recognition