Why Can't I Dance in the Mall? Learning to Mitigate Scene Bias in Action   Recognition

Jinwoo Choi; Chen Gao; Joseph C. E. Messou; Jia-Bin Huang

arXiv:1912.05534·cs.CV·December 12, 2019·99 cites

Why Can't I Dance in the Mall? Learning to Mitigate Scene Bias in Action Recognition

Jinwoo Choi, Chen Gao, Joseph C. E. Messou, Jia-Bin Huang

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel approach to reduce scene bias in video action recognition by using adversarial and human mask confusion losses, leading to improved generalization across multiple tasks.

Contribution

It proposes a new debiasing method that mitigates scene bias in video representations, enhancing transferability to various action recognition tasks.

Findings

01

Consistent improvement over baseline models in multiple tasks

02

Effective reduction of scene bias in learned representations

03

Enhanced generalization to new action classes and tasks

Abstract

Human activities often occur in specific scene contexts, e.g., playing basketball on a basketball court. Training a model using existing video datasets thus inevitably captures and leverages such bias (instead of using the actual discriminative cues). The learned representation may not generalize well to new action classes or different tasks. In this paper, we propose to mitigate scene bias for video representation learning. Specifically, we augment the standard cross-entropy loss for action classification with 1) an adversarial loss for scene types and 2) a human mask confusion loss for videos where the human actors are masked out. These two losses encourage learning representations that are unable to predict the scene types and the correct actions when there is no evidence. We validate the effectiveness of our method by transferring our pre-trained model to three different tasks,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

vt-vl-lab/SDN
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications