Discovering Multi-Label Actor-Action Association in a Weakly Supervised   Setting

Sovan Biswas; Juergen Gall

arXiv:2101.08567·cs.CV·January 22, 2021

Discovering Multi-Label Actor-Action Association in a Weakly Supervised Setting

Sovan Biswas, Juergen Gall

PDF

Open Access 1 Repo

TL;DR

This paper introduces a weakly supervised method for multi-label actor-action association in videos, effectively handling multiple persons and actions without bounding box annotations, using a novel set-based representation approach.

Contribution

It presents the first approach to address multi-label actor-action association in a weakly supervised setting with a set-based action representation.

Findings

01

Outperforms MIML baseline on AVA dataset

02

Competitive with fully supervised methods

03

Handles multiple persons and actions simultaneously

Abstract

Since collecting and annotating data for spatio-temporal action detection is very expensive, there is a need to learn approaches with less supervision. Weakly supervised approaches do not require any bounding box annotations and can be trained only from labels that indicate whether an action occurs in a video clip. Current approaches, however, cannot handle the case when there are multiple persons in a video that perform multiple actions at the same time. In this work, we address this very challenging task for the first time. We propose a baseline based on multi-instance and multi-label learning. Furthermore, we propose a novel approach that uses sets of actions as representation instead of modeling individual action classes. Since computing, the probabilities for the full power set becomes intractable as the number of action classes increases, we assign an action set to each detected…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sovan-biswas/MultiLabelActorActionAssignment
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Video Surveillance and Tracking Methods